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NOVEL POLYPEPTIDES AND NUCLEIC ACIDS ENCODING THE SAME 

FIFT Pr^F^"" INVFNTION 
The present invenuon relates generally to the .demificanon and .solauon of novel DNA and to the 
recombinant production of novel polypeptides encoded by that DNA 

5 

p^r|rrrp^^lNr> Of THE INVENTION 
Extracellular protons play an important role in the formanon. differenuat.on and maintenance of 
mukicellular organisms. The fate of m.ny mdividual cells, e.g.. pioliferatton. migration, differentiation, or 
interaction with other cells, is typically governed by informac.on received from other cells and/or the immediate 
10 environment. TTus informaoon .s often aansmined by secreted polypeptides (for instance, mitogenic factors, survival 
factors, cytotoxic factors, d.ffcrennat.on factors, neuropeptides, and hormones) which are. m turn, received and 
interpreted by diverse cell receptors or membrane-bound proteuis. These secreted polypeptides or signalmg 
molecules normally pass through the cellular secretory pathway to reach their site of acnon in the extracellular 
environment. 

15 Secreted proteins have various iiKlustrial applicauons. including pharmaceuucals. diagnosucs. biosensors 

and bioreactors. Most protein dnigs available at present, such as thrombolyi.c agents, interferons, mterleukins. 
erythropoietins, colony stimulating factors, and various other oy.ok.nes. are secretory proteins. TTieir receptors, 
which are membrane proteins, also have potential as therapem.c or diagnost.c agents. Efforts are being undertaken 
by bodi indusny and academia .o .dentify new. native secreted prote.ns. Many efforts are focused on the screemng 
20 of mammalian recombinant DNA libraries to identify the codmg sequences for novel secreted proteins. Examples 
of screening medKxls and techniques are described in the literature [see. for example. Klem ei al.. Prw. Nai l . AC8< I 
SsL, 22:7108-7113 (1996); U.S. Patem No. 5,536.637)1. 

Membrane-bound proteins and receptors can play an important role m the formation, differentiation and 
maintenance of multicellular organisms. Tlte fate of many individual cells, e.g.. proliferat.on. migration. 
25 differentiation, or interaction with other cells, is typically governed by informauon received from other cells and/or 
the immediate cnviromnent. This information is often transmined by secre.ed polypepudes (for insumce. mitogenic 
fectors. survival factors, cytotoxic factors, differentiation factors, neuropeptides, and hormones) which are. m turn, 
received and inttrpieted by diverse cell receptors or membrane-bound prote.ns. Such membrane-bound protems and 
ceU receptors include, but are not limited to. cytokine receptors, receptor kinases, receptor phosphauues. receptors 
30 involved in celKell interactions, and cellular adhcsin molecules like select.ns and miegrins. For instance, 
transduction of signals that regulate cell growth and differentiation is regulated in part by phosphorylauon of various 
cellular proteins. Protein tyrosine kinases, enzymes that catalyze that process, can also act as growth factor 
receptors. Examples include fibroblast growth factor receptor and nerve growth factor receptor. 

Membrane-bound proteins and receptor molecules have various industrial applicaoons. includmg as 
35 pharmaceutical and diagnostK agents. Receptor immunoadhesins. for instance, can be employed as therapeuuc agents 

1 
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u, block reccptor-bgand ,n.eracti^„. The membrane-bound prcems can also be emXd for scrccmng of pcenual 
pepddc or small molecule inhibitor, of *e relevant receptor/hgand mteracuon. Effort, are bems undertaken by both 
mdustry and academia to idenUfy new. native receptor proteins. Many efforts arc focused on the screening of 
mammalian recombinant DNA Ubraries to identify the coding sequences for novel receptor protems. 

We herem describe the idendficaoon and chaiacterizadon of novel secreted and transmembrane polypepudes 
and novel nucleic acids encoding those polypeptides. 

1- PR0213 

Human growth arrest-specific gene 6 (gas6) encodes a protein that is expressed in a vanefy of differenl 
..ssues and which has been reported to be highly expressed during periods of sentm smrvation and negatively 
regulated dunng growth induction. See Manfiolett. et al.. Mot. Cell. B,oi 13(8):497(h.985 (1993) and SlUt et al 
Cell 80:661.670 (1995). Maafioletti e. al. (.993). have suggested that the gas6 protein is member of the 
vttamm K-dependent family of proteins, wherem the members of the laaer family of protetns (which include for 
example. Protein S. Protein C and Factor X) all play regulatory roles in die blood coagulation pathway TTius it has 
been suggested .hat gas6 may play a role in the regulauon of a protease cascade relevant in growth regulat.on or in 
15 the blood coagulation cascade. 

Given die physiological importance of the gas6 protein, efforts are cunemly beine underuken by both 
mdustry and academia to identify new. native proteins which are homologous to gas6. Many of *ese efforts are 
focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel 
secreted and membrane-bound receptor proteins, specifically diose having homology to gas6. Examples of such 
screenmg methods and techniques are described in the iiteranire (see. for example. Klein et aJ.. Proc Nmi i^.^ ^ 
22:7108-71 ,3 (.996); U.S. Patent No. 5.536.637)1. We herein describe the identification of a novel polypeptide 
which has homology to die gas6 polypeptide. 



25 



2. 

TTie 7-,ransmembiane OTTM^ proteins or receptors, also referred to in the Iiteranire as G-pro,ein coupled 
receptors, arc specialized proteins designed for recognition of ligands and dic subsequent signal transduction of 
uifonnation conta^d within tfu«e Ugands to the machinety of the ceU. TTie primary purpose of cell surface receptors 
.s to discriminate appropriate ligands from the various extracellular stimuli which each cell encounters then to 
acdvate an effector system diat produces an intracellular signal. Uiereby controlling celluhir processes. fDohlman. 
30 H.. Am , Rcv Pinrhrm . fij:653 (1991)]. Tlie ability of 7TM receptors to bind ligand to a recognition domain anti 
allostencalfy transmit the information to an intracellular domain is a specialized feanire of TTM proteins fKenakin 
T.. EtaCQL^. 48.413 (1996)1. The gene family which encodes the TTM receptors or G-protein linked 
recepton encode receptors which recognize a large number of ligands. including bu, not limited to. C5a interleukin 
8 and related chemokines. Research in diis area suggests that distinct signals at the cell surface feed into common 
P-hways of ceU activation. fGeraid. C. and Gerard. N..C«rr, Op, Immimol . fi:140 (1994). Gerard. C. and Gerard 
N. Ain . RcvlmmUTK)L . 12:775 (1994)J. n,e superfamily of 7TM or G-protein coupled receptors contains several 
hundred members able to recognize various messages such as photons, ions and amino acids among odiers (Schwaru. 
T.W.. et al.. H..Trends in Pharmacn|, gyj, j[2(a:213 (1996)). 

munm. H.. Am . Rev Biorhrm . SQ:6S2 (1991)]. (Schwanz. T.W.. et al.. H.. Eur. J. Pham. Sr^ 2 85 (1994)] 
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We describe herein the idemifkation of a novel polypeptide (designated herein as PR0274) which has homology to 
the 7 iransmembrane segment receptor proteins and the Fn54 protein. 

3. SBQM 

The Diff 33 protein is over-expressed in mouse testicular nimors. At present its role is unclear, however. 
5 it may play a role in cancer. Given the medical importance of understanding the physiology of cancer, efforts are 
currently being under taken to identify new. native proteins which are involved m cancer. We describe herein the 
identification of a novel polypeptide which has homology to Diff 33. designated herein as PRO300. 

4. EBOiM 

10 Effons arre currently being undertaken to identify and characterize novel transmembrane proteins. We 

herein describe the identification and characterization of a novel transmembrane polypeptide, designated herein as 
PR0284. 

5. 

15 Cancerous cells often express numerous proteins that are not expressed in the corresponding normal cell 

type or arc expressed at differem levels than in the corresponding normal cell type. Many of these proteins arc 
invoWed in inducing the transformation from a normal ceU to a cancerous cell or in mainaining the cancer phenotype. 
As sudi. there is significant interest in identifying and characterizing proteins that are expressed in cancerous cells. 
We herein describe the idemificauon and characterization of a novel polypeptide having homology to the sarcoma- 

20 amplified protein SAS, designated herein as PR0296. 

6. £Bj2222 

Immunoglobulin molecules play roles in many inqjonant mammalian physiological processes. The stnicmre 
of immunoglobulin molecules has been extensively smdied and it has been well documented that intact 

25 immunoglobulins possess distinct domains, one of which is the consunt domain or F. region of the irarmuioglobulin 
molecule. Tte F. domain of an immunogtobulin. \rtule not being directly involved in antigen recognition and binding, 
does mediate the ability of the immunoglobulin molecule, either uncomptexcd or complexed with its respective 
antigen, to bind to F. receptors either circulating in die scrum or on the surface of cells. The ability of an F, domain 
of an immunoglobulin to bind to an F, receptor molecule results in a variety of important activities, including for 

30 cvanp\e, in mounting an immune response against unwanted foreign particles. As such, there is substantial interest 
in identifying novel F, receptor proteins and subunits thereof. We herein describe the identification and 
characterization of a novel polypeptide having homology to a high affinity immunoglobulin F. receptor protem. 
designated herein as PR0329. 

35 7. iEom 

Cdorectal carcinoma is a malignant neoplastic disease which has a high incidence in the Western world, 
particularly in the Uiuted States. Tumors of this type often metastasize through lymphatic and vascular channels and 
resuh in the death of some 62.000 persons in the United States annually. 
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Monoclonal annbody A33 (mAbA33) is a murine unmunoglobulin that has undergone extensive preclinical 
analysis and localizauon smdics in patients inflicted with colorectal carcinoma (Welt ct al., / Clin. Oncol. 8: 1894- 
1906 (1990) and Well etal.. J. Clin. Oncol. 12:1561-1571 (1994)). mAbA33 has been shown to bind to an antigen 
found in and on the surfece of normal colon cells and colon cancer cells. In carcinomas originating from the colomc 
mucosa, the A33 antigen is expressed homogeneously in more than 95% of the cases. The A33 antigen, however, 
has not been detecting in a wide range of other nonnal tissues, i.e.. its expression appears to be rather organ specific. 
TTiercfore. the A33 antigen appears to play an important role in the induction of colorectal cancer. 

Given the obvious importance of the A33 antigen in tumor cell fonnaiion and/or proliferation, there is 
substantial interest in idemifymg homologs of the A33 antigen. In this regard, we herein describe the idemificauon 
and characterization of a novel polypeptide having homology to the A33 antigen protein, designated herein as 



10 PR0362 



8. PROW 

The ceU surfiice protein HCAR is a membrane-bound protein that acts as a receptor for subgroup C of the 
adenoviruses and subgroup B of the coxsackieviruses. Thus. HCAR may provide a means for mediating viral 
infixtion of cells in that the presence of the HCAR receptor on the cellular surface provides a binding site for viral 
panicles, thereby faciliuting viral infection. 

In light of dK physiological importance of membrane-bound proteins and spcficially those which serve a cell 
surfece receptor for viruses, efforts are currently being underuken by bodi industry and academia to idemify new. 
native membrane-bound reeptor proteins. Many of these efforo are focused on die screening of mammalian 
recombinant DNA libraries to identify die coding sequences for novel receptor proteins. We herein describe a novel 
membrane-bound polypeptide having homology to the cell surface protein HCAR and to various tumor anugens 
inchiding A33 and caicinoembryonic antigen, designated herein as PR0363. wherein dus polypeptide may be a novel 
cell surface virus receptor or tumor antigen. 

25 9. PROg68 

Control of cell numbers in mammals is believed to be determined, in part, by a balance between cell 
proliferation and cett deadi. One form of cell dcadi. sometimes referred to as necrotic cell death, is typically 
characterized as a padtologic form of cell death resulting from some trauma or celhilar injury. In contrast, diere is 
anoOw. "physiologic" form of cell deadi which usually proceeds in an orderly or controlled manner. This orderly 

30 or coniroUed fonn of cell dead, is often referred to as "apoptosis" (see. e.g. . Barr el al., Bio/Technnln„ 12:487-493 
(1994); Steller et al.. Ssacna. 2fi2: 1445-1449 (1995)]. Apoptotic cell dead) nanirally occurs in many physiological 
pnxxsses, including embryonic development and clonal selection in die immune system Rtoh et al.. Cdl. 66:233-243 
(1991)]. Decreased levels of apopujtic cell dcadi have been associated widi a variety of padiological conditions, 
including cancer, lupus, and herpes vims infection (Thompson. SoSLSi, 262:1456-1462 (1995)]. Increased levels 

35 of apoptotic ceU deadi may be associated widi a variety of other padiological conditions, including AIDS. Alzheimer's 
disease. Parkinson's disease, amyotrophic lateral sclerosis, multiple sclerosis, retinitis pigmentosa, cerebellar 
degeneration, apiastic anemia, myocardial in&rcuon. sttoke, reperfusion injury, and toxin-induced liver disease (see. 
Thoaq)son, supral . 
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Apoptoiic cell death is typically accompanied by one or more characteristic morphological and biochemical 
chanees in cells, such as condensation of cytoplasm, loss of plasma membrane microvilli, segmentation of the 
nucleus, degradation of chromosomal DNA or loss of mitochondrial funciion. A vanct>' of extrinsic and mtrinsic 



400 (1992); Siellcr, supra : Sachs et al.. Blood . S2:15 {1993)1. For instance, they can be triggered by hormonal 
5 stimuli, such as glucoconicoid hormones for immamre thymocytes, as well as withdrawal of certain growth factors 
fWatanabc-Fukunaga et al., N^nire . 25fi:3l4-3l7 (1992)]. Also, some identified oncogenes such as myc. reL and 
EIA, and rumor suppressors, like p53, have been reported to have a role in inducing apc^iosis. Certain 
chemotherapy drugs and some forms of radiation have likewise been observed to have apoptosis-inducing activity 
(Thompson, supral . 

10 Various molecules, such as nimor necrosis factor-a ("TNF-a"). tumor necrosis factor-P ("TNF-P" or 

"lymphotoxin-a"), lymphoioxin^p ("LT-P"), CD30 ligand, CD27 ligand. CD40 hgand, OX 40 ligand. 4-lBB ligand, 
Apo-1 ligand (also referred to as Fas ligand or CD95 ligand), and Apo-2 ligand (also referred lo as TRAIL) have been 
identified as members of the tumor necrosis factor f'TNF") family of cytokines [See, e.g.. Gruss and Dower. Blfisd. 
25:3378-3404 (1995); Pitti et al.. J. Biol Chem. . 221:12687-12690 (1996): Wiley et al. ImmuniW. 2:673-682 (1995); 
15 Browning et al.. Cdl. 22:847-856 (1993); Armiiagc et al. £jsiurs, 252:80-82 (1992), WO 97/01633 published January 
16. 1997: WO 97/25428 published July 17. 1997). Among these molecules, TNF-a, TNF-p, CD30 ligand, 4-lBB 
ligand, Apo-1 ligand, and Apo-2 ligand (TRAIL) have been reported to be involved in apoptoiic cell death. Both 
TNF-a and TNF-p have been reported to induce apoptoiic death in susceptible tumor cells (Schmid et al., Prpc. Nati. 
Acad. Sci. . 22:1881 (1986): Dcaltry el al., Eur. J. Immunol. . 17:689 (1987)]. Zheng el al. have reponed that TNF-a 
20 is involved in post-srimulation apoptosis of CD8-positive T cells [Zheng et al.. Nature. 322:348-35 1 (1995)]. Other 
investigators have reported that CD30 ligand may be involved in deletion of self-reactive T cells in the thymus 
(Amakawa et al.. Cold Spring Harbor Laboratory Symposium on Programmed Cell Death, Abstr. No. 10. (1995)). 

Mutations in the mouse Fas/ Apo-1 receptor or ligand genes (called Ipr and gld, respectively) have been 
associated with some autoimmune disorders, indicating diat Apo-1 ligand may play a role in regulating the clonal 
25 deletion of self-reactive lymphocytes in the periphery (Krammer ei ah, C'TT Immunol.. fi:279-289 (1994); 
Nagau ei al., Science . 262:1449-1456 (1995)). Apo-1 ligand is also reponed to induce post-stimulation apoptosis 
in CD4-posiiive T lymphocytes and in B lymphocytes, and may be involved in the elimination of activated 
lymphocytes when their funcuon is no loiiger needed [Krammer et al., sujUii; Nagata et al., sUDQI- Agonist mouse 
monoclonal antibodies specifically binding to the Apo-1 receptor have been reported to exhibit cell killing activity 
30 that is comparable to or similar to ihai of TNF-a fYonehara et al., J. Exp. Med.. 1^:1747-1756 (1989)1. 

Induction of various cellular responses mediated by such TNF family cytokines is believed to be initiated 
by their binding to specific cell receptors. Two distinct TNF receptors of approximately 55-kDa (TNFRl) and 75- 
kDa (TNFR2) have been identified fHohman et al.. J. Biol. Chem. . 264:14927-14934 (1989); Brockhaus et al., Proc. 
Natl. Acad. Sci. . 22:3127-3131 (1990); EP 417,563, published March 20, 1991] and human and mouse cDNAs 
35 corresponding to both receptor types have been isolated and characterized ILoetscher et al., £s3l* 61:351 (1990); 
Schall ct al., Cdl. 61:361 (1990); Smidi et al.. Science . 248:1019-1023 (1990); Lewis ei al., Prpc, NaiL Acal Sa. 
ffi;2830-2834 (1991): Goodwin ci al., Mr^l. Cell. Biol. . 11:3020-3026 (1991)1. Extensive polymorphisms have been 
associated with both TNF receptor genes (see, e.g., Takao ei al., Immunogenetics. 27:199-203 (1993)1. Bodi TNFRs 
share the typical structure of cell surface receptors including extracellular, transmembrane and mtraccllular regions. 



signals are believed to trigger or induce such morphological and biochenucal cellular changes [Raff, NiSUIS. 226:397' 
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The cxiracclluJar ponions of both receptors are found naturally also as soluble T^binding proteins fNophar. Y 
ei al., EMBCLL, 2:3269 (1990): and Kohno. T. ei al.. Proc. Natl. Acnd . Set U.S.A. . £7:8331 (1990)]. More 
recently, the cloning of recombinant soluble TNF receptors was reported by Hale et al. fJ. Cell Rinchp^ 
Supplement 15F. 1991. p. 113 (P424)]. 

The extraceUuJar pomon of type I and type 2 TNFRs (TNFRl and TNFR2) contains a repetitive amino acid 
sequence pattern of four cysieine-nch domains (CRDs) designated 1 through 4. stanmg from the NH^-terminus. Each 
CRD is about 40 amino acids long and contains 4 to 6 cysteine residues at posiuons which are well conserved [Schall 
et al.. Siicra; Loeischer et al., sucxa; Smith ci al., supra : Nophar ei aL, suora : Kohno el al.. suora^ . In TNFRl, the 
approximate boundaries of the four CRDs are as follows: CRD I - amino acids 14 to about 53; CRD2- ammo acids 
from about 54 to about 97; CRD3- amino acids from about 98 lo about 138: CRD4- amino acids from about 139 to 
about 167. In TNFR2. CRDI includes amino acids 17 to about 54; CRD2- ammo acids from about 55 to about 97; 
CRD3- amino acids from about 98 to about 140; and CRD4- amino acids from about 141 to about 179 (Banner et al.. 
Cell, 22:431-435 (1993)1. The potential role of the CRDs in ligand binding is also described by Banner et al,. siiCia. 

A similar repetitive panern of CRDs exists in several other cell-surface proteins, including the p75 nerve 
growth factor receptor fNGFR) [Johnson et al.. CcU, 47-545 (1986); Radeke et al., Namre . 225:593 (1987)|, the B 
15 ceU antigen CD40 fStamenkovic et al., EMfiJLL. 8:1403 (1989)1. the T cell antigen 0X40 [Mallet ec al.. EMBQJ. 
2:1063 (1990)1 and the Fas antigen [Yonehara ci aL, agaa and lioh et al., ££ll. fifi:233-243 (1991)]. CRDs are also 
found in the soluble TNFR (sTNFR)-like T2 proteins of the Shope and myxoma poxviruses [Upton et al., Virology 
M):20-29 (1987); Smith et al.. Biochem. Bionhvs. Re.v romTmin I7fi:335 (1991); Upion et al . Virolopy , 184:370 
(1991)]. Opdmal alignment of these sequences indicates that the positions of the cysteine residues are well conserved. 
These receptors are sometimes collectively referred to as members of the TNF/NGF receptor superfamily. Recent 
studies on p75NGFR showed that the deletion of CRDI [Welcher, A. A. et al. , Proc. Natl. Acad. Sci URf\ £8: 159- 
163 (1991)] or a5-amino acid insertion in this domain (Van, H. and Chao. M.V., J. Biol. Chem. , 266:12099-19104 
(1991)] had Utile or no effect on NGF binding [Yan. H. and Chao, M, V., supra) . p75 NGFR contains a proline-rich 
stretch of about 60 amino acids, between its CRD4 and transmembrane region, which is not involved in NGF binding 
25 [Pectrc, C. ei al., E^r, J. Hcmoi,, 11:414-419 (1988); Seckmger, P. et al.. J. Biol. Chem. . 264:11966-11973 
(1989): Yan, H. and Chao, M.V.. jujaaj. A similar proline-rich region is found in TNFR2 but not in TNFRl. 

The TNF family ligands identified to date, with the exception of lymphoioxin^x, are type II transmembrane 
proteins, whose Cnerminus is extracellular. In contrast, most receptors in the TNF receptor (TNFR) family identified 
to date are type I transmembrane proteins. In both the TNF ligand and receptor families, however, homology 
30 identified between family members has been found mainly in the extracellular domain ("ECD"). Several of the TNF 
family cytokines, including TNF-a, Apo-l ligand and CD40 ligand. are cleaved proteolytically at the cell surface; 
the resulting protein in each case typically forms a homotrimcric molecule that functions as a soluble cytoicine. TNF 
receptor family proteins arc also usually cleaved proteolytically to release soluble receptor ECDs that can funcuon 
as inhibitors of the cognate cytokines. 
35 Recentiy, other members of the TNFR family have been identified. Such newly identified members of the 

TNFR family iiKiude CARl, HVEM and osteoprotcgenn (OPG) fBrojatsch et al., Cdl, £7:845-855 (1996); 
Momgonieiy et al.. Cdl. E:427^36 (1996): Marstcrs ei al.. J. Biol. Chem. . 222:14029-14032 (1997); Simonei et 
al.. Celi^ 82:309-319 (1997)]. Unlike otiier known TNFR-like molecules, Simonet ci al., sSffiQ. repon that OPG 
contains no hydrophobic transmembrane-spanning sequence. 
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Moreover, a new member of the TNF/NGF receptor family has been identified in mouse, a receptor referred 
to as "GITR" for '*glucocorbcoid-indiiced tumor necrosis factor receptor family-related gene" [Nocentini ei a!., El2£* 
|Maf! Acad. Sci. USA 94:6216-6221 (1997)). The mouse GITR receptor is a 228 amino acid type 1 transmembrane 
protein thai is expressed in normal mouse T lymphocytes from thymus, spleen and lymph nodes Expression of the 
mouse GITR receptor was induced in T lymphocytes upon activation with anti-CD3 antibodies. Con A or phorbol 

5 l2-myrisiate 13-acetate. It was speculated by the authors that the mouse GITR receptor was involved in the 
regulation of T cell receptor-mediated cell death. 

In Marsters ci al.. Curt. Biol. . 6:750 (1996). investigators describe a full length native sequence human 
polypeptide, called Apo-3. which exhibits similarity to the TNFR family in its cxiracellular cysieme-rich repeats and 
resetribles TNFRl and CD95 in that it contains a cytoplasmic death domain sequence Isee also Marsters ei al.. QiiL 

10 Biol. . 6:1669 (1996)]. Apo-3 has also been referred to by other investigators as DR3. wsl-1 and TRAMP 
(Cnunnaiyan ct al.. SeifflDCS, 274-990 (1996); Kitson ei al.. Nature . 2M-372 (1996); Bodmer et al.. ImmunitY , fi:79 
(1997)1. 

Pan ei al. have disclosed another TNF receptor family member referred to as ■•DR4'' (Pan et al., SfiifiDfic, 
27fi:llM 13 (1997)1. The DR4 was reported to coniam a cytoplasmic death domain capable of engaging the cell 
15 suicide apparatus. Pan et al. disclose that DR4 is believed to be a receptor for the ligand known as Apo-2 ligand or 
TRAIL. 

In Sheridan ct al,. Science . 222:818-821 (1997) and Pan et al., Sciencs. 222:815-818 (1997), another 
molecule beUcved to be a receptor for the Apo-2 ligand (TRAIL) is described. That molecule is referred to as DR5 
(it has also been alternatively referred to as Apo.2). Like DR4, DR5 is reported to contain a cytoplasmic death 
20 domain and be capable of signaling apoptosis. 

In Sheridan ci al.. ssfflia. a receptor called DcRl (or alternatively. Apo-2DcR) is disclosed as being a 
potential decoy receptor for Apo.2 ligand (TRAIL). Sheridan et al. repon that DcRl can inhibit Apo-2 ligand 
function in vitro. See also, Pan et al.. sma, for disclosure on the decoy receptor referred to as TRID. 
For a review of the TNF family of cytokines and their receptors, see Gruss and Dower. suctS- 
25 As presently understood, the cell death program contains ai least three important elements - activators, 

inhibitors, and effectors; in C. elegans, these elements are encoded respectively by three genes. C£rf-9 and 

Ced-3 [SteUer, SsiSDCS. 262:1445 (1995); Chinnaiyan et al.. Sfiknofi, 22S: 1 122-1 126 (1997): Wang et al.. Cdl. 2Q:l- 
20 (1997)]. Two of the TNFR family members, TNFRl and Fas/Apol (CD95). can activate apopioiic cell death 
[Chinnaiyan and Dixit, Current Biology . fi:555-562 (1996); Frascr and Evan. CfiU; 85:781-784 (1996)1. TNFRl is 
30 also known to mediate activation of the transcription factor, NF-kB [Tartaglia et al., Cdl, H:845-853 (1993); Hsu 
cial.. Cell, 84:299-308 (1996)1. In addition to some ECD homology, these two receptors share homology in their 
iniracelhilar domain QCD) in an oUgomerizaiion interface known as the death domain [Tartaglia et al., suDia; Nagata, 
Cell . 88:355 (1997)1. Death domains arc also found *m several meiazoan proteins that regulate apoptosis, namely, 
the Drosophila protein. Reaper, and the mammalian proteins refened to as FADD/MORTl. TRADD. and RIP 
35 ICleaveland and Ihle, Cell. 81:479-482 (1995)1. 

Upon Ugand binding and receptor clustering, TNFRl and CD95 are believed to recniii FADD into a death- 
inducing signalling complex. CD95 purportedly binds FADD directly, while TNFRl bmds FADD indirectly via 
TRADD [Chinnaiyan ctal.. Cdl. 81:505-512 (1995); Boldin et al., 1 Bml Chem.. 27Q:387-391 (1995); Hsu et al., 
supra : Chinnaiyan ct al.. 1 Binl. Chem. . 221:4961-4965 (1996)1. It has been reported that FADD serves as an 
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adaptor proiein which recruits the Ced-J-nhicd protease, MACHa/FLICE (caspase 8). inio the death signallms 
complex (Boldin ei al.. £ell, 35:803-815 (1996): Muzio et aJ.. £ell, M 817-827 (1996)). MACHa/FLICE appears 
to be the trigger that sets off a cascade of apopiotic proteases, mcJuding the micrleukin-ip convening enzyme (ICE) 
and CPPBZnrama, which may execute some critical aspects of the cell death programme [Fraser and Evan, siyjral. 

It was recenUy disclosed that programmed ccU death involves the activity of members of a family of cysteine 
proteases related to the C. eiegans cell death gene, ced-S. and to the mammalian IL-1 -convening enzyme, ICE. The 
acdvicy of the ICE and CPP32A'ama proteases can be inhibited by the product of the cowpox vims gene, crmA (Ray 
ei al.. CfiU. 62:597-604 (1992); Tcwan ct al„ CsJI. 41:801-809 (1995)|. Recem studies show diat CrmA can inhibit 
TNFRl- and CD95-induced cell death fEnari ei al.. liaiutt. 225:78-81 (1995): Tewari et al., J. Biol r^^^ ni. 
27Q:3255-3260 (1995)1. 

As reviewed recently by Tewari et al.. TNFRl. TNFR2 and CD40 modulate the- expression of 
proinflammatory and costimulatoi7 cytokines, cytokine receptors, and cell adhesion molecules through activation of 
the transcription factor, NF-kB (Tewari et al., Curr. On. Genet. Q^vfi^p, 6:39-44 (1996)1. NF-kB is the prototype 
of a family of dimeric transcription factors whose subunits contam conserved Rel regions [Verma et al.. Genes 
DfiVfiliaL. 2:2723-2735 (1996): Baldwin. Ann. Rev. Immnnnl 14:649-681 (1996)) In its laiem form. NF-kB is 
15 complexed with members of the IkB inhibitor family: upon inactivation of the IkB in response to cenam stimuli, 
released NF-kB translocates to the micleus where it binds to specific DNA sequences and activates gene transcription. 

10, EBI22S2 

Proteases arc enzymatic proteins which arc involved in a large number of very important biological 
processes in mammalian and non-mammalian organisms. Numerous different protease enzymes from a vanety of 
different mammalian and non-mammalian ofganisms have been boih idcmified and characterized, including the scnne 
proteases which exhibit specific activity toward various serine-containing proteins. The mammalian protease enzymes 
play important roles in biological processes such as, for example, protein digestion, activation, inactivation. or 
modulation of peptide hormone activity, and alteration of the physical properties of proteins and enzymes. 
^ ^ig*^* of the important physiological roles played by protease enzymes, efforu are currently being 

undertaken by both industry and academia to identify new, native protease homologs. Many of these efforts are 
focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel 
membrane-bound receptor proteins. Examples of screening methods and techniques are described in the literawre 
Iscc. for example. Klein ct al., Proc. Natl. Acad f^ri 22:7108-7113 (1996): U.S. Patent No. 5,536.637)]. We 
30 herein describe the identification of novel polypeptides having homology to serine protease enzymes, designated 
herein as PR0382 polypeptides. 

The ADAM (A Disintegrin And Metalloprotease) family of proteins of which melirin is a member may have 
35 an in^nant role in cell interactions and in modulating cellular responses, (see. for example. Gilpin ct ah. J. Biol. 
ChsOL. 222tD: 157-166 (1998)J. The ADAM proteins have been implicated m carcinogenesis. Meltrin-a (ADAM12) 
is a myoblast gene product reported to be required for cell fusion. (Hanis et al., J. Cell. Biochem. £70^:136-142 
(1997). Yagami-Hiromasa et al.. HsMSi. 277:652-656 (1995)), The meltnns contain disintegrin and metalloprotease 
domains and are implicated in cell adhesive events involved in development, through the integrin-binding disintegrin 
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doirain. but also have an anti-adhesive function through a zinc-dependent mctalloprotease domain. (Alfandan ct al.. 
n,^^l Biol. . 12201:314-330 (1997)). Given the medical importance of cell fusion and modulation of cellular 
respoi^es in carcinogenesis and other disease mechanisms, efforts are currently being under taken to identify new. 
native proteins which are involved in cell fusion and modulation of cellular responses. We describe herein the 
identification of a novel polypeptide which has homology to melirin. designated herein as PR0545. 

5 

12. SBQSll 

CD24 is a protein thai is associated with the cell surface of a variety of diffcrem cells of the mammalian 
immtme system. includii« for example, neutrophils, monocytes and some lymphocytes, for example. B lymphocytes. 
CD24 has been shown to be a Ugand for the platelet-associated surface glycoprotein P-selectin (also known as granule 

10 membrane piotein-140 or GMP-140). a glycoprotein that is const.wiively synthesized in both platelets and endothelial 
cells and becomes exposed on the surface of platelets when those cells become activated. In this way. P-selectin 
mediates the calcium-dependent adhesion of activated platelets and endothelial cells to the various cells of the immune 
system that express one or mote ligands for the P-selectin molecule, panicularly CD24. TOs mechanism allows for 
recruimiem of immune system cells to locations where they are most needed, for example, sites of injury. Thus. 

15 there is substantial interest m idemifying novel polypeptides that exhibit homology to the cell surface anrigcns of the 
immune system cells. We herein describe the idemiftcation and characterizarion of a novel polypeptide having 
homology to the CD24 protein, wherein that novel polypeptide is herein designated PR0617. 
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13. rnfriM 

Protein-disulfide isomerasc (PDI) is a caulyst of disulfide formation and isoraerizarion dunng protem 
folding U has two catalytic sites housed in two domains homologous to thioredoxin. one near the N terminus and 
the other near the C temunus. [See for example. Gilbert HF. } Pi^l Chsm .. 47:29399-29402 (1997). Mayf.eld KJ. 
Ssicnes. 278:1954-1957 (1997) and Puig et al.. LBiolXHaiU. 52:32988-32994 (1997)]. PDI is useful for formation 
of naniral type disulfide bonds in a protein which is produced in aprokaryotic cell. (See also. U.S. Patent Nos. 

25 5.700.659 and 5.700.678). 

-nws. PDI and motecuks rcUted thereto arc of interest, panicularly for ability to catalyze the formation of 
disulfide bonds. Moreover, these molecules arc generally of interest in the study of redox reactions and related 
processes. PDI and related molecules are fiirther described in Darby, et al.. Bififihcmiaut 34. 1 1725-11735 (1995). 
Wc herein describe the identification and characterization of novel polypeptides having homology to protein disulfide 

30 isomerase. designated herein as PRO700 polypeptides. 



14. mxm 

Conglutinin is a bovine senim protein that was originally described as a vertebrate lecun protein and whtth 
belongs to the family of C-type lectins thai have four characteristic domains. (1) an N-terminal cysieme-rich domain. 
35 (2) a coUagen-like domain. (3) a neck domain and (4) a carbohydrate recognition domain (CRD). Recent reports have 
demonstrated that bovine conglutinin can inhibit hemagglutination by infiuenza A viruses as a result of iheir lectin 
properties (Eda et al.. Mnctem l. 316:43-48 (1996)). U has also been suggested that lectins such as conglutinin can 
function as immunoglobuiin-indepcndent defense molecules due to complement-mediated mechanisms. THus. 
conglutinin has been shown to be useful for purifying immune complexes in mro and for removing circulating 
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unmunc complexes from padems plasma ,n (Lun e, al.. Biochem. Rinnhvc p.c r.^^,^ 218:260-266 (1996)) 
We herein describe *e idemficadon and characterization of a novel polypeptide havng homology to the congluunin 
protein, designated herein as PRO702. 

15. EBOm 

Veo.-long<hain acyl-CoA synthetase fVLCAS") is a long-chain fatty acid transpon prote.n which is active 
m d« celhdar transpon of long and very long cham fatty acids, [see for example. Uchtda et al.. L^bm (Tokyo) 
n9(3):565-571 (1996) and Uchiyama et al.. / Bifll fhcni 271(48):30360.30365 (1996). Given die biological 
unportance of fatty acid transport mechanisms, efforts are cmently being under taken to idemtiy new. «„,ve proteim 
wh,ch are involved in fatty acid cranspor.. Wc describe herein dte idenUficatton of a novel polypeptide which has 
homology to VLCAS, designated herein as PRO703. 

16. PRO70S 

nie glypicans aie a family of glycosylphosphatidylinosi.ol (GPD-anchored proteoglycans ftat. by v.rme of 
thetr cell surface localization and possession of heparin sulfate chains, may regulate d,e responses of cells to 
tmmerous hcparin-Wnding growth factors, cell adhesion molecules and extracellular matnx components. Mutations 
.n one glypican protein cause of syndrome of human bir* defects, suggesting that die glypicans may play an 
tmporumt role in developmem (Utwack e. al.. DcvJ^ml 211:72-87 (1998)). Also, since die glypicans may interact 
widi die various extraccUutar matrices, ihey may also play important roles m wound healing (McGrad. et al.. fiaihaL 
183:251-252 (1997)). Furthermore, since glypicans ai« expressed in neurons and glioma cells, diey may also play 
an miportam role in die regulation of cell division and survival of cells of the nervous system (Uang et al L£6lL 
fiioL 139:851-864 (1997)). is evident, dierefore. that d« glypicans are an extremely important femily of 
proteoglycans. TTicre is. dier^fore. substantial interest in identifying novel polypeptides having homology to members 
of the glypican family. We herein describe die identification and characterization of a novel polypeptide having 
homology to K-glypican, designated herein as PRO705. 



17. £BQ298 

Aiyl sul&tases are enzymes diat exist in a number of diffcrem isoforms. including aiyl sulfatase A (ASA) 
aryl sulfatase B (ASB) and aryl sulfatase C (ASC). and du« luncuon to hydrolyze a variety of differem aromatic 
sulfides. Anrl sulfittases have been isolated fiom a variety of different animal tissues and microbial sources and d«ir 
30 smicm^s and have been extensively smdied (see. e.g., NIchol and Roy. LMiSim. 55:643-651 (1<>64». 

ASA deficiency has been reported to be associated wiUi metachromatic leukodystrophy (MLD) (Giles et al EmaL 
fiiaglL 7(4):245-252 (1987) and Herska et al.. Am, J. M^Ttl rirrf 26(3):629.635 (1987)). Additionally, odter 
groups have reported Uiat aryl sulfatases have been found in high leveU in namral killer cells of d« immune system 
and have hypodiesized a possible role for d«se enzymes in NK ceU-mediated ceUular lysis (see. e.g.. Zucker-Fraaklin 
« al.. Proc. Nf^tl Arad 80(22):6977.6981 (1983)). Given die obvious physiological importance of the 

aoi sulfatase enzymes, diere is a substantial interest in identifying novel aryl sulfatase homolog polypeptides. We 
herein describe the identification and characterization of novel polypeptides having homology to d» aryl sulfatases. 
wherein diese novel polypeptides are herein designated PRO708 polypeptides. 
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18. sBom 

FibuUn-l is a cysteine-nch. caJcium-binding exnaceUular matrix (ECM) component of basement membranes 
and connective tissue elastic fibers and plasma protein, which has four isofonns. A-D. derived from alicniative 
splicing. Fibulin-1 is a modular glycoprotein with amino-terminal anaphlatoxin-like modules followed by nine 
epidermal growth factor (EGF)-like modules and. depending on alternative splicing, four possible carboxyl termini 
5 Fibulin-2 is a novel extracellular matrix protein frequently found in close association with microfibrils contaming 
either f.broneciin or fibrillin. There arc multiple forms of fibulin-l that differ in their C-terminal regions that are 
produced through the process of alternative spUcing of their precursor RNA |see for example Iran ct al.. Mailii 
Bial 15(7):479-493(1997).) 

Northern and Western blotting analysis of 16 ccU lines established from mmors formed in athymic mice and 
10 malignant ceU lines derived from patients indicate that low expression of fibulin-lD plays a role in mmor formation 
and invasion. IQing ct al.. Oncogene . 18:2159-2168 (1997)1. Ovanan-cancer cells are characterized by their ability 
to invade freely the peritoneal cavity. U has been demonstrated that estradiol stimulates the proliferation of estrogen- 
receptor (ER)-positive ovanan<ancer cells, as well as expression of fibuun-l. Snidies on the effect of fibutin-1 on 
motility of the MDA-MB231 breast-cancer cell line, indicated inhibition of haptotacuc migradon of MDA-MB231 
15 cells, and the authors concluded that fibulin-1 can inhibit cancer cell motility in vitro and therefore has the potential 
to inhibit tumor invasion. (Hayashido et al.. \n T CanCCf . 75(4):654-658 (1998)] 

TTius. fibuhn. and molecules related thereto arc of interest, particularly for the use of preventing cancer. 
Moreover, these molecules are generaUy of interest in the smdy of connective tissue and aaachment molecules and 
relared mechanisms. Fibulin and related molecules are further described in Adams, et al.. I . Mfll . Bwl - 
20 272(2):226-36 (1997): Kielty and Shuttleworth. Micrffiit. PcS. T«h- 38{4):413-27 (1997); and ChiW , J. Card . Surg. - 
12(2Supp.): 131-5 (1997). 

We herein describe the idendficaiion and characterization of novel polypeptides having homology to fibuhn. 
designated herein as PRO320 polypeptides. 



25 19. PR0??4 

Oxidoreductases arc enzymes that catalyze a reaction in which two molecules of a compound interact so that 
one molecule is oxidized and the other is reduced, with a molecule of water entering the reaction. THere are many 
different types of oxidoreductase enzymes that play very important physiological roles in the mammalian organism. 
Some of the most important oxidoreductases include, for example, lyases, lactases, cholesterol oxidases, and the like. 
These enzymes play roles in such essential processes as digesuon. signal transduction, maintenance of ionic 
homeostasis, airf the like. As such, given that oxidoreductase enzymes find various essential uses in the mammalian 
organism, there is a substantial interest in identifying novel oxidoreductase enzyme horaologs. We herein describe 
the identification and chancterizauon of a novel polypeptide having homology to oxidoreductases. designated herein 
as PR0324. 



30 



35 



20. PR0351 

Prostasin is a novel human serine proteinase purified from human seminal fluid, taimunohistochemical 
localization reveals that prostasin is prcsem in epithelial cells and ducts of the prostate gland. The cDNA for 
prostasin has been cloned and characterized. Southern blot analysis, following a reverse transcription polymerase 
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chain reacuon. indicates that prosiasin mRNA .s expressed in prostate, liver, salivary gland, kidney, lung, pancreas, 
colon, bronchus, renal proximal tubular ccUs. and prostate carcinoma LNCaP cells. Cellular localization of prostasiii 
mRNA was identified witfun epithelial ceUs of the human prostate gland by in sim hybridizauon histochemistry. (See 
for example. Yu et al.. Ifiiel£h£n.. (,994) 269a9):18843-18848. and Yu et al.. J Biol rh.m (,994) 270(22). 1 3483- 
13489]. 

5 Thus, prostasin. and molecules related thereto are of interest, particularly for the study, diagnosis and 

inamient of medical conditions involving the prostate. Prostasin and related molecules arc further described in Yu 
et al.. Qsimm (1996) 32(3):334-340 We herein describe the identification and charactenzation of novel 
polypeptides having homology to prostasin. designated herein as PR0351 polypeptides. 

10 21. £EQ2a 

Butyrophilin is a milk glycoprotein that constitutes more than 40% of the total protein associated with the 
fat globule membrane in mammalian milk. Expression of butyrophilin mRNA has been shown to correlate with the 
onset of milk fat production toward the end pregnancy and is maintained throughout lactation. Butyrophilin has been 
identified in bovine, murine and human (see Taylor et al.. Biochim Rinnhv, Ar„ ,306: M (1996). Ishii et al.. 
B i whim-BigphYS , Acta 1245:285-292 (1995). Mather et al.. J. Dairy Sci. 76:3832-3850 (1993) and Banghart et al.. 

273:417M179 (1998)) and is a type I transmembrane protein diat is incorporated into the fat globulin 
membrane. It has been suggested that butyrophilin may play a role as the principle scaffold for the assembly of a 
complex with xandiine dehydrogenase/oxidase and other proteins diat function in the budding and release of milk-fat 
globules from the apical surface during lactation (Banghart et al., smU- 

Given that butyrophilin plays an obviously impoitant role in mammalian milk production, there is substantial 
interest in identifying novel butyrophilin homologs. We herein describe the identification and characterization of a 
novel polypeptide having homology co butyrophilin. designated herein as PR0352. 

22. t&sm 

25 -nt immunophilins are a family of proteins that function as receptors for immunosuppressant drugs, such 

as cyclosporin A. FK506. and rapamycin. The immunophilins occur in two separate classes. ( I) the FK506-binding 
FOteins (FKBPs). which hind to FK506 and rapamycm. and (2) tiie cyclophilins. which bind to cyclosporin A. With 
regard 10 the FK506-bindiiig proteins, it has been reported diat the FIC506/FKBP complex functions to inhibit tiie 
activity of the serine/Uireoninc protein phosphatase 2B (calcineuiin). tiiereby providing immunosuppressant activity 

30 (Gold. MflUilaualliaL 15:285.306 (1997)). It has also been reported that tiw FKBP immunophilins are found in 
tiie mammalian nervous system and may be involved in axonal regeneration in the central nervous system through 
a mechanism that is independent of the process by which immunostwression is achieved (Gold, supra). Tbm, there 
is substantial interest in identifying novel polypeptides having homology to the FKBP immunophilins. We herein 
describe the identification and characterization of a novel polypeptide having homology to an FKBP imnmnophilin 

35 protein, designated herein as PR038 1 . 



20 



23. PR03M 



Mammalian ceU membranes perform very imponam fiinctions relating to the smicmral integrity and activity 
of various cells and tissues. Of panicular interest in membrane physiology is the snidy of transmembrane ion 
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Channels which act to direcUy control a variety of physiological, phamuicological and cellular processes. Numerous 
ion channels have been idemified including calcium (Ca). sodium (Na) and potassium (K) channels, each of which 
have been analyzed in detail to determine their roles in physiological processes in vertebrate and insect cells. 

One type of cell membrane-associated ion channel, the sodium channel, plays an extremely important role 
in a cell's ability to maintain ionic homeostasis as well as transmit intracellular and extracellular signals. Voluge- 
5 gated sodium chaimels in brain neurons have been shown to be complexes of a pore-forming alpha unit with smaller 
beia-1 and be«a-2 subunits (Isoro et al.. £eU 83:433^2 (1995)). Given the obvious Importance of sodium channels 
in cellular homeostasis and other important physiological functions, there is a significant interest in identifying novel 
polypeptides having homology to sodium channel subunits. We herein describe the identificanon and characterization 
of a novel polypeptide having homology to the beta-2 subunit of the rat sodium channel, designated herein as 
10 PR0386. 

24. PRO540 

Lecithin-cholesterol acyltransferasc ("LCAT"), also known as phosphaiidylcholine-sterol acyltransferase is 
a key enzyme in the intravascular metabolism of high density lipoproteins, specifically in the process of cholesterol 
15 metabolism, (see. for example. Brousseau et al.. J.LipidRcs.. 38(12):2537-2547 (1997). HiU et al.. fiiflchanJ.. 
294:879-884 (1993). and Drayna et al.. Namre 327 (6123)-.632-634 (1987)]. Given the medical importance of lipid 
metabolism, efforts are curremly being under taken to identify new. native proteins which are mvolved in Upid 
transport. We describe herein the identification of a novel polypeptide which has homology to LCAT. designated 
herein as PROS40. 

20 

Synaptogyrin is a synaptic vesicle protein that is uniformly distributed in the nervous system: TTie cDNA 
encoding synaptogyrin has been cloned and sequenced and the sequence predicts a protein with a molecular mass of 
25.900 D with four membrane-spanning domains. Synaptogyrin has been implicated in membrane traffic to and from 

25 the plasma membrane. Stenius et al.. f , Ccll. BiOl. 131(6-2):1801-1809 (1995). In addition, a novel isoform of 
synaptogyrin caUed celhigyrin exhibits sequence identity with synaptogyrin. In rat tissues. ceUugyrin and 
synapiogyrins are expressed in mirror image patterns. Celhigyrin is ubiquitously present in all tissues tested with the 
lowest levels in brain tissue, whereas synaptogyrin protein is only dctecoble in brain. In rat tissues. ceUugyrin and 
synapiogyrins are expiessed in minor image panems. The synaptic vesicle protein synaptogyrin may be a specialized 

30 version of a ubiquitous protein. ceUugyrin. with the two proteins sharing su^cwral similarity but differing in 
localization. TWs finding supports the emerging concept of synaptic vesicles as the simpUfied and specialized form 
of a generic trafficking organelle. |Janz et al.. L.EiJ2LIto. 273(5):285 1-2857 (1998)J . Tbc sequence for 
ceUugyrin derived from the Norway rat. Rams norvegicus has been deposited in the Genbank database on 23 
December 1997. designated accession number AF039085. See also, Janz et al.. ] Riol.Chfim. 273 (1998). in press. 

35 Given the medical importance of synaptic transmission, efforts are currenUy being under taten to identify 

new. native proteins tiiat may be part of a simpUficd and specialized generic trafficking organelle in the form of 
synaptic veskte. We describe herein the idcnrificarion of a novel polypeptide which has homotogy to synaptogyrin. 
designated herein as PR061S. 
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Entcropepodasc is a key enzyme in ihe inesdnal digesiion cascade specifjcally cleaves the acidic propeptide 
from .rypsinogen to yield active trypsin. TOs cleavage initiates a cascade of proteolytic reactions leading to the 
activation of many pancreatic zymogens. 

Sec. for example. Matsushima et al.. /, Biol. Qism 269(31):19976-19982 (1994), Kitamoto et al.. Proc. N., Ar»^ 
Sd. 91(16):7588-7592 (1994). Entcrokinase (enteropeptidase) is a related to mammalian serine proteases involved 
in digestion, coagulation, and fibrinolysis. LaVallie et al.. J Biol Chep ,, 268(31):23311-23317 (1993). 

Given the medical importance of digestive processes, efforts arc currently being under taken to identify new. 
native proteins that may be involved in digesiion, coagulation, and fibrinolysis. We describe herem the ideniification 
of a novel polypeptide which has homology to enteropeptidase, designated herein as PR0618. 

27. PR0719 

Lipoprotein lipase is a key enzyme that mediates the hydrolysis of triglycerides and phosphoUpids present 
in circulating plasma lipoproteins (Dugi et al.. J. BiPl, Chgm,- 270:25396-25401 (1995)). Moreover, lipoprotein 
lipase has been shown to mediate the uptake of lipoproteins mto cells, wherein cellular uptake of lipoproteins is 
initiated by binding of lipoprotein lipase to cell surface proteoglycans and to the low density lipoprotein (LDL) 
receptor-related protein (Krapp et al.. f . Lipid RgS 36:2362-2373 (1995)). Thus, it is dear tha, lipoprotein lipase 
plays an extremely important role in lipoprotein and cholesterol metabolism. TTieie is. therefore, substantial interest 
in idennfying novel polypeptides that share setpicnce homology and/or biological activity with lipoprotein lipase. We 
herein describe the identification and characteiization of a novel polypeptide having sequence homology to lipoprotein 
lipase H, designated heein as PR0719. 

28- mom 

TTie low density Upoprotein (LDL) receptor b a membrane-bound protein that plays a key role in cholesterol 
homeostasis, mediating ceUular uptake of lipoprotein particles by high affinity binding to its ligands. apolipoprotein 
(apo) B-lOO and apoE. TTie ligand-binding domain of the LDL receptor contains 7 cysteine-rich repeats of 
approximately 40 amino acids, wherein each repeat contains 6 cysteines, which form 3 intra-repeat disulfide bonds. 
THese unique structural features provide the LDL receptor with its ability to specificaUy imeract with apo B-lOO and 
apoE. thereby allowing for tnuispon of these lipoprotein panicles across cellular membranes and metabolism of their 
conipoiiems. Soluble fragments conta>nii« the extracellular domain of the LDL receptor have been shown to retain 
the ability to interact with its specific lipoprotein Ugands (Simmons et al.. J. Biol. Ct^ , 272:25531-25536 (1997)). 
mis. it is clear that the LDL receptor is intimately im^lved in important physiological activities related to cholesterol 
mctaboKsm. As such, there is substantial interest in idcmifying novel LDL itceptor homolog proteins. We herein 
describe die identification and characterization of a novel polypeptide having homotogy to the human LDL receptor 
protein, designated herein as PR0724.. 

29. EEQ222 

Expression of the human gene A4 is enriched in the colonic epidieUum and is transcriptionally activated on 
differentiation of cotomc epithelial ceils in vitro (OUva a al.. Arch Rinrhen, u^^^^, 302:183-192 (1993) and Oliva 
et al.. Am . f.PHvsiftL 272:C957^ (1997)). A4 cDNA contains an open reading frame that predicts a polypeptide 
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of approumately 17 kilodaltons in size. Hydropathy analysis of the A4 protein revealed four putative membrane- 
spanning alpha-helices. Imniunocytochcmical studies of cells expressing A4 protein indicated that expression is 
localized to the endoplasmic reticulum. The four membrane-spanning domains and the biophysical characteristics 
of the A4 protein suggest ihat it belongs to a famUy of integral membrane proteins called proteolipids. some of which 
muUimetize lo form ton channels. In fact, preliminary evidence has suggested that A4 may itself multimerize and 
5 ,akeontheptopeitiesofanionchannel(OUvaetal..MUJElaa2L2^ Given the importance 

of ion channels in maintaining cellular homeostasis, there is a significant interest in identifying novel polypeptides 
having homology to known and putative ion chamiels. We herein describe the identification and characterization of 
a novel polypeptide having homology to the putative ion channel protein. A4. designated herein as PR0772. 

10 30. t&QSSi 

Proteases arc enzymatic proteins which are involved in a large number of very imponant biological 
processes in mammalian and non-mammalian organisms. Numerous different protease enzymes from a variety of 
different mammalian and non-mammalian organisms have been both identified and characterized. The mammaUan 
protease enzymes play important roles m many different biological processes including, for example, protem 
15 digesDon. activation, inactivadon. or modulation of peptide hormone activity, and alteration of the physical properues 
of proteins and enzymes. 

In light of the important physiological roles played by protease enzymes, efforts are currently being 
undertaken by both industry and academia to identify new. native protease homologs. Many of these efforts are 
focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel 
20 secreted and membrane -bound receptor proteins. Examples of screening methods and techniques are described in 
the literawre (see. for example. Klein et al.. Pror m Acad, Sci . 22:7108-7113 (1996); U.S. Patent No. 
5.536.637)1. We herein describe the identification of novel polypeptides having homology to various protease 
enzymes, designated herein as PR0852 polypeptides. 

25 31. £Bj28a 

Studies have reponed that the redox state of the cell is an imponant determinant of the fate of the cell. 
Fuithennore. reactive oxygen species have been reported to be cytotoxic, causing inflammatory disease, including 
tissue neciwis. organ failure, atherosclerosis, infertility, birth defects, prcmamre aging, mutations and malignancy. 
Thus, the control of oxidation and reduction is important for a number of reasons, including the control and 

30 prevention of strokes, heart atucks. oxidative stress, hypenension and may be associated with the development of 
malignancies. The levels of antioxidant enzymes, such as reductases, which catalyze the conversion of reactive 
oxygen species to water have been shown to be low in cancer cells. In particular, malignant prostate epithelium may 
have lowered expression of such antioxidant enzymes [Baker et .. hssm 32(4):229-233 (1997)J. In this regard, 
reductases, arc of interest. In addition, the transcription factors. NF-kappa B and AP-1 . are known to be regulated 

35 by redox state and to affect the expression of a large variety of genes thought to be involved in the pathogenesis of 
AIDS, cancer, atherosclerosis and diabetic complications. Publications further describing this subject matter include 
Engmaaet al.. ftn..r«nr.. (Greece^ . I7-.4599-»605 (1997). Kelsey. et al.. Pr j.Cancgr . 76(7):852.4 (1997): 
Friedrich and Weiss JJkflL^-. I87(4):529^ (1997) and Pieulle. et al.. LEaasiiflL. 179(1 8):5684-92 (1997) 
Given (he physiotogical importance of redox reactions in vivo, efforts are currently being under taken to identify new. 
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native preens which arc involved m redox reacuons. We descr>be herein dlSmificauon of a novel prosuce 
specfic polypepnde which has sequence similanty to reducu.sc. designated herein as PR0853 

32. mmn 

Neurofascin .s a member of the LI subgroup of the cellular adhesion molecule (-CAM") family of nervous 
system adhesion molecules and is involved in cellular aggregation. CelKell recogmdon and paaeming of cell 
conucts have a critical role in mediating reversible assembly of a wide variety or transceUular complexes in d« 
nervous syscem. CeU interacdons may be regulated through modulation of ankyrin binding to neurofascin. See. for 
example. Tuvia c. al.. Pr... Nat Ar.d Sr i . 94(24) 12957-12962 (1997). Neurofascin has been described as a 
n«mber of the LI subgroup of the immunoglobulin superfamily implicated in neurite extension duri,^ embryomc 
development for which numerous isoforms have been detected at various stages of development. See also Hasscl e. 
al.. LJiflLCtenL. 272(45) 28742-28749 (1997). Grumct.. CslLJQs;^^ 290(2) 423^28 (1997). Garver et al 
^' ^ ' ^l ' ^03-714 (1997). and Lamben ei al.. J. Neurn...,. 17:7025-7-36 (1997).. 

Given .he physiological importance of cellular adhesion molecules and developmem of the nervous system 
in wv.. efforts are currently being tuuler taken to identify new. nanve proteins which are involved in regulauon of 
cellular interacdons in the nervous system. We describe herein the identification and characterization of a novel 
polypeptide which has sequence similarity to neurofascin, designated herein as PRO860. 



33. Esam 

The CMRF35 monoclonal antibody was used to identify a cell membrane antigen, designated CMRF35 
20 Which is present on die surface of monocytes, neutrophils, a proportion of peripheral blood T and B lymphocytes and 
lynvhocytic cell lines. THe CMRF35 cDNA encodes a novel integral membrane glycoprotein member of die 
•nununoglobulin (Ig) gene superfamily. The molecule comprises (a, a single extracellular Ig variable domain 
^niarkably similar to d« Fc receptor for polymeric IgA and IgM. (b) a membrane-proximal domain containing a high 
propomon of proline, serine and Uueonine residues diat was predicted to be heavily 0-glycosylated. (c, an'unusual 
transmembrane anchor that contained a glutamic acid and a proline residue and (d) a short cytoplasmic uil 
Transcript, encoding die CMRF35 protein have been detected in early monocytic cell lines, in peripheral blood T 
ceBs a«l m some B lymphoblastoid cell lines, confirming die results of immunocytological staining Jackson e. al 
BlLUmma 22(5):1157-1163 (1992). CMRF-35 molecules are differentially expressed in hematopoietic cells' 
aiKl the expressKjn of .he antigen was shown to be markedly influenced by stimulation wid, mitogens and cytokines 
30 See. for example. Qark e. al.. EsaJtonaillL 25(8):759 (1997). Daish et al.. bmmoL 79(1):55^3 (1993). and 
Clark et al.. Tissue Aniig«;n{; 48:461 (1996). 

Given die physiological importance of die immune system and antigens associated widi various immune 
system cells, efforu are cunemly being under taken ,o identify new. native proteins which are expressed on various 
cells of die immune system. We describe herein die idemificauon of a novel polypeptide which has sequence 
35 similarity to CMRF35. designated herein as PR0846. 



34. Pkcvua 

Lysozyme is a protein which is widely distributed in several human tissues and secretions including milk 
tears and saliva. I, has been demonstrated to hydrolyze linkages benveen N-acetylglucosamines. h has been 
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demonsiraied to be an inhibitor of chcmotaxis and of the production of toxic oxygen free radicals and may also have 
some role in the calcification process. As such, there is substantial interest in idemifymg novel polypeptides having 
homolog>' to lysozyme. We describe herein the idemificaiion of a novel polypeptide which has sequence similarity 
to lysozymc. 

5 35. 

Wni-4 is a secreted glycoprotein which correlates with, and is required for. kidney nibulogenesis. Mice 
lacking Wm-4 acuvity fail to form prctubular cell aggregates; however, other aspects of mesenchymal and ureteric 
devetopmeni are unaffected. TTuis, Wni-4 appears to act as an autoinducer of the mesenchyme to epithelial transition 
that underlies nephron development. Stark cl al.. Hms. ;372(6507):679-683 (1994). In addition, members of the 
10 Wmgene femUy code for cysteine-rich, secreted proteins, which are differentially expressed m the developing brain 
and possibly act as intercellular signaling molecules. A Wnt gene, e.g., Wni-1 is known to be essential for 
specification of the midbrain ceU fate. Yoshioka et al. , Pio<:hfm. PiopttV^ Re? ConWDW^. 203(3): 1581-1588 (1994). 
Several member of the Wnt family of secreted factors arc strongly implicated as regulators of mammary cellular 
growth and differentiation. Shimmi et al.. Cell CTPwlh Piffer. 8(12) 1349-1358. Wm-4 is normally expressed in 
15 early pregnancy. Wnt-4 may therefore be a local signal driving epithelial branching in pregnancy. Edvrards PA. 

^■>^n^v?l.^4n998) . See also. Lipschutz JH. Am , J , mm PiS - 31(3):383-397. (1998). We 
describe herein the identificaiion and characterizaton of a novel polypeptide which has sequence similarity to Wnl-4, 
designated herein as PR0864. 



20 36. EBQZ22 

At least two cell-derived signals have been shown to be necessary for the induction of immunoglobulin 
isotype switching in B-cclls. The first signal is given by either of the soluble lymphokines, imerleukin (lL)-4 or IL- 
13. which induce germlinc epsilon transcript expression, but this alone is insufficiem to trigger secretion of 
immunoglobulin E OgE). The second signal is provided by a physical interaction between B-cells and activated T- 
25 cells, basophils and mast ceUs, and it has been shown that the CD40/CD40 ligand pairing is crucial for mediating 
IgE synthesis. Additionally, amongst the numerous pairs of surface adhesion molecules that are involved in IgE 
synthesis, the CD23/CD21 pair appears to play a key role in the generauon of IgE CD23 is a protein that is 
positively and negatively regulated by factors which increase or decrease IgE production, respectively. Antibodies 
to CD23 have been shown to inhibit IL^-induced human IgE production in vitro and to inhibit antigen-specific IgE 
30 responses in a lai modd. in an isotype selective mamier (Bonnefoy et al.. fm Pespir , J , SwpL 22:63S-66S (1996)). 
CD23 interacts with CD21 on B^:clU. preferemially driving IgE production. Given that the CD23 protein plays an 
extremely important role in the induction of a mammalian IgE response, there is significant interest in identifying 
novel polypeptides having homology to CD23. We herein describe the identification and characterization of a novel 
polypepude having homology to CD23. designated herein as PR0792. 



35 



37. YBSSSik 

Mindin and spondin proteins are secreted pi«ein$ that arc strucnirally related to one another and which have 
been identified in a variety of organisms. For example. Higashijima et al.. DsjlEiqL 192:21 1-227 (1997) have 
reported the identification of spondin and mmdin expression in floor plate cells in the zebrafisb embryonic axis. 
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thereby suggesting that mindin and spondin prtoieins play imponant roles in embryonic development. This same 
gnnip has reported that mmdu, and spondin proteins function as extracellular matrix proteins that have a high affinit)- 
for the basal lamna. (Id., I. has been reponed that F-spondm is a secreted protein that promotes neural adhesion 
and neuritc extension (Klar et al.. Cd! 69:95-1 10 (1992) and that M-spondin is an extraceUular matrix protem that 
localizes to muscle attachment sues m Drosophila (Umemiya et al.. Dev Biol 186:165-176 (1997)). TTius. there 
is significam inieest in idendfyir« novel polypepddes having homology to the miiulin and spondin proteins. We herein 
describe the identification and characterization of a novel polypeptide having homology to mindin2 and mindinl. 
designated herein as PR0866. 

38. EBOm 

Cyclophilins are a family of proteins that bind to cyclosporin A and possess peptidyl-prolyl cis-irans 
isomerase activity (Sherry e. al., Prgg, N„| , Acad ^ri US^ 95:1758-1763 (1998)). In addition, cyclophilins are 
secreted by activated cells and act in a cytokine-like manner, presumably via signaling through a cell surface 
cyclophilin receptor. Host cell-derived cyclophilin A has been shown to be incorporated into HIV-l virions and iu 
incoiponidon has been shown to be essential for viral infectivit>'. TTius. one or more the cyclophilins may be directly 
associated with HIV-l infccvity. Given the obvious importance of die cyclophilin proteins, there is substantial 
interest in identifying novel polypeptides which have sequence homology to one or more of the cyclophilin proteins. 
We herein describe the idcntificanon and characterization of a novel polypeptide havii« homology to cyclophilin-like 
protein CyP-60, designated herein as PR0871. 

20 39. mom 

Enzymatic proieins play important roles in die chemical reactions involved in the digestion of foods, the 
biosynthesis of macromolecuies. the controlled release and utilization of chemical energy, and other processes 
necessary id sustain life. Enzymes have also been shown to play important roles in combating various diseases and 
disorders. For example, bver carboxylestcrases have been reported to assist in sensitizing human tumor cells to the 

25 cancer pnxinigs. Danks ei al.. repon that stable expression of the cDNA encoding a carboxylesterase in Rh30 human 
thabdoniyosarcoma cells increased the sensitivity of the cells to the CPT-1 1 cancer prodrug 8.1-fold. fiauSELBa. 
(1998) 58(I):20-22. The authors propose that this prodrug/enzyme combination could be exploited tiierapeutically 
in a manner analogous to approaches currentty under investigation witi, the combinations of ganciclovir/herpes 
simplex virus thymidine kinase and 5-fluorocytosinc/cytosine deaminase, van Pelt <r al. demonstrated that a 55 kD 

30 human Bver carboxylesterase inhibits the invasion of Plasmodium falcipanim malaria sporozoites into primary human 
hepatocytes in culnire. LHsmA (1997) 27(4):688-698. 

Carboxylestcrases have also been found to be of importance in the detoxification of drugs, pesticides and 
other xenobiotics. Purified human liver carboxylestcrases have been shown to be involved in die metabolism of 
various dnigs including cocaine and heroin. Prindel et al. describe the purification and cloning of a broad substrate 

35 specificity human liver carboxylesterase which catalyzes the hydrolysis of cocaine and heroin and which may play 
an impoitam lole in the degiadarion of diese drugs in human tissues. J. Biol. Ch.n, (1997) 6:272(23):14769-I4775 
Brzenziaski describe a spectrophotometric competitive inhibition assay used to identify dnig or environmental 
esters that are metabolized by carboxylestcrases. Drug Metah rti.pn^ (1997) 25(9): 1089-1096. 



18 



eNSOOCIO: <wo 



• PCT.1.IS99/05028 

In light of the important physiological roles played by carboxylesterascs. efforts arc being undenaken by 
both industry and academia to idenrify new. native carboxylesterase homologs. We herein describe the idenuficauon 
and characterization of a novel polypeptide having homology to carboxylesterase. designated herein as PR0873. 

40. PRW40 

5 CD33 is a cell-surface protein that is a member of the sialoadhesin family of proteins that arc capable of 

mediating sialic-acid dependent binding with distinct specificities for both the type of sialic acid and its linkage to 
subterminal sugars. CD33 is specifically expressed in early myeloid and some monocyte cell lineages and has been 
shown to be strongly associated with various n^-eloid tumors including, for example, acute non-lymphocytic leukemia 
(ANLL). As such. CD33 has been suggested as a potential target for the ireaunent of cancers associated with high 

10 level expression of the protein. There is. therefore, significant interest in the identification of novel polypeptides 
having homology lo CD33. In fact, one CD33 homolog (designated CD33L) has already been identified and 
described (see Takei et al.. ry.np^n^. r^ll Genet. 78:295-300 (1997)). We herein describe the identifKation of 
another novel polypeptide having homology to CD33. designated herein as PRO940. The novel polypeptide described 
herein also exhibits significant homology lo the human OB binding proteins designated HSU71382_1 and 

15 HSU71383_1 in the Dayhoff database (version 35.45 SwissProt 35). 

41. ^Bom 

Cadherins are a large femlly of transmembrane proteins. Cadherins comprise a family of calcium-dependent 
glycoproteins that function in mediating cell-cell adhesion in virtually all solid tissues of multicellular organisms. 

20 At least cadherins 1-13 as well as types B. E. EP. M. N, P and R have been identified and characterized. Among 
the ftmctions cadherins are known for, with some exceptions, are that cadherins panicipate in cell aggregation and 
are associated wid. cell-cell adhesion sites. Recently, it has been reported that while all cadherins share multiple 
repeats of a cadherin specific motif believed to correspond to folding of extracellular domains, members of die 
cadherin superfamily have divcrgem structures and. possibly, functions. In particular it has been reported that 

25 members of the cadherin stverfennly are involved in signal transduction. See. Suzuki. } Cell BiOChcm -. 61(4):531- 
542 (1996). Cadherins arc ftmher described in Tanihara ct al.. J, Cell Sci-. 107(6): 1697-1704 (1994). Aberle et al.. 
I rril Biochem .. 61(4):514-523 (1996) and Tanihara et al.. Ctll A<ttl65. Cotnmun-. 2(l):15-26 (1994). We herein 
describe the identification and characterization of a novel polypeptide having homology to a cadherin protein, 
designated herein as PR0941. 

30 

42. PR0944 

Clostridium perfringens enterotoxin (CPE) is considered to be the virulence factor responsible for causing 
the symptoms of C. perfnngens type A food poisoning and may also be involved in other human and veterinary 
illnesses (McClane. Toxicon. 34:1335-1343 (1996)). CPE carries out its adverse cellular functions by binding to an 
35 approximately 50 kD ceU surface receptor protein designated the Clostridium perfringens enterotoxin receptor (CPE- 
R) to form an approximately 90.000 kD complex on the surface of the cell. cDNAs encoding the CPE-R protein have 
been identified characterized in both human and mouse (KataWra ct al.. J. Cell Biol. 136:1239-1247 (1997) and 
Katahiiaetal.. J. Biol. Chem. 272:26652-26658 (1997)). Since the CPE toxin has been reported to cause a variety 
of illnesses in mammalian hosts and those iltoesses arc initiated by binding of the CPE toxin to the CPE-R. diere is 
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sigDiJicam inures. In idennfymg novel CPE-R homologs. Wc herein describe chSificauon and characcrizauon 
of a novel polypepude having homology to ihe CPE-R. designated herein as PR0944. 

43. PR091H 

Membrane-bound proteins include not only cell-surface membrane-bound proteins, but also proteins that are 
found on the surface of intracellular vesicles. Tl^se vesicles are involved m exocyiosis. which i, die fusion of 
secretory vesicles with the cellular plasma membrane, and have two main functions. One is the discharge of the 
vesicle contents into .he extracellular space, and .he second is the incotporation of new proteins and lipids into the 
plasma membrane itself. Exocytosis can be either constimUve or regulated. All eukaryotic cells exhibit constinitive 
exocytosis. which is marked by the immediate fusion of .he secretory vesicle al^er formation. In contrast, regulated 
exocytosis result in the accumulation of the secretory vesicles that fiise with the plasma membrane upon receipt of 
an appropriate signal by vcsicle-associated membrane proteins. Usually, aiis signal is an increase in the cytosolic 
free Ca'* concentration. However, regulated exocytosis that is independent of * has been reported (see eg 
Fujita-Yoshigaki « al. ), m , Chrm (1996) 31:271(22):13130-I3134). Regulated exocytosis is crucial to many 
specialized cells, including neurons (neurotransmitter release from synaptic vesicles), adrenal chromaffin cells 
(adrenaline secretion), pancreatic acinar ccUs (digestive enzyme secretion), pancreatic P-cells (insulin secretion) mast 
cells (hiscunine secretion), mammary cells (milk protein secretion), sperm (enzyme secretion), egg cells (creation 
of fertilization cmrelope) and adipocytes (insertion of glucose transporters imo die plasma membrane). 

Disorders involving exocytosis are known. For example, inflammatory mediator iclease fiom mast cells 
leads to a variety of disorders, including astiuna. Similarly. Chediak-Higashi Syndrome (CHS) is a nii« autosomal 
recessive disease in which neutrophils, monocytes and lymphocytes contain giam cytoplasmic granules. Accordingly, 
the proteins involved in exocytosis are of paramount interest and efforts are being undertaken by both industry and 
academia to identify new. vesicle-associated proteins. For example. Skehel « al. identified a 33-kilodaIton membrane 
protein in Aplysia. termed VAP.33. which is required for the exocytosis of neurotransmitter. SfiifiDCfi (1995) 
15:269(5230): 1580-1583. and Nguropharmago l m (1995) 34(11):1379-1385. Many efforts are focused on die 
screening of mammalian recombinant DNA libraries to identify the coding sequences for novel vesicle- associated 
membrane proteins. It is an object of die invemion to provide proteins having homology to die vesicle associated 
protein. VAP-33. designated herem as PR0983. 
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44. PRO10S7 

Proteases are enzymatic proteins which are Involved in a large number of very important biological 
processes in mammalian and non-mamnialian organisms. Numerous differem protease enzymes from a variety of 
different mammalian and non-mammalian organisms have been bodi identified and characterized. IT* mammalian 
protease enzymes play important roles in many different biological processes including, for example, protein 
digestion, activation, inacdvation. or modulation of peptide hormone activity, and alteration of d» physical properties 
35 of proteins and enzymes. 

In light of die important physiological roles played by protease enzymes, efforts are cuneraly being 
undertaken by bodi industry and academia to identify new. native protease homologs. Many of diesc efforts are 
focused on die screening of mammalian recombinant DNA libraries to identify the coding sequences for novel " 
secreted proteins. Examples of screening mediods and techniques are described in die literamre (see. for example. 
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Klein ei al.. Pr>v N=»l Ac^. Sci. . 22:7108-7113 (1996): U S. Patent No. 5,536.637)1. We heron describe the 
identification of novel polypeptides having homology to various protease enzymes, designated herein as PRO1057 
polypeptides. 

45. PRO1071 

5 Thrombospondin-1 is a trimeric high molecular weight glycoprotein that is released from platelet alpha- 

granules in response to thrombin siimulalion and that is also a transient component of the extracellular matrix in 
developing and repairing tissues (Adams. Int \ BlWhcm. CcH BiOl- 29:861-865 (1997) and Qian ci al.. 
F,p Biol Med. 212:199-207 (1996)). A variety of factors regulate ihrombospondin expression and the protein is 
degraded by both extraccUular and iraraceUular routes. THrombospondin-l functions as a cell adhesion molecule and 

10 also modulates cell movement, cell proliferation, neuriie outgrowth and angiogenesis. As such, diere is substantial 
interest in identifying novel polypeptides having homology to ihrombospondin. We herein describe the identification 
and characterization of a novel polypeptide having homology to thrombospondin. designated herein as PRO1071 . 

46. PRO1072 

15 Studies have reponed that the redox state of the cell is an important determinant of the fate of the cell. 

Furthermore, reactive oxygen species have been reponed to be cytotoxic, causing inflammatory disease, mcluding 
dssue necrosis, organ failure, atherosclerosis, infertility, birth defects, premature aging, mutations and malignancy. 
Thus, the control of oxidation and reduction is important for a mimber of reasons, including the control and 
prevention of suokes. heart attacks, oxidative stress, hypertension and may be associated with the developmem of 

20 maligmmcies. TTie levels of antioxidam enzymes, such as reductases, which catalyze the conversion of reactive 
oxygen species lo water have been shown to be low in cancer cells. In particular, malignant prostate epithelium may 
have lowered expression of such antioxidant enzymes [Baker et al.. E» 32(4):229-233 (1997)1. In this regard, 
reductases, are of interest. In addition, the transcription factors. NF-kappa B and AM . are known to be regulated 
by redox state and to affect the expression of a large variety of genes tiiought to be involved in tite pathogenesis of 

25 AIDS, cancer, atherosclerosis and diabetic complications. Publications further describing this subject matter inchide 
Eagmaneial.. Anri..n.Pr Re.. (Greece^ . 17:4599-4605 (1997). Kelsey. et al.. fiU^Caocsi. 76(7):852-854 (1997); 
Friedrich and Weiss. I Thenr Biol. . 187(4):529^ (1997) and Pieulle. et al.. L^SSSDSL, 179(18):5684-92 (1997). 
Given the physiotogical importance of redox reactions in v,Vo. efforts are currenUy being under taken to Identify new. 
native proteins which are involved m redox reactions. We describe herein the idemification of a novel polypeptide 
30 which has sequence similarity to reductase enzymes, designated herein as PRO1072. 

47. mmm 

Protein disulfide isomerase is an enzymatic protem which is involved in the promotion of correct refolding 
of proteins tiuough the establishment of correct disulfide bond formation. Protein disulfide isomerase was initially 
35 identified based upon its abiUty to catalyze the renawrauon of reduced denamred RNAse (Goldberger et al.. J. Biol. 
Chem. 239:1406-1410 (1964) and Epstein et al.. Cold Sprnng Harbor Symp. Quant. Biol. 28:439-»49 (1963)). 
Protein disulfide isomerase has been shown to be a residcm enzyme of the endoplasmic reticulum which is retained 
in die endoplasmic reticulum via a -KDEL or -HDEL amino acid sequence at its C-ierminus. 

Given die importance of disulfide bond-fonmng enzymes and their potential uses in a ramiber of diffcrcm 
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applicanons. for example in increasing chc yield of correct refolding of recombinantly produced proteins, efforts are 
cunemly bemg undertaken by both industry and academia to idenuiy new. «,tive proteins having homology to protein 
disulfide isomcrase. Many of these effons are focused on the scrcemng of mammalian recombinant DNA libraries 
to identify the coding sequences for novel protein disulfide isomerase horaologs. Examples of screening methods and 
techniques arc described in the litcnmire Isec. for example. Klein et al.. Proc. Na.l Ar.rt 9.; 21:7108-71 13 (1996); 
U.S. Patent No. 5.536.637)]. We herein describe a novel polypeptide having homology to protein disulfid^ 
isomerase, designated herein as PROI07S. 

48. EROm 

In Drosophila. the dorsal-ventral polarity of the egg chamber depends on the localization of the oocyte 
nucleus and the gurken RNA to the dorsal-anterior comer of the oocyte. Gurken protein presumably acts as a ligand 
for the drosophila EGF receptor (loipedo/DER) expressed in the somatic follicle cells surrounding die oocyte. 
Cornichon is a gene required in the gcrmline for dorsal-ventral signaling (Roth et al . ££ll 81:967-978 (1995)). 
Comichon. gurken and torpedo also fimction in an eariier signaling event that establishes posterior follicle cell fates 
and specifies the anterior-posterior polarity of the egg chamber. Mutations in any or all of these genes prevent the 
formauon of a correctly polarized microtubule cytoskeleton required for proper localization of the anterior and 
posterior determinants bicoid and oskar and for the asymmetric positioning of the oocyte nucleus. TTius. it is clear 
that the comichon gene product plays an important role in early development. We herein describe the identification 
and characierizaiion of a novel polypepndc having homology to the comichon protem. designated herein as PROl 81. 

20 49. PR019S 

Efforu arrc currently being undertaken to identify and characterize novel transmembrane proteins. We 
herein describe the identification and characterization of a novel transmembrane polypeptide, designated herein as 
PR0195. 

25 SO. PRCM65 

Efforts ane currently being undertaten to identify and characterize novel secreted proteins. We herein 
describe the identification and characterization of a novel secreted polypeptide, designated herein as PR0865. 



15 



30 



35 



51. PROg27 

VLA-2 is an cell-surface integrin protein that has been identified and characterized in a number of 
mammalian organisms, including both mouse and human. VLA-2 has been shown to be a receptor on die surface 
of cells for echovims-1 (EV-1) which mediates infection of VLA-2-expressing cells by EV- 1 (Zhang et al.. Viroloyy 
235(2):293-301 (1997) and Bergelson et al.. Ssisnss 255:1718-1720 (1992)). VLA.2 has also been shown to mediate 
the intcracuon of collagen with endothelium during in vitro vascular nibe formation (Jackson et al.. Cell Biol Tm 
18(9):859-867 (1994)). Various oUier imegrin proteins that share various degrees of amino acid sequence homology 
with VL\-2 have been identified and characterized in a variety of mammalian orgamsm. These iniegrins have been 
reported to play imponam roles in a variety of different physiological ftincUons. "nierefore. there is significam 
interest in identifying novel polypeptides having homology to one or more of the integnn proteins. We herein 
<tecribe the identification and characterization of a novel polypeptide having homology to VLA-2 integrin protein. 
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52. mnm 

Many important cytokine proteins have been identified and characterized and shown to signal through 
specific cell surface receptor complexes. For example, the class II cytokine receptor family (CRF2) indtides the 

5 interferon receptors, the interleukin-10 receptor and the tissue factor CRFB4 (Spencer « al.. 1 , RXP- Mcd. 187:571- 
578 (1998) and Kotenko et al., EMBQJ. 16:5894-5903 (1997)). Thus, the multimdc of biological activiUes exhibited 
by the various cytokine proteins is absolutely dependent upon the presence of cytokine receptor proteins on the 
surface of target cells. There is. therefore, a significant interest in identifying and characterizing novel polypeptides 
having homology to one or more of the cytokine receptor family. We herein describe the identification and 

10 characterization of a novel polypeptide having homology to cytokine receptor family-4 proteins, designated herein 
asPROUn. 

Interferons (IFNs) encompass a large family of secreted proteins occurring in venebratcs. Although they 
were originaUy named for their antiviral activity, growing evidence supports a critical role for IFNs in cell growth 
and differendation (Jaramillo et al.. C.ncrr iTlvtMieMiW 13(3):327.338 (1995)). IFNs belong to a class of negative 

15 growth factors having the ability to inhibit the growth of a wide variety of cells with both normal and transfonned 
phenotypes. IFN therapy has been shown to be beneficial in the treamiem of human malignancies such as Karposi's 
sarcoma, chronic myelogenous leukemia. non-Hodgkin's lymphoma, and hairy ceU leukemia as well as in the 
treamiem of Infectious diseases such as hepatitis B (CJamliel et al.. ,ScanniW MiCTOWPY 2(1):485^92 (1988). 
Einhorn et al.. Mrf ^nnrl ft Tv— ""'"^'^"■h^^ «>:25-29 (1993). Ringenberg et al.. Wmmu Mriit i nC 

20 85(l).21-26 (1988). Saracco et al.. ImiTTinlr^^-- ■n»v «nd Heoatologv 10:668-673 (1995). Gonzalez-Mateos 

et al.. H n"'"-""""""""'"*^ 42:893-899 (1995) and Malaguamera et al.. PlwmiacmhcragY 17(5):998-1005 
(1997)). 

taterferons can be classified into two major groups based upon their primary sequence. Type I mterferons. 
IFN^ and IFN-P. are encoded by a superiamily of intronless genes consisung of the IFN-« gene family and a single 
25 IFN-P gene that are thought to have arisen from a common ancestral gene. Type I interferons may be produced by 
most cell types. Type n IFN. or IFN-Y. is restri«ed to lymphocytes (T cells and namral killer cells) and is stimulated 
by nonspecific T cell activators or specific antigens in vivo. 

Although both type I and type n IFNs produce similar antiviral and antiproliferative effects, they act on 
distinct cell suifece receptors, wherein the binding b generaUy species specific (Unger et al.. ItrnnVHlOl . Todav 9:393- 
30 400 (1988)). Both IFN-a and IFN-P bind competitively to the same high affinity type I receptor, whereas IFN-Y 
bind, to a distinct type II receptor. The presence and number of IFN receptors on die surface of a cell does not 
generally reflect the sensitivity of the cell to IFN. although it is clear that the effects of the IFN protein is mediated 
through binding to a cell surface interferon receptor. As such, the identification and characterization of novel 
interferon receptor proteins is of extreme interest. 
35 We herein describe the identification and characterization of novel imerferon receptor polypeptides, 

designated herein as "PROllM interferon receptor" polypeptides. TTius. die PR01114 polypeptides of the presem 
invendoQ represents a novel cell surface inlerferon receptor. 
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53. PRQ237 

Carbonic anhydrasc is an enzyn,a«c protein ^: which aids carbon diox.de transpon and release in the 
manunahan blood system by catalyztng the synthesis (and the dehydration) of carbonic acd from (and to) carbon 

reacttons .n the mammal. As such, there is signiftcant interest in the identification and charactcrizauon of novel 
pcDT^ptides having homology to carbonic anhydrase. We herein describe the identification and charaaerizaUon of 
a novel polypeptide having homology ,o carbomc anhydrase. designated herein as PR0237. 

54. Eaos^ 

Numerous trypsin inhibitory proteins have been identified and characterized (see. e.g Yamakawa et al 
10 B i flCh i m, BionhY5 ^m 1395:202-208 (I998) and Mizuki et al.. MammalkoG^ 3:274-280 (.992)) Trypsi.; 
.nh.bi.or proteins play impor«n. roles in a vartety of different physiological and bio.og.ca, pathways and are 
specifically Involved in such processes as the regulation of protein degradation, digestion, and the like Given d,e 
unportantrolespuyedby suchenzymaticproteins. there is significant interest in identi(ymg and characterizing novel 
^.ypeptides having homology to known trypsin inhibitor proteins. We herein describe the identification and 
charactenzanon of a novel polypeptide having homology to a trypsin inhibitor prote.n. designated herein as PR0541. 

55. PR0273 

leukocytes include monocytes, macrophages, basophils, and eosim,phils and pUy an important role in .he 
unmuneresponse. T^«e cells are important in the mechanisms inifiated by T and/or B lymphocytes a«l sec«^ 
20 range of cytokines which recruit and activate other inflammatory cells and contribute to tissue destruction 

Thus, investigation of the regulatory processes by which leukocytes move to their appropriate destination 
and interac. with other cells is critical. Currently, leukocytes are thought to move from the blood to injured or 
inflamed tissues by rolling along the endothelial cells of .he blood vessel wall. "Hus movement is mediated by 

infractions between selectins and U.eir ligands. Next, die leukocyte mus. move through die vessel wall and 
u«o the ..ssues. TOs diapedesis and extravasauon srep involves cell activauon which promotes a more suble 
Icukocyte-endothelial cell interaction, again mediated by integrins and their ligands. 

Chemokines are a large famUy of sirucmrally related polypeptide cytokines. Hiese molecules stimulate 
leukoc^ movement and may explain leukocyte trafficking in differem inflammatory sin^uons. Chemokines mediate 
.he egression of particular adhesion molecules on endod«lial cells, and they produce chemoactractam, which activate 
specific cell types, in addition, the chemokines stimulate proliferation and regulate activation of specific cell types 
In both of diese activities, chemokines demonstrate a high degree of target cell specificity 

THe chemokine family is divided into two subfamilies based on whether two amino tenninal cysteine residues 
are mnnediately adjacent (C-C) or separated by one amino acid (C-X-C). Chemokines of the C-X-C family generally 

-----P^lsandfibroblastswhilemeC-Cchemokinesactonamorediversegroupoftargetcel^^ 
35 monocytes/macrophages, basophils, eosinophils and T lymphocytes. TTie known chemokines of bod, subfamilies are 
synthes.zed by many diverse cell O'pes as reviewed in Thomson A. (1994) TTie Cycokme Handbook 2 d Ed 
Academic Press. N.Y. Chemokines are also reviewed in Schall TJ (.994) Chemouciic Cytokmes: Targets for 
TherapeutK: Development. International Business Communications. Soudiborough Mass. pp ,80-270: and in Paul 
WE (1993) Fundamental Immunology. 3rd Ed. Raven Press. N.Y. pp 822-826. 
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Known chemokinrs of the C-X-C subfamily include macrophage intlanunaiory proteins alpha and beta 
(MlP-1 and MIP-2 ). interleukin-8 (lL-8), and growth regulated protein fGRO-alpha and bcu). 

MlP-2 was first identified as a 6 kDa heparin binding protein secreted by the mouse nacrophagc cell Imc 
RAW 264.7 upon somuladon with lipopolysaccharide (LPS). MIP-2 is a member of the C-X-C (or CXG subfamily 
of chemoldnes. Mouse MIP-2 is cheraoiactic for human neutrophils and induces local neutrophil infiltration when 
5 injected into the foot pads of mice. Rat MIF-2 shows 86% amino acid homology to the mouse MIP-2 and is 
chemotactic for rat neutrophils but does not stimulate migration of rat alveolar macrophages or human peripheral 
blood eosinophils or lymphocytes. In addidon. the rat MIP-2 has been shown to stimulate proliferation of rat alveolar 

epithelial cells but not fibroblasts. 

Curtem techniques for diagnosis of abnormalities in inflamed or diseased issues mainly rely on observation 

10 of clinical symptoms or serological analyses of body tissues or fluids for hormones, polypeptides or various 
metabolites. Problems exist with these diagnostic techniques. First, patients may not manifest clinical symptoms at 
early stages of disease. Second, serological tests do not always differcmiatc between invasive diseases and genetic 
syndromes. Thus, the identification of expressed chemokines is important to the dcvelopmem of new diagnostic 
techniques, effective therapies, and to aid in the understanding of molecular pathogenesis. 

15 To date, chemolones have been implicated in at least the following conditions: psoriasis, inflammatory 

bowel disease, renal disease, arthritis, immune-mediated alopecia, stroke, encephalitis. MS. hepatitis, and others, 
to addidon. non-ELR-comaining chemokines have been implicated m the inhibition of angiogcnesis. thus indicating 
that these chemokines have a rule in tumor vascularization and tumorigencsis. 

Thciefoit it is the object of diis invendon to identify polypeptides and nucleic acids encoding the same which 

20 have sequence identity and similarity wiUi cytokine-induced neutrophil chemoaaractants. MIP-1 . MlP-2. and other 
related proteins. The efforts of this object are provided herein. 

56. fRO701 

Beta neurexins and neuroligins are plasma membrane proteins that are displayed on the neuronal cell surface. 

25 NeuroUgin 1 is enriched m synaptic plasma membranes and acts as a splice site-specific ligand for beu neurexins as 
described in Ichichenko. et al.. Cdl. 81(3):435-443 (1995). The extracellular sequence of neuroligin 1 is composed 
of a catalytically inactive esterase domain homologous to acetylcholinesterase. Neuroligin 2 and 3 are similar in 
structure and sequence to neuroligin 1. All neuroUgins contain an N-terminal hydrophobic sequence with the 
characteristics of a cleaved signal peptide followed by a large esterase homology domain, a highly conserved single 

30 transmenttane region, and a short cytoplasmic domain. The three neuroligins are alternatively spliced at the same 
position and are expressed at high levels only in die brain. Tight binding of the three neuroUgins to beta neurexins 
is observed only for beta neurexins lacking an insert in splice site 4. Thus, neuroligins constiwte a multigene family 
of brain-specific proteins with distinct isoforms that may have overlapping Junctions in mediating recognition 
processes between neurons, see Ichtchenko. et al.. L-Biffl, Chcm.. 27 1(5) =2676-2682 (1996). Moreover, neurexins 
35 and neuroligins have been reported as functioning as adhesion molecules in a Ca=* depcndem reaction that is regulated 
by akematWe splicing of beta neurexins. i.e.. see Nguyen and Sudhof. f PioL Chem. . 272(4l):26032.26039 (1997). 

Given the foregoing, membrane bound proteins arc of interest. More generally, membrane-bound proteins 
and receptors can play an important lole in die formation, differentiation and maintenance of multiceUular organisms. 
The fate of many individual cells, e.g.. proliferation, migration, differemiation, or mteracrion with oOier cells, is 
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typically governed by mfomauon received from other cells and/or the inunediate eSmcm This mformation .s 
often transmuted by secreted polypeptides (for .nstance. mitogenic factors, survival factors, cytotoxic factors, 
differentiation factors, neuropeptides, and hormones) *hich are. in mm. received and imerprcied by diverse cell 
receptors or membrane-bound prottins. Such membrane-bound protcms and cell receptors include, but are not limited 
to. cytokine receptors, receptor kinases, receptor phosphatases, receptors im^olvcd in cell-cell interactions, and 
ceUular adhesin molecules like selectins and integrins. For instance, transduction of signals that regulate cell growth 
and differemiaiion is regulated in part by phosphorylation of various ceUular proteins. Protein tyrosine kinases, 
enzymes that caiaiyze thai process, can also act as growth fector receptors. Examples include fibroblast growth factor 
receptor and nerve growth factor receptor. 

Membrane-bound proteins and receptor molecules have various industrial applications, mcluding as 
phaimaceurical and diagnosdc agents. Receptor immunoadhesms. for instance, can be employed as therapeutic agents 
to block recepior-Ugand interaction. Hie membrane-bound proteins can also be employed for screening of potential 
peptide or small molecule inhibitors of the relevant rcceptor/ligand interaction. 

Efforts are being undertaken by both industry and academia to identify new. native membrane-bound 
receptor proteins, particularly diose having sequence identity and/or similarity wid, neuroligins 1. 2 and 3. Many 
efforts an: focused on the screening of mammalian recombinant DNA libraries to identify die coding sequences for 
novel secreted and membrane-bound receptor proteins. Examples of screening mediods and techniques are described 
in the literature [see. for example. Klein et al.. PrPC. Natl. ACi»d ,Sri . 22:7108-7113 (1996); U.S. Patent No. 
5.536.637)]. The results of such efforts are provided herein. 
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20 57. PRO704 

VIP36 is localized to die Golgi apparatus and die cell surface, and belongs to a family of legume lectin 
homok?gues in the animal secretory padiway diat might be involved in die trafficking of glycoproteins, glycolipids. 
or both. It is fimher believed that VIP36 binds to sugar residues of glycosphingolipids and/or gycosylphosphatidyl- 
inositol anchors and might provide a link between die extracellular/luminal face of glycolipid rafts and the cytoplasmic 
protein segregation machinery. Funhcr regarding VIP36. it is beUeved dat there is a signal at its Ctermmus d«t 
matches an miemalization consensus sequence which confers its ability to cycle between the plasma membrane and 
Golgi. See. Fiedler. etal.EMBQl.. 13(7):1729-1740 (1994); Fiedler and Simons. L£ea^.. I09(l):27i.276 
(1996); Idn. et al.. MBQi.. 14(10):2250.2256 (1995). I, is believed diat Vn»36 is eidier die same as or very closely 
related to die human GP36b protein. VIP36 and/or GP36b arc of interest. 

More generally, vesicular, cytoplasmic, extracellular and membrane-bound proteins play importam roles 
in die formation, differentiation and maintenance of multicellular organisms. The fate of many individual cells, e.g., 
proliferation, migianon, differcnnadon. or interaction wid, odicr cells, is typically governed by information revived 
from odw cells and/or die immediate environmem. This information is often iransmined by secreted polypeptides 
(for instance, mitogenic factors, survival factors, cy.otoxic factors, differentiation factors, neuropeptides, and 
honnones) which are. in turn, received and imerpreted by diverse cell receptors or membrane-bound proteins. These 
secreted polypeptides or signaling molecules normally pass dirough die cellular secretory padiway to reach dieir site 
of action in die extracellular environmem. usually at a membrane-bound receptor protein. 

Secreted pn>teins have various industrial applications, including use as pharmaceuticals, diagnostics, 
biosensors and Woreactors. to fact, most protein dmgs available at presem. such as dirombolyac agems. interferons! 
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mierleuldns. erythropoieiins. colony s.imulaiing factors, and various other cytokines, are secretory proteins. Their 
receptors, which arc membrane-bound proteins, also have potential as therapeutic or diagnostic agents. Receptor 
immunoadhesins. for instance, can be employed as therapeutic agents to block reccptor-ligand interaction. 
Membrane-bound proteins can also be employed for screening of potential peptide or small molecule inhibitors of 
die relevant receptor/Ugand iraeracnon. Such membrane-bound proteins and cell receptors include, but are not limited 
5 to. cytokine receptors, receptor kinases, receptor phosphatases, receptors involved m celKell interacnons. and 
cellular adhesin molecules like seleciins and integnns. Transduction of signals that regulate cell growth and 
differentiation is regulated in pan by phosphorylation of various cellular proteins. Protein tv rosine kinases, enzymes 
that catalyze that process, can also act as growth factor receptors. Examples include fibroblast growth factor receptor 
and nerve growth factor receptor 
10 Efforts arc being undertaken by both industry and academia to identify new. native vesicular, cytoplasmic, 

secreted and membrane-bound receptor proteins, panicularly those having sequence identic and/or similarity with 
VIP36. Many efforts are focused on ihe screening of mammalian recombinant DNA libraries to identify the coding 
sequences for novel secreted and membrane-bound receptor proteins. Examples of screerang methods and techniques 
are described m Ihe litcranire (see. for example. Klein et al.. Pmr N;,i| AmO, Sti, . 22:7108-7113 (1996): U.S 
15 Patent No. 5.536.637)]. 



58. sBsm 

Acid phophatase proteins arc secreted proteins which dephophorylate terminal phosphate groups under acidic 
pH conditions. Acid phophatases contain a RHGXRXP amino acid sequence, which is predicted to be mechanistically 
20 significant. Acid phosphatases may have important functions in the diagnosis and treatment of human diseases. For 
example, prostatic acid phosphatase is a secreted protein uniquely expressed in prosutic tissue and prostate cancer. 
The level of prostatic acid phosphatase is a potential prognostic factor for local and biochemical control in prosttte 
cancer patients treated with radiotherapy, as described in Lankfotd ei al.. Iffl ] Radiat . OnCftl , Biftl. PhVS. 38(2): 
327-333 (1997). Research suggests that a cellular immune response to prostatic acid phosphatase may mediate 
25 destructive autoimmune prostatitis, and that xenogeneic forms of prostatic acid phosphatase may prove useful for 
immunotherapy of prostate cancer. See Fong ct al.. UmnWDfiL 169(7): 3113-3117 (1997). Seminal prostatic acid 
phosphatase levels correlate significantly with very low sperm levels (oligospermia) in individuals over 35. see Singh 
et al.. HinMnnre Med. J. 37(6): 598-599 (1996). Thus, prostatic acid phosphatase has been implicated in a variety 
of human diseases, and may have an important function in diagnosis and dicrapy of these diseases. A series of 
30 aminobcnzylphosphatic acid compounds are highly potem inhibitors of prostatic acid phosphatase, as described in 
B.^>t«l pinnrg Med Them. 4(10): 1693-1701 (1996). 

More generally, extracellular proteins play an important role m the formanon. differentiarion and 
maintenance of multicellular organisms. The fate of many individual cells, eg., proliferation, migration, 
differentiation, or interaction with other cells, is typically governed by information received from other cells and/or 
35 the immcdiaie emrironmcm. This information is often transmitted by secreted polypeptides (for instance, mitogcmc 
factors, survival fiictors. cytotoxic factors, differentiation factors, neuropeptides, and hormones) which arc. m mm. 
received and interpreted by diverse cell receptors or membrane-bound protems. These secreted polypeptides or 
signaling molecules normally pass through die cellular secretory pathway to reach dieir site of action in the 
exiracelhilar environment. 
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Secreted protems have various mdustrial applicauons. mcluding phaonacemicals. d.agnos..cs. b.oser«ors 
and b.oreactors. Most protein drugs available at ptesem. such as thrombolyuc agents, interferons, .mcrleukins 
erythropoietins, colony stimulating factors, and various other cytokines, are secretory proteins. Tl,e,r receptors' 
Which are membrane protems. also have potential as therapeutic or diagnostic agents. Efforts are being undertaken 
by both mdustry and academia to identify new. ™„ive secreted proteins, particularly those having sequence .demity 
with prostate acid phosphatase precursor and lysosomal acid phosphatase precursor and in some cases, those having 
.dennty with DNA found in fetal hean. Many efforts are focused on the screening of mammalian recombmant DNA 
hbranes ,o idendfy the coding sequences for novel secreted proteins. Examples of screening methods a™i techniques 
are described in the H.eramre (see. for example. Klem et al.. Proc. N.,. c.; 22:7108-7113 (1996)- U S 
Patent No. 5.536,637)]. ' ' 

59. PRO707 

Cadherins are a large famUy of transmembrane proteins A. least cadherins 1-13 as well as types B E EP 
M. N. P and R have been characterized. Among the fimctions cadherins are known for. with some exceptions' 
cadhenns panicipate m cell aggregation and are associated with cell-ceil adhesion sites. Cadherins are furthei 

2(l).I5-26 (1994). Moreover, i, has been reported tha, some members of .he cadherin superfamily are involved m 
g«eral ceU<eU i«enK:uon processes including transduction. See. Suzuki. J. Cel. Rin.H.^ 61(4):531.54. (.996) 
Therefore, novel members of the cadberin superfamily are of interest. 

More generaUy. all novel proteim are of interest, including membrane-bound proteins. Membrane-bound 
protems and receptors can play an imporum. role in the formation, differentiation and maintenance of multicellular 
organisms. TTie fate of many individual cells, e.g.. proliferation, migration, differentiation, or interaction wid, other 
cells. IS typically governed by information received from other cells and/or the immediate environmem TTus 
mformanon is often transmined by secreted polypeptides (for insuince. mitogenic factors, survival fectors. cvtotoxic 
actors, differentiation factors, neuropeptides, and hormones) which are, in turn, received and interpreted by diverse 
cell receptors or membrane-bound proteins. Such membrane-bound proteins and cell receptors include, but at* not 
bmited ,0. cytokine receptors, receptor kinases, receptor phosphatases, receptors involved in cell-cell mteracuons 
and cellular adhesin molecules like selectins and integrins. For insumce, transduction of signals that regulate cell 
growth and differentiation is regulated in pan by phosphorylation of various ceUular proteins. Protem tyrosme 
kinase,, enzymes that catalyze that process, can also act as growth factor receptors. Examples include fibroblast 
growth factor receptor and nerve growdi factor recepva. 

Membrane-bou«l proteins and receptor molecules have various industrial appUcations, including as 
Phamviceutical and diag««tic agents. Receptor immunoadhesins. for instance, can be employed as therapeutic agents 
to block receptor-ligand interaction. 1^ membrane-bound proteins can also be employed for screening of potential 
pepude or small molecule inhibitors of the relevant receptor/ligand interaction. 

Efforts are being undertaken by both industry and academia to identify new. native secreted and membrane- 
bound receptor proteins, particularly membrane bound proteins having identity wid, cadherins. THe results of such 
eifora are provided herein. 
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60. EEQ222 . . , 

Proieases are enzymatic proteins which are involved in a large number of very .mponam biological 

processes in mammalian antl non-mammalian organisms. Numerous differem protease enzymes from a variety of 
different mammalian and non-mammaUan organisms have been both identified and characterized, including the serine 
proteases which exhibit specific acdviiy toward various se,ine<omaining proteins. The mammalian protease enzymes 
5 play important roles in biological processes such as. for example, protein digestion, activanon. inaetivauon. or 
modulation of peptide hormone activity, and alteration of the physical properties of proteins and enzymes. 

Neuropsin is a novel serine protease whose mRNA U expressed in the central nervous system. Mouse 
ncuropsin has been cloned, and s«dies have shown that it is involved in the hippocampal plasticity. Neuropsin has 
also been indicated as associated with extracellular mauix modifications and cell migrations. See. generally. Chen. 
10 ct al.. tkUISSSi.. 7(2):5088-5097 (1995) and Chen, et al.. I HiMorkm , Cvtothcm .. 46:313-320 (1998). 

Efforts are being undertaken by both industry and academia to identify new. native membrane-bound or 
secreted proteins, panicularly Aose having homology to neuropsin. serine protease, neurosin and trypsinogen. Many 
efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for 
novel secreied and membrane-bound receptor proteins. Examples of screening methods and techniques are descnbed 
,5 in the literamre Isec for example. Klein et al.. Prpr m M Sa . 22:7108-7113 (1996); U.S. Patent No. 
5.536.637)1. 



61. £RQS2li . 

Protein-protein interactions include those involved with receptor and antigen complexes and signaling 
20 mechanisms. As more is Icnown about the stnicmral and functional mechanisms underlying protein-protein 
irtcracdons, protein-protein interacdons can be more easily manipulated to regulate the particuUr result of the proteu*- 
protein intetacuon. Thus, the underlying mechanisms of protein-protein interactions are of interest to the scienufic 
and medical community. 

All proteins containing leucine-rich repeats are thought to be invoWed in protein-protein interacdons. 
25 Leucine-rich repeats are shon sequence motifs present in a mnnber of proteins with diverse hmctions and cellular 
locadons TTk crystal structure of ribonuclease inhibitor protein has revealed that leucine-rich repeats correspond 
to beu-alpha stru«ural units. These units are arranged so that they form a parallel beta-sheet with one surface 
exposed to solvent, so that the protein acquires an unusual, nonglobular shape. These two features have been 
indicated as responsible for the protein-binding fimctions of proteins containing leucine-rich repeats. See. Kobe and 
30 r.^----'" T.^cBi«.^hen.. Sci.. 19(10):41S^21 (Oct. 1994). 

A snidy has been reported on leucine-rich proteoglycans which serve as tissue organizers, onenting and 
ordering collagen fibrils during ontogeny and are involved in pathological processes such as wound healing, dssue 
repair, and tumor stroma formation, lozzo. R. V.. Trit Prv Piorhrm Mol. Biol-. 32(2): 141-174 (1997). Others 
studies impUcating leucine rich proteins in wound healing and tissue repair are De U Salle. C. et al.. VWY . RCY .. 
35 ElJianaHd- (Gem«my). 37(4):215-222 (1995). reporting mutations in the leucine rich motif in a complex associated 
widi the bleeding disorder Bemard-Soulier syndrome. Chlemetson. K. J.. TTiromfr , Hacmoa - (Germany). 74(1):1 11- 
116 (luly 1995), reporting that platelets have leucine rich repeau and Rnoslahti. E. 1.. et al. . W091 10727.A by U 
Jolla Cancer Resea«h Foundation reporting that decorin binding to transfomring growth factor? has mvolvemem m 
a treatment for cancer, wound healing and scarring. Related by function to this group of proteins U the insulin like 
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g«,w* fac«,r aCF). ia to. n « useful in wound-healing and assocated U«:ap,es conS wid, rc-growth of «ssue 
such as connecdve ossue. sldn and bone: in pron.o.ing body growO. .n hunuu. and annnals; and in sumulaung oche; 
growft-^lated processes. The acid labile subuni, (AU) of IGF is also of .meres, in to, i, increases the haif-life of 

6(6).870-876 (1992): Bax«r. LMfim.. 264(20).n843-l 1848 ri989,: and Khosravi. e, al.. LSJilLlni^ 
Mfiiab.. 82(12):3944.3951 fl997). '" ' ^'"^^""^ 1 

Anotor protein which has been reponed to have leucine-nch repca« is to SLIT protein which has been 
reported to be useful .„ treating neuro-degenerative diseases such as Alzheimer's disease, nerve damage such as In 
Parlanson-s disease, and for diagnosis of cancer, see. Artavanistsakonas. S. and Ro,hberg. J. M.. WO9210518.A1 
by Yale University. Also of interest is LIG-1. a n^mbrane glycoprotein to, is expressed specifically in glial cells 
« to mouse brain, and has leucine rich repeats and immunoglobulin-likedotnains. Suzuki etal JRiol Chen, 
(U.S.). 27,(37):r^22 (1996). Otor studies reporting on to b.olog.ca. funct.ons of proteins 
repeats mclude: Tayar. N.. e. al.. MqI . C. I I Fn^orrinol -. (Ireland). 125(.-2):65-70 (Dec. ,996) (gonadotrop.n 
receptor involvement): Miura. Y.. e. al.. mmLM^ (Japan). 54(7): 1784-1789 (July 1996) (apoptosis 
-olvement): Harris. P. C. e. al.. J Am, N^phr^ .. 6(4):H25-1133 (Oct .995) (kidney disease invo.vemen,) 

.EfTons are torefore bang undertaken by bo* indusoy and academia to tdenrify new proteins having leucine 
nch repeats to better understand protein-protein in,en.ctions. Of particular interest are tose pro,ems having leucine 
nch repeats and identity or similarity to known proteins having leucine rich repeats such as ALS. Many effons ar« 
focused on to screening of mammalian recombinam DNA libraries to idemify to coding s«,„ences for novel 
secreted and membrane-bound proteins having leucine rich repeats. Examples of screening methods ami techniques 
are described in to literamre (see. for example. Klein et al.. Pf«, Nail, And ,^ri . 22:7108.7113 (1996)- U S 
Patent No. 5.536.637)1. 



62. £BQS31 

Cadherinsanralargefemilyofnansmembraneprottins. Cadherins comprise a family of calcium-dependent 
glycoproteins tot function in mediating cell-cell adhesion in virtually all solid tisaues of multicellular orgamsms 
At teas, cadherins M3 as well as types B. E. EP. M. N. P and R have been characterized. Among to functions 
cadhenns are known for. wid, some exceptions, cadherins partictpate in cell aggregation a«l are associated with cell- 
cell adhesion sites. Recently, it has been reponed tot while all cadherins share multiple repeats of a cadhcrin 
spectfic motif believed to correspond to folding of extracellular domains. m«nbers of to cadherin superfentily have 
dtvergem strucmres and, possibly, functions. In particular it has been reported that metnbers of to cadherin 
superfamily are involved in signal transduction. See. Suzuki. ^mMm... 61(4):531-542 (1996). Cadherins 
are furtor described in Tanihara. e. al.. L£dL^.. 107(6):,697-1704 (1994). Aberle. et al.. L£siLmhsm 
61(4):5l4-523 (1996) and Tanihara. e. al.. £dUidhcU^.. 2(.):15.26 (1994). 

Protocadherins are members of to cadherin superfamily which are highly expressed in to brain. In some 
stud.es, protocadherins have shown cell adhesion activity. See. Sano. et al.. EMfiQJ.. 12(6)-2249.2256 (1993) 
However, smdies have also shown to. some protocadherins, such as protocadherin 3 (also referred to as Pcdh3 or 
PC3), do no. show strong calcium depemlem cell aggregation activity. See. Sago, et al.. Qsmim, 29(3):631.640 
(1995) for tfiis smdy and further characteristics of Pcdh3. 
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Therefore. noveSbers of the cadhenn supcrfamily are of interest. More generally, all membrane-bound 
proteins and receptors arc of interest. Such proteins can play an important role in the fonnauon. differcntiaiion and 
nuintenance of multicellular organisms. The fate of many individual cells, eg., proliferation, migration, 
differenliadon. or interaction with other cells, is typically governed by information received ftom other cells and/or 
fte immediate environment. This informauon is often transmitted by secreted polypeptides (for instance, mitogenic 
fectors. survival fectors. cytotoxic factors, differentiation factors, neuropeptides, and hormones) which a«. in mm. 
received and interpreted by diverse ceU receptors or membrane-bound proteins. Such membrane-bound proteins and 
ceU receptors include, but are not limited to. cytokine receptors, receptor kinases, receptor phosphatases, receptors 
involved in cell-cell interactions, and cellular adhesin molecules like selecuns and integrins. For insumce. 
oansducdon of signals that regulate cell growth and differendation is regulated in par. by phosphorylation of various 
cellular proteins. Protein tyrosine kinases, enzymes that catalyze that process, can also act as growth factor 
receptors. Examples include fibroblast growth factor receptor and nerve growth factor receptor. 

Membrane-bound proteins and receptor molecules have various mdustrial applications, including as 
phamuceutical and diagnostic agents. Receptor immonoadhesins. for instance, can be employed as therapeutic agenu 
to block receptor-ligand interaction. The membrane-bound proteins can also be employed for screeni.« of potential 
15 peptide or small molecule inhibitors of the relevant receptor/ligand interaction. 

Efforts are therefore being undertaken by both industry and academia to identify new. native membrane 
bound proteins, particular those having sequence identity with protocadherins. especially 3 and 4. Many efforts are 
focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel 
membrane-bound proteins. Provided herein are the results of such efforts. 
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63. EBQ524 

Protein disulfide isomerase is an enzymatic protein which is involved in the promotion of correct refolding 
of proteins through the establishment of correct disulfide bond formation. Protein disulfide isomerase was initially 
identified based upon its ability to catalyze the renamration of reduced denamred RNAse (Goldberger et al.. UinL 
Clttm 239:1406-1410 (1964) and Epstein et al.. cm ^Prinp Hartwr ^VmP Ouant . Bio l- 28:439-149 (1963)). 
Protein disulfide isomerase has been shown to be a resident enzyme of the endoplasmic reticulum which is retained 
in the endoplasmic reticulum via a -KDEL or -HDEL amino acid sequence at its C-terminus. Protein disulfide 
isomerase and related proteins are further described in Uboissiere. e. al.. LMLS2m.> 270(47:28006-28009 
(1995): Jeenes,etal..fi6a6. 193a):15W56 (1997; Koivunen. etal.. Gsmm. 42(3):397-404 (1997): and DesUva. 
et al.. ntJA Cell Biol .. 15(1):9-16 (1996). These smdies indicate the importance of the identification of protein 
disulfide related proteins. 

More generally, and also of interest are all novel membrane-bound proteins and receptors. Such proteins 
can play an imponant role in the formation, differentiation and maintenance of multicellular organisms. The fate of 
many individual cells, e.g.. proliferauon. migration, differentiation, or interaction with other ceUs. is typically 
governed by information received from other cells and/or the immediate environmem. TTus informauon is often 
nansmioed by secreted polypeptides (for instance, mitogemc ftoors. survival factors, cytotoxic factors, differentiation 
factors, neuropeptides, and hormones) which are. in turn, received and imerpreted by diverse cell receptors or 
membrane-bound prtNtins. Such mcmbraneW proteins and cell receptors include, but are no. limited to. cytokine 
receptors, receptor kinases, receptor phosphatases, receptors involved in cell-cell interactions, and cellular adhesin 
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molecules like selecuns and integruu.. For i««„ce, transduccion of signal^ regulate cell growU, and 
d»ffe«««aon ^ regulated m pan by phosphorylation of various cellular proteins. Protein tyrosine Icnascs enzymes 
catalyze d.t process, can also ac, as growth fector receptors. Examples .ndude fibroblast growth factor receptor 
and nerve growth factor receptor. 

Membrane-bound proteins and receptor molecules have vanous industrial applicadons. including as 
5 phannaceudcal and diagnosdc agents. Receptor immunoadhesins. for .nstance. can be employed as therapeutic agents 
.0 block receptor-ligand interaction. TTk membrane-bound proteins can also be employed for screening of potential 
pepude or small molecule inhibitors of the relevant receptor/ligand interaction. 

Given the importance of membrane bound proteins, efforts are under way to identity novel membrane bound 
p^tetns. Moreover, g.ven the importance of disulfide bond-forming enzymes and Uteir potential uses in a number 
10 of different appUcations. for example in increasing the yield of correct refold.ng of recombinantiy produced proteins 
effom are cunremly being undenaken by boO. industry and academ,a ,0 identify new. r^tive proteins havmg sequence 
identity witi, protein disulfide isomerase. Many of these efforts are focused on the screening of mammalian 
recombinant DNA libraries to identify tite coding sequences for novel protein disulfide isomerase homologs We 
herein describe a novel polypeptide having sequence idemity witi, protein disulfide isomerase and the nucleic acids 
IS encoding the same. 



64. £BiMZ 

Secreted frizzled related proteins (sFRPs) are related to die frizzled family of transmembrane receptors 
TTK SFRPS are approximately 30 kDa in size, and each comains a putative signal sequence, a frizzled-like cysteine- 
nch domain, and a conserved hydrophilic carboxy-terminal domain. It has been reported tiutt sFRPs may (unction 
« modulate Wm signaling, or function as ligands for certain receptors. Ratmer. et al.. £MSiiSA. 94(7):2859-2863 
(1997). TTKrefore. sFRPs and proteins having sequence identity and/or similanty to sFRPs are of interest 

Another secreted protein of interest is any member of the family of secreted apop.osis-related proteins 
(SARPS). E^««ionofSAW.smodiiiesthein.racelhdarlevelsofbe.a.ca,emn. suggest 

tite Wnt-frizzled proteins signaling padiway. Melkonyan. et al.. PNAS UM. 94(25):13636-13641 (1997) 
•merefoie. SARPs and proteins having sequence idemity and/or similanty to SARPs aie of interest 

m addition to SFRPs and SARPs. many extracellular proteins are of interest. Extracellular proteins play 
an important role in ti,e formation, differentiation and maintenance of multicellular organisms. Tlte fate of many 
mdividual cells, e.g.. proliferation, migration, differentiation, or interaction with odier ceUs. is typically goven^d 
by mformation received from odier cells and/or the immediate environment. TOs infomution i, often tmsnmti 
by secreted polypeptides (for instance, mitogenic factors, survival factors, cytotoxic factors, differentiation factors 
neuropeptides, and honnones) which are. in mm. received and inten,reted by diverse cell receptors or membrane' 
bound proteins. TTiese secreted polypeptides or sigruling molecules normally pass fltrough ti« ceUular secretory 
padiway to reach dieir site of action in die extraceUular environment. 

Secreted proteins have vanous industrial applications, including phannaceuticals. diagnostics, biosensors 
and bioreactors. Most protein dnigs available at presem. such as thrombolytic agents, interferons, interleukins 
crydiropoierins. colony stimulating factors, and various odter cytokmes. are secretory proteins. THeir receptors' 
which are membrane proteins, also have potential as dterapeutic or diagnostic agems. 



32 



PCT/US99/0S028 

WO 99/46281 

Efforts arc beinSertakcn by both industry and acadcmia lo ideiuify new. native secreted proteins, 
paniculaily those having sequence identity or similarity with sFRP-2 and SARP-1 . Many efforts arc focused on the 
screening of mammalian recombinant DNA libraries to idcntiiy the coding sequences for novel secreted proteins. 
Examples of screening methods and techniques are described in Ihc literature (see. for example. Klein et al.. Ecifi. 
I ^ j ^i Arad Sci. . 22:7108-7113 (1996); U.S. Patent No. 5.536.637)). 

65. EEQZIZ 

Efforts are being undertaken by both industry and academia to identify new, native transmembrane receptor 
proteins. Many efforts ate focused on the screening of mammalian recombinant DNA libraries to identify the coding 
sequences for novel receptor proteins. The results of such efforts are provided herein. 
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66. PRQ731 

Cadherins arc a large family of transmembrane proteins. Cadherins comprise a family of calcium4eper>dem 
glycoproteins that function in mediating cell-cell adhesion in virtuaUy all solid tissues of multicelhilar organisms. 
At least cadherins 1-13 as well as types B. E. EP. M, N. P and R have been characterized. Amoi« the functions 
15 cadherins are known for, with some exceptions, cadherins participate in ceU aggregation and are associated with cell- 
cell adhesion sites. Recently, it has been reported that while aU cadherins share multiple repeats of a cadherin 
specific motif belfevcd to correspond to folding of extracellular domains, members of die cadherin supcrfamily have 
divergem structures and. posstDly. functions. In particular it has been reported that members of the cadherin 
superfamily are involved in signal transduction. See. Suzuki. LiZdlJiflStem-. 61(4):531-542 (IW^^ Cadherins 
20 are further described in Tanihara. et al.. LislLSsi- 107(6): 1697-1704 (1994), Aberle. et al.. L^filLBifiShOD.. 
61(4):5 14-523 (1996) and Tanihara, et al.. Cr]] A<1hW, CpmmWl- 2(1): 15-26 (1994). 

Proiocadherins are members of the cadherin superfamily which are highly expressed in the brain. In some 
studies, protocadherins have shown cell adhesion activity. See. Sano. et al.. EMBai-, i2(6):2249-2256 (1993). 
However, smdies have also shown diat some protocadherins, such as proiocadherin 3 (aUo referred to as Pcdh3 or 
25 pc3), do not show strong calcium dependem cell aggregation activity. See. Sago. « al.. Qsimm. 29(3):63l^ 
(1995) for this smdy and further characteristics of Pcdh3. 

Therefore, novel members of the cadherin superfamily are of imerest. More generaUy, all membrane-bound 
proteins and receptors are of interest. Such proteins can play an importam role in the formation, differemiation and 
maintenance of multicelhilar organisms. The fate of many individual cells, e.g.. proliferation, migration, 
differentiation, or interaction with other cells, is typically governed by information received from other cells and/or 
the immediate environment. TTus information is often iransmined by secreted polypeptides (for instance, milogenic 
factors, survival factors, cytotoxic factors, differentiation factors, neuropeptides, and hormones) which are. in mm. 
received and interpreted by diverse ceU receptors or membrane-bound proteins. Such membrane-bound proteins and 
ceU receptors include, but are not limited to. cytokine receptors, receptor kinases, receptor phosphatases, receptors 
involved in cell-cell interactions, and ceUular adbcsin molecules like seleains and intcgrins. For instance, 
transduction of signals that regulate cell growth and differemiation is regulated in pan by phosphorylation of various 
cellular proteins. Protein tyrosine kinases, enzymes (hat catalyze tiiat process, can also act as growth factor 
receptors. Examples include fibroblast growdi factor receptor and nerve growth factor receptor. 
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Membrane-bound proteins and receptor molecules have various uSial apphcations. including as 
Pharma^dcal and diagnostic agents. Receptor immunoadhesi». for .nstance. can be employed as therapeucc agents 
to block receptor-Ugand interacUon. The membrane-bound proteins can also be employed for screening of potential 
pepude or smaU molecule inhibitors of the relevant receptor/ligand imeraciion. 

^^f°"'"«'^^«^°«'«-gundenalcen by both industry and academ^a to identify m^^^ 
bound proteins, particular U^ose having sequence idemity with protocadherins. especially 4. 68 43 42 3 and 5 
Many cffons are focused on the screening of mammalian recombinant DNA libraries to tdentily the coding Sequences 
for novel membrane-bound proteins. Provided herein are the results of such effons. 

67. CEQ21S 

Efforts are being undertaken by both indusny and academia to identify new. native membrane bound 
protems. particularly those havmg sequence identity wiO. membrane regulator proteins. Many effons ar. focused 
on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel receptor 
protems. *^ 
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15 68. mm 

The imegrins comprise a supergene family of cell-surface glycoprotein receptors that promote cellular 
adhesion. Each ceU has numerous receptors that define its cell adhesive capabilities. Imegrins are involved in a wide 
var^ty of interaction between cells a«l other cells or matrix components. ITie imegrins are of particular Imponance 
m reguladng movement and function of immune system celU. TT,e platelet llb/niA integrin complex is of particular 
uyrtance in reguladng platelet aggregadon. Amember Of d^integrinf^^^ integrinM. is expressed on epithelial 
cells and modulates epithelial inflammation. Another integrin. leucocyte-associated antigen- 1 (LFA-1) is important 
m the adhesion of lymphocytes during an immune response. 

Of particular interest is H36-alpha 7. an integrin alpha cham that is developmentally regulated during 
myoge.«sisasdescribedinSong.etal..L£^ .17(3):643-657 (1992). T.e expression panem of Oie laminin- 
bmdmg alpha 7 beta 1 integrin is developmentally regulated in skeleul. cardiac, and smood. muscle. Ziober et al 
M^OtoLXai. 8(9):,723-I734 (1997, It has been reported U»t expression of the alpha 7-X1/X2 integrin is a novel 
mechamsm.hatregula.es receptor affinity states in a cell-specific context and may modulate integrin-dependem events 
during muscle developme«a«l repair. M. I- iUrther been reported dta. laminins promote d« locomotion of 
Skeletal myoblasts via ti« alpha 7 integrin receptor. In particular it was reported .ha. alpha 7 beu 1 receptor can 
promote myoblast adhesion and motiUty on , restricted manber of laminin isoforms a«i may be imponan. in 
r^^enic precursor recmitmentduri^ Vao. e. al.. LidLSd.. 109(13):3139-3150 

243(1).317-325 (1998) and Fomaro and Unguino. Mattkiifll-. 16(4):185-193 (1997). Moreover, it ha, been 
reported that absence of integrin alpha 7 causes a form of muscular dystrophy. THus imegrins. particularly tiK»e 
related to integrin 7 and related molecules, are of interest. 

In addition to die interest of imegrins. more generally, all membrane-bound proteins and recepmrs are of 
mteres. since such proteins can play an importam role in the formation, differentiation and maimenance of 
muhicellular organisms. THe fate of many individual ceUs. e.g.. proliferation, migration, differemiation or 
.nteraction with other cells, is typically governed by information received from odier cells and/or U« immediate 
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environmem. This inlonS is often transmitted by secreted polypeptides (for instance, mitogenic factors, survival 
factors, cytotoxic factors, differentiation factors, neuropeptides, and hormones) which are. in mm. received and 
interpreted by diverse ccU receptors or raembrane-bound proteins. Such membrane-bound proteins and cell receptors 
include, but are not limited to. cytokine receptors, receptor kinases, receptor phosphatases, receptors involved in ccll- 
cefl inttracnons. and ccUular adhesin molecules like selectins and imcgrins. For instance, iransducaon of signals that 
5 regulate ceU growth and differentiation is regulated in part by phosphorylation of various cellular proteins. Protein 
tyrosine kinases, enzymes that catalyze that process, can abo act as growth factor receptors. Examples include 
fibroblast growth factor receptor and nerve growth factor receptor. 

Membrane-bound proteins and receptor molecules have various industrial applications, including as 
pharmaceutical and diagnostic agents. Receptor immunoadhesins. for instance, can be employed as therapeutic agents 
10 to block teceptor-ligand mteraction. Hie membrane-bound proteins can also be employed for screemng of potential 
peptide or small molecule inhibitors of the relevant receptor/ligand interaction. 

Therefore, efforts are being undertaken by both industry and acaderaia to identify new. native receptor 
proteins. Many efforts are focused on the screening of mammalian recombinant DNA libraries to identify the coding 
sequences for novel receptor proteins. The results of such efforts, particularly those focused on identifying new 
15 polypeptides having sequence idenuiy with integrins. are provided herein. 



69. PR0771 

Testican is a multidomain testicular proteoglycan which is expressed in numerous tissue types including, 
but not limited to neuromuscular tissue, the brain and reproductive tissues. Testican resembles modulators of cell 
20 social behavior such as the regulation of ceU shape, adhesion, migration and proliferation. [Bonnet. F. et al.. LJifiL 
£h£BL. 22i(8):4373 (1996). Perin. J.P. et al.. EXS (Swiaerland). 70:191 (1994). Alliel. P.M.. et al. ElU^ 
BiSShEDL. 214(1):346 (1993). Charboraiicr. F.. et al.. r p ^m?^ ?)0C, Biol. Fil. (France). i21(l):127 (1997)]. 
Among other reasons, since testican has been impUcattd in neuronal processes and may be associated with the growth 
of connective tissue, testican and related molecules arc of interest. 
25 More generally, all extracelhilar proteins are of interest. Extracellular proteins play an important role m 

the fonnation. differentiation and maintenance of multicellular organisms. THe fate of many individual cells, e.g.. 
proliferation, migration, differeniiarion. or interaction with other cells, is typically govenicd by infonnauon received 
from other celU and/or the immediate environment. This information is often transmined by secreted polypeptides 
(for instance, mitogenic factors, survival factors, cytotoxic factors, differentiation factors, neuropeptides, and 
30 hormones) which are. in mm, received and interpreted by diverse ceU receptors or membrane-bound proteins. These 
secreted polypeptides or signaling molecules normaUy pass through the cellular secretory patiiway to reach tiieir site 
of action in the extracellular environment. 

Secreted proteins have various industrial applications, including phannaceuticals. diagnosucs. biosensors 
and bioreaciors. Most protein drugs available at presem. such as dirombolyuc agents, interferons, mttrieukim, 
35 erytiiropoieuns. colony stimulating factors, and various odier cytokines, are secretory proteins. Their receptors, 
which are membrane proteins, also have potential as therapeutic or diagnostic agents. Efforts are bemg undertaken 
by both industry and academia to identify new, native secreted proteins. Many efforts are focused on the screening 
of mammalian recombinant DNA libraries to identify the coding sequences for novel secreted proteins. Examples 
of screening methods and techniques are described in tiie harature |sce. for example. Klein et al.. Prft6 , Nail . Ac a d . 
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Sa. ^:7i08-7H3 (.996); ul. Pa.cn. No. 5.536.637),. resu.. of such cS^rts. pamcula.., *osc focse. on 
.demtfying molecules having idenu^ and/or similan^. wim .csdcan are of imercsc. 
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70. PR0733 

T1/ST2 is a reccptor-Uke molecule homologous .o U« .ype I i„.erleukin-l receptor, believed to be involved 
m cell Signaling, live T1/ST2 receptor and/or putauve ligands are furdier described m Gayle. e. al.. IMLSS^ 
271(10):5784-5789 (1996). Kun.. « ... o^.. 270(46,.27905-279.3 (1995). and Mitcham^fTt^ 
271(10:5777-5783 (1996). Tl,ese pro.e.ns. and proteins related thereto are of inters. " 

'^-«-""y^'™-bnu..boundpro.e.„sandrecep.orsareofm,eres.since.heycanpta^ 
role ■„ the fonnadon. differentiation and maintenance of multicellular organisms. Tl,e fate of many individual ceUs 
eg., proliferation, migration, differentiation, or interaction .id, other cells, is typically governed by infon»„ion 
received from other cells and/or me immedtate env.ronmem. information is often transmined by secreted 
polypeptides (for insunce. mitogenic facers, survival factors. cyu,.oxic factors, differentiation factors, neuropepddes 
»d hormones) which are. in mm, recetved and interpreted by diverse cel. receptors or membrane-bound proteins' 
such membrane-bCKmd proteins and ceU receptors mclude. but a« Bmited to, cytokine receptors, receptor kinases 
^ceptor phosphatases, receptors involved in cell-cell interactions, and cellular adhesin molecules like selectins and 
mtegnns. For insunce. transduction of s.gnals that regulate cell growd. and differentiadon « regulated in pan by 
Phosphorylation of various cellular proteins. Protein tyrosine kinases, enzymes d«t catalyze dutt process, can als! 
act as grow* factor receptors. Examples inch.dc fibroblast growth factor receptor and nerve growth fector receptor 
Membrane-bound proteins and receptor molecules have various i«iustrial appUcaUons. includ«g as 
pharmaceuncal and diagnostic agents. Receptor immunoadhesins, for i^tance. can be employed as therapeuUc agents 
«> block receptor-ligand interaction. The membrane-bound proteins can also be employed for screening of potential 
peptide or small molecule inhibitors of die relevant receptor/ligand interacuon. 

Efforts are being und«taken by both industry and academia to identify new. native receptor proteins. Many 
efforts are focused on d« screening of mammalian recombinant DNA libraries to identify dte coding sequences for 
novel receptor proteins. The results of such efforts ate provided herein. 



25 



71. PRQl<t2 

Pancreatitis-associated protein (PAP) is a secretory protein *at is overexpressed by d,e pancreas during 
acute pancreatitis. Serum PAP concentraaons have been shown to be abnormally high in paUents with 
30 pancreatitis. PezzilK et al.. Am. J. qastrffCniftrffK. 92(10):1887.|890 (1997). 

PAP is synthesized by die pancreas due to pancreatic inflammarion and has been shown to be a good serum 
marker for injury of die pancreas. In addiuon. serum PAP levels appear to strongly conflate wid, creadnine 

measurements. In patients widi a pancre^-kidney transplantation, PAP may prove to be a usefUl biological 
»d histological marker of pancreatic graft rejection. Van der Pijl et al. . m^,,,^, 63(7):995-1003 (.997) 
Further, PAPhas been shown to be useftil m screening neonates for cystic fibrosis. In fact, PAP may discriminate 

cystK:fib.osi,„eo«.,es with bener specificity dian die curremimmunoreactivetiypsisas^^^^ lovannaetal r» 
Acad. Aci, III. 317(6):561-564. ' 

Secretedpmteinssuchas PAPhave various industrial applications, including pharmaceuticals, diagnostics 
biosensors and bioreactors. Most protein dmgs available at present, such as dirombolytic agents, interferons! 
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iracrteuldns. erythropoictii^lony stimulating factors, and various other cytokines, are secretory proteins. Their 
receptors, which arc membrane proteins, also have potential as therapeutic or diagnostic agents. 

Efforts are being undertaken by both industry and academia to identify new. native secreted proteins. Many 
efforts are focused on the screening of mammalian recombinant DN A libraries to identify the coding sequences for 
novel secreted proteins. Examples of screening methods and techniques arc described in the liieranirt (see. for 
5 ri,imr'- P- Acd Sci. . 93:710^7113 (1996); U.S. Patent No. 5.536.637)1. The results of such 

efforts are presented herein. 
72. ERQZ88 

Ami-neopbstic urinary protein (ANUP) was identified as the major protein present in a fraction of human 
10 urine which exhibits antiproliferative activity against human mmor cell lines without affecting the growth of several 
normal diploid cell lines or tumor cells of mouse or hamster origin. Sloane et al.. fijacUcnU., 234(2):355-362 
(1986). 

ANUP is a unique cytokine that has been found in human gramilocytes. The N^crrainal ammo and 
sequence has been shown to be unique. A synthetic peptide corresponding to the first nine residues, with Cy, at 
15 positions 4 and 7. was found to be an anti-wmor agent in vitro. Ridge and Sloane. Qaclaitt. 8(l):l-5 (1996). 

Secreted proteins such as ANUP have various industrial applications, including pharmaceuticals, diagnostics, 
biosensors and bioreactors. Most ptxKein dmgs available at present, such as thrombolytic agents, interferons, 
inerieukins. etythropoietins. colony stimulating factors, and various other cytokines, are secreto^^ proteins. Tl,e.r 
.ecepiors. which are membrane proteins, also have potential as dterapeutic or diagnostic agents. Efforts are being 
20 undertakenbybothindustTyandacademia.oidentifyncw.na.ivesecretedproteins. Many efforts are focused on the 
screening of mammalian recombinant DNA libraries to identify the coding sequences for novel secreted proteins. 
Examples of screening methods and techniques are described in the literamre {sec. for example. iOcin et al.. ElflC 
Acad. Sci. . 21:7108-71 13 (1996); U.S. Patent No. 5,536.637)1. 

25 73. mam 

DickkopM (dkk-1) is a member of a famUy of secreted proteins and functions in head inducoon. Dkk-1 
lsaninducerofSpemannorganizcrinan^T»anemb.yos. Glinka. e. al.. Mamrs. 391(6665):357-362 (1998). Dkk-1 
is a potent antagonist of Wnt signalling, suggesting that dkk genes encode a family of secreted Wnt inhibitors. Tl»us. 
dkk-1 family members and related molecules are of interest. 

30 More generally, all extraceUular proteins are of interest since they can play an important role in the 

formation, differentiation and maintenance of multicellular organisms. THe fate of many individual cells, e.g.. 
proliferation, migradon. differeffliation. or interaction with other cells, is typically governed hy information received 
from other cells and/or the immediate enviromnent. This information is often transmined by secreted polypeptides 
(for instance, mitogenic faaors, survival bcton. cytotoxic factors, differentiation factors, neuropeptides, and 

35 honnones)whichaie.intum..*ceivedandinterpietedbydiversecellreceptorsormemb^^ n>cse 
«aeied polypeptides or signaling molecules normally pass through the cellular secretory pathway to reach their site 

of action in the extracellular environment. 

Secreted proteins have various industrial applications, including pharmaceuticals, diagnostics, biosensors 
and bioreactors. Most protein drugs available at present, such as thrombolytic agems. interferons, interleukins. 
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cryU,ropoi«i„s. colony sUmulaung factors. ^ various oO^er cy.oRi.s. are Tory protc... „ceptors 
which are membrane proteins, also have potential as therapeutic or diagnostic agents. 

Efforts are being underuUcen by both tndustty and academia to .dentify new. ™„,ve secreted proteins 
parttcularly those reU^ced to d«c-l. Many efforts are focused on the screening of mammalian recombinant DNA 
^branestoidentifydtecodrngsequencesfornovelsecretedproteins. Examples of screentng methods and techniques 
> are described in the literature .see. for example. Klein et al.. SlS^^JM^,^, 22:7.08-7, ,3 (,996)- U S 
Patent No. 5.536.637)]. The results of such effons to identify molecules related to dkk-1 are provided herein. 

Protein disuUide isomerase is an enzymatic protein which is involved in the promouon of corrtct refolding 
ofproemsthroughtheesublishmentofcorrectdisulfidebondformat^^^^ P-ein disulfide isomerase was imtiaU 
•dennfied based upon its abihty to catalyze the renacuration of reduced denatured RNAse (Goldberger et al L^L 
£bm. 239:.406-,410 (.964, and Epstein et a... Co.d SoHno c.,^, , ,3,3,^,;;;^ 

Protem di«.fide isomerase has been shown to be a resident enzyme of the endoplasmic reticulum which ts retained 
tn the endoplasmic reticulum via a -KDEL or -HDEL amino acid sequence at its Ctermtnus. Protein disulfide 
« and related proteins arc further described in Uboissiere. et a... I. Pip,, Chm.. 270(47:28006-28009 
(1955): Jeenes. e. al.. Q^, ,93(2):.51-,56 (1997; Koivunen. et al.. 42(3):397^4 (1997); and Desi.va 

et a. fiMXdl^.. 15(,):9-16 (.996). Titese studies indicate the importance of the idennfication of protein 
disulfide related proteins. 

^-«-'^y.*-''«™ficadonofal.extraceUuhra«.membra„e-b«^ 
play anportant roles in the formation. differenUation and maintenance of multicelhitar organisms. IT* fate of many 
u.d.v.dua. cells, e.g.. proliferation, migration, differemiation. or interaction with other celb. is typically governed 
by tnformation received from other cells and/or the immediate environment. Tlus informaUon is often tnmsmined 
by secreted pcrfypeptides (for instance, mitogenic factors, survival factors, cytotoxic factors, differemianon fectors 
neuropepddes. and hormones) which are. in turn, recetved and interpreted by d.verse cell receptors or membr^e^ 
bound protcms. n,ese secreted polypeptides or signahng molecules normally pass through the cellular secretory 
pa^way .0 reach their site of acnon in the extracellular enviromnent. usually at a membrane-bound receptor protein 
Secreted proteins have various industrial applicaUons. including use as pharmaceuticals, diagnostics 
btosensots and bioreactors. h. ftct. most protein drugs available at present, such as thrombolytic agents, interferons' 
uuerleuto. erythropoietic, colony stimulating fectors, a«l various other cytokines, are secretory protems T^ir 
receptors, which are membrane-bound proteins, also have potential as therapeutic or dtagnostic agents. Receptor 
.mmunoadhesins. for instance, can be employed as therapeutic agents .0 bkxk receptor-ligand interacdon 
Membrane-bound proteins can aUo be employed for screening of potential pepdde or small molecule inhibitors of 
a>e relevant receptor/ligand interacdon. Such membnme-bound proteins and cel. receptors include, but are no. liauted 
.0. cytokine receptors, receptor kinases, receptor phosphatases, receptors involved in cell-cell interactions a«l 
cellular adhesin molecules like selecuns and integrins. Transducuon of signals that regulate cell growth and 
•Wfcrennanon is regulated in pan by phosphorylation of various cellular proteins. Protem tyrostne kinases, enzymes 
-uucatatyze to process, can ako aa as growth factor receptors. Examples i«.ude fibroblast growd, factor receptor 
and nerve growdi factor receptor. 
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Of particular inicresi are cellular proteins having endoplasmic reticulum (ER) retention signals. These 
proteins are retained in the ccU and function closely with endoplasmic reticulum in protein producUon. Such protetns 
have been described previously, i.e.. see Shorrosh and Dixon. ElanU-. 2(l):51-58 (1992). 

Efforts are being underuken by both industry and academia to identify new. native secreted and membrane- 
bound receptor proteins, and in parucular. cellular proteins having ER retension signals. Many efforts a« focused 
on .he screening of mammaUan recombinant DNA libraries to idemify the coding sequences for novel secreted and 
membrane W receptor proteins Examples of screening methods and techniques are described in dte literamre 
(see. for example. Klein et al.. Pror Nilll. A«<l. Sd. 22:7108-7113 (1996); U.S. Patem No. 5.536.637)|. The 
«ul.s of sucheflbrts. particularly the identification of novel polypeptides and nucleic acids encoding the same, whtch 
have sequence identity and similarity to protein disulfide isomerase are presented herein. 

75. mum 

oxygen free radicals and antioxidants appear to play an important role in the central nervous system after 
cerebral ischemia and reperfusion. Moreover, cardiac injury, related to ischaemia and reperfusion has been reported 
to be caused by the action of free radicaU. Additionally, swdies have reported dta, the redox state of the cell .s a 
pivotal determinant of the fate of the cells. Ftmhermote. reactive oxygen species have been reported to be cyiotoxK. 
causing inflammatory disease, including ussue necrosis, organ failure. atherosclerosU. infertiUty. birth defects, 
ptemamre aging. mu««io» and malignancy. TTius. the control of oxidation and reduction is important for a number 
of reason, including for control and prevention of strokes, heart attacks, oxidative stress and hypertension. In this 
regard reductases, and panicularly. oxidoreductases. are of interest. Publications further describing this subject 
matter inchKle Kelsey. et al.. Br > Q nc«. 76(7):852.4 (1997): Friedrich and Weiss. J^Jisfll^.. l87(4):529-40 
(1997) and Pieulle. et al., LEaiSSliJiL. 179(18):5684-92 (1997). 

In addidon to reducuses m particular, novel polypeptides are generally of Imcrest. Exuacellular pro^ins 
pby an important role in the formation, differentiation and maimenance of multiceUular organisms, m fete of many 
individual cells, e.g.. proliferation, migration, differentiation, or interaction with other cells. Is typically governed 
by information received from other cells and/or the immediate environmem. This infonnation is often transmitted 
by sccittedpolypeptides (for instance, mitogenic factors, survival factors, cytotoxic factors, differentiation factors, 
neuropeptide, and hormones) which are. in «m,received and imerpreted by diverse cell receptors or membrane- 
bound proteins. These secreted polypeptides or signaling molecules normally pass thrm,gh the ceUular secretoty 
padiway to reach their site of action in the extracellular environmem. 

Secreted proteins have various industrial applications, including pharmaceuticals, diagnostics, biosensors 
and bioreac«,rs. Most protein drugs available at present, such as thrombolytic agems. interferons, intcrleukins. 
erythropoietins, colony stimulating factors, and various other cytokines, are secretory proteins. The« receptors, 
which are membrane proteins, also have potential as therapeutic or diagnostic agems. Effbrts are being undertaken 
by both indusuy and academia to identify new. native secreted proteins. Many efforu are focused on the screening 
of mammalian recombinant DNA libraries to idemify the coding sequences for novel secreted proteins. Examples 
of screening methods and techniques are described in the literature |see. for example. Klein et al.. ElflC^HaiLAaiL 
Sa. 21-7108-7113 (1996); U.S. Patem No. 5.536,637)]. The results of such efforts, particularly those idenofymg 
poiyi^ptide. having seque«e idemny with reductases, and the nucleic acids encoding the same, are presented herem. 
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76. ERQiflU ~ 

Enzymanc prceins play imponan. roI« in a.c chemical reacuons involved in the digesuon of foods the 
b.osyn*es.s of n.cronK,.ecules. the controlled release and uttlization of che^ica, energ>.. and other processes 
necessary to susta,„ life. Su.fotransferases are enzymes which transfer sulfate from a sulfate donor to acceptor 
substrates, particularly those containing termma. glucoromc acid. 1^ HNK-, carbohydrate epitope is expressed 
5 on several neural adhesion glycoproteins and a glycolipid. and is involved in «ll .meracuons 

glucuronyltransferase and sulfotransferase are considered to be the key enzymes in the biosynthesis of this epitope 
^cause the res. of the s.rucmre occurs often in g.ycoconjugates. HNK-1 sulfotransfcrerase is further described in 
Bakker. H., et al., J. Biol. Che^n^ 272 (47). 29942-2 9946 (1997). 

In add.t.on to HNK-1 su.fotransfcrerase. and novel pro.e.ns related thereto, all novel proteins arc of 
.0 .merest. Extracellular and membrane-bound proteins p,ay important roles m the formaaon. dlffercnUaUo„ and 
mamtenance of muldcellular organisms. T^e fate of many individual cells, e.g.. pro.iferanon. migntuon 
<^erenaanon. or interaction with other cells, is typically governed by information received from other cells and/oi 
*e m^edtate envi.m„ent. This infonnaUon is often transmttted by secreted polypeptides (for i„tance. mitogenic 
factors, survtva. factors, cytotoxic factors, differemiation factors, neuropeptides, and hormones, which are in turn 
15 received and .nterprcted by diverse cell receptors or membrane-bound proteins. TT^se secreted polypeptides or 
s.gnal.ng molecules normally pass through the cellular secreto,, pathway to reach their site of action in the 
extracellular enviromnem. usually at a membrane-boond receptor protein. 

Secreted proteins have various industrial applications, including use as pharmaceuticals, diagnostic, 
btosenso. and bioteactors. In fact, most protein dn,gs available at present, such as thrombolytic agems. i^erf^™! 
20 mterleulons, erythropoietins, colony stimulating factors, and various other cytokines, are secretory proteim TT«ir 
receptors, which are membrane-bound proteins, also have potential as therapeutic or diagnostic agents. Receptor 
mununoadhesms. for insunce. can be employed as d,erapcu.ic agents to block receptor-ligand interacdon 
Membrane-bound proteins can also be employed for screening of potential peptide or small molecule inhibitors of 
*e relevam receptor/ligand intentcdon. Such membtane-bound proteins and cell receptors include, but are not limited 
.0. cytokine receptors, receptor kinases, receptor phosphatases, receptors involved in cell-ceU interacuons and 
ccuular adhesin molecules like selectins ^d integrins. Transduction of signals that regulate cell growth' and 
Mcrennation b regulated in pan by phosphorylation of various cellular proteins. Protein tyrosine kinases, enzymes 
catalyze that process, can also act as growth factor receptors. Examples include f.bt.b.as. gt^wth factor receptor 
and nerve growth factor receptor. 

30 ^'^""^'^'^^'^•^^'^"bybothindus.O'andacademia.oidemifynew.^.dve^^^^^ 

bouml receptor proteins, particularly those having sequence identity with HNK-. sulfotransferase. Ma.^ cffortt are 
focused on the screening of mammalian recombinant DNA libraries to identify the coding sequences for novel 
secreted a«l membrane-bound receptor proteins. Examples of screening methods and techniques are described in 
.he lueramre |see. for example. Klein et al.. Pro.. NmI, Ar.d ^ri. 22:7108-7113 (1996); U.S. Paten. No 

35 5.536,637)J. The results of such efforts are provided herein. 

77. PRO47 4 

Enzymatic proteins play important roles in the chemical reactions involved in the digesoon of foods the 
bH>symhesis of macromolccules. the comroUed release and utilization of cheniical energy, and other proc«ses 
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necessary lo sustain life^icose dehydrogenase fiincrions in ihe oxidation of glucose to gluconate to generate 
me.abol.caUy useful energ>'. The regulation of the PQQ-linkcd glucose dehydrogenase in different organisms .s 
reviewed in Neijssel. ct al.. Amnnr ' 56(1):51-61 (1989). Glucose dehydrogenase functions as 
an auxiliary energy generating mechanism, because it is maximally synthcs.zed under conditions of energy stress. 
In addition to molecules related to glucose dehydrogenase, all novel proteins are of .merest. Extracellular 

5 and membrane-bound proiems play unponant roles in the formation, differemiatton and maintenance of multicellular 
organisms. TTie fate of many individual cells, e.g.. proliferation, migration, differentiation, or interaction with ottier 
cells, is typically governed by informauon received from other cells and/or the immediate environment. TTiis 
infomiation is often transmined by secreted polypeptides (for instance, mitogcnic factors, survival factors, cytotoxic 
factors, differentiation factors, neuropeptides, and hormones) which are. in turn, received and intetpreted by diverse 

10 cell receptors or membrane -bound proteins. TTiese secreted polypeptides or signaling molecules normally pass 
through the cellular secretory pathway to reach their site of action in the extracellular enviroranent. usually at a 

membrane-bound receptor protein. - 

Secreted proteins have various industrial applications, including use as pharmaceuticals, diagnostics, 
biosensors and bioreactors. In faa. most protein drugs available at present, such as thrombolytic agents, interferons. 

15 interleukins. aythropoietins. colony stimulating factors, and various other cytokines, are secretory proteins. Their 
receptors, which are membrane-bound proteins, also have potential as therapeutic or diagnostic agents. Receptor 
immunoadhcsins. for instance, can be employed as therapeutic agents to block receptor-ligand interaction. 
Membrane-bound proteins can also be employed for screening of potential peptide or small molecule inhibitors of 
the relevant recep.or/ligand interaction. Such membrane-bound proteins and cell receptors include, but are not limited 

20 to cytokine receptors, receptor kinases, receptor phosphatases, receptors involved in celUell interactions, and 
cellular adhesin molecules like selectins and integrins. Transduction of signals that regulate cell growth and 
differentiation b regulated in pan by phosphorylation of various cellular protems. Protein tyrosine kinases, enzymes 
that catalyze that process, can also an as growth factor receptors. Examples include fibroblast growth factor receptor 

and nerve giowih factor receptor. 
25 Efforts aie being undertaken by both industry and academia to identify new. native secreted and membrane- 

bound receptor proteins, and panicularly cellular proteins and those related to dehydrogenase or oxidoreductase. 
Maiy efforts are focused on the screening of mammalian recombinant DN A libraries to identify the coding sequences 
for novel secreted and membrane-bound receptor proteins. Examples of screening methods and techmques are 
described in the literamre Isee. for example. Klein et al.. Pro. m Arad. S« .. 22:7108-7113 (1996); U.S. Patent 
30 No. 5 .536.637)]. The results of such efforts are presented herein. 



78. mam 

It has been reported that the cytokine intcrieukin 17 (IH7) stimulates epithelial, endothelial, and fibroblastic 
cells to secrete cytokines such as IL-6, IL-8. and granulocyte-colony-stimulating factor, as well as prostaglandin E2 
35 Moreover, it has been shown that when culwred in the presence of lL-17. fibroblasts could sustain proliferation of 
CD34.h preferential maturation into neutrophils. THus it has been suggested that lL-17 constitutes an eariy initiator 
of the T ceU-dependent inflammatory reaction and/or an element of the cytokme network that bridges tite immune 
system to hematopoiesis. See, Yao. e. al.. Unuimcl- 155(12):54g3-5486 (1995): Fossiez. et al.. I F.«g . Mcd - 
183«5):2593.2603 (1996); Kennedy, et al.. T Imrrfmn rviQkine RCS- m):6\ 1-617 (1996). TTm. proteins related 
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10 IL-17 are of interest 

More generally, all novel proteins are of interest. Extracellular proteins play an important role m the 
fornauon, differentiation and maintenance of multicellular organisms. The fate of many individual cells, e.g.. 
proliferation, migration, differentiation, or interaction with other ceUs. is typically governed by information received 
from other cells and/or the immediate environment. TOs information is often transmitted by secreted poh^eptides 
(for instance, mitogenic factors, survival factors, cytotoxic factors, differentiation factors, neuropepudes. and 
hormones) which are. m mm. received and interpreted by diverse cell receptors or membrane-bound protems. n«se 
secreted polypeptides or signaling molecules nomvdly pass through the cellular secretory pathway to reach cheir site 
of action in the extracellular environment. 

Secreted proteins have various industrial applications, including pharmaceuticals, diagnostics, biosensors 
and blorcactors. Most protein dmgs available at present, such as thrombolytic agems. interferons, imerleukins. 
erythropoietins, colony stimulaUng factors, and various other cytokines, are secretory proteins. Their receptors!' 
which are membrane proteins, also have potential as therapeutic or diagnostic agents. 

Efforts are being undertaken by both industty and academia to identify new. native secreted protems 
particularly those related to IL-17. Many efforts are focused on the screening of mammalian recombmant DNA 
hbranes to identify the coding sequences for novel secreted proteins . Examples of screening methods and techniques 
are described in the li.eranire [see. for example. Klein ei al.. Prn. M»,l a.,h 22:7108-7113 (1996); U.S. 
Patent No. 5.536.637)). The results of such efforts are presented herein. 

79. PR093y 

Protein disulfide isomeiasc is an enzymatic protein which is involved in the promotion of correct refolding 
of proteins through the esuWishment of correct disulfide bond formation. Protein disulfide isomerase was initially 
identified based upon its abiUty to catalyze the renamration of reduced denanired RNAse (Goldberger et al. LJiflL 
Qcm. 222:1406-1410 (1964) and Epstein et al.. Cold .Snrino u.^. n,.^^^ pj-, 22:439-449 (1963)) 

Pmtcin disulfide isomerase has been shown to be a residem enzyme of the endoplasmic reticulum which is retained 
25 in the endoplasmic reticulum via a -KDEL or -HDEL amino acid sequence at its C-tenninus. Protein disulfide 
isomerase and related proteins are farther described in Uboissiere. et al.. J. Biol Ch^ 22Q(ffi:28006-28009 
(1995); Jeenes. et al.. fiaig. 122(2}: 151-156 (1997); Koivunen. et al.. Qsasmm. 42a}:397-404 (1997): Desilva et 
al.. DNACgliPio l.. 15£D:9-16 (1996): Freedman. e. al. Trend, .n R.nrh.n, 12:331-336 (1994); Bulleid N J 
Advance, i n Prorrhrm 44:125-50 (1993): and Noiva. R.. PrcExp andhtrifiranon 5: 1-13 (1994). n*,* swdies 
30 indicate the importance of die identification of protein disulfide related proteins. 

More generally, and also of interest are all novel membraneibound proteins and receptors. Such proteins 
can play an imponan role in the formation, differentiation and maintenance of multicellular organisms. The fate of 
many imlividual cells, e.g.. proliferation, migration, differentiation, or interaction with other cells, is typically 
governed by information received from other cells and/or the munediate enviromnent. TOs informanon is often 
35 transmitted by secreted polypeptides (for instance, mitogenic factors, survival factors, cytotoxic factors. diRcrennaaon 
factors, neuropepfides. and hormones) which are. in mm. received and imerpreted by diverse cell receptors or 
membtane-bound proteins. Such membtane-bound proteins and ceU recepton include, but are not limited to. cytokine 
receptors, receptor kinases, receptor phosphatases, receptors involved in cell-cell imeracdons. and cellular adhesin 
molecules like selectins and integrins. For instance, transduction of signals diat regulate cell growth and 
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diiferendation is rcguiaiedX by phosphorylanon of various cellular proteins. Protein tyrosine kinases, enzymes 
to catalyze that process, can also act as growth factor receptors. Examples include fibroblast growth factor receptor 

and nerve growth factor receptor. 

Membrane-bound proteins and receptor molecules have various industrial applications, including as 
phannaceudcai and diagnostic agents. Receptor itnmunoadhesins. for instance, can be employed as therapeutic agents 
to block receptor-ligand interaction. The membrane-bound proteins can also be employed for screemng of potential 
peptide or small molecule inhibitors of the relevant rcceptor/ligand interaction. 

Given the importance of membrane bound proteins, efforts are under way to .demity novel membrane bound 
proteins Moreover, given the importance of disulfide bond-forming enzymes and the.r potential uses in a number 
of different appUcadons. for example in increasmg the yield of correct refolding of recombinantly produced proteins, 
effons are cunenUy being undertaken by bodt industry and academia to identify new. nattve proteins having sequence 
idemity with protein disulfide isomerase. Many of these efforts are focused on the screening of mammalian 
recombinant DNA Ubraries to identify the coding sequences for novel protein disulfide isomerase homologs. 

We herein describe the idendficarion and characterization of a novel polypeptide having homology to protein 
disulfide isomerase. 
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80. PRO1082 

■nt low density lipopiotein (LDL) receptor is a membiane-bound protein diat plays a key role in cholesterol 
homeostasis, mediating cellular uptake of lipoprotein panicles by high affinity binding to its ligands. apolipoprotein 
(apo) B-lOO and apoE. The ligand-binding domain of *e LDL receptor contains 7 cysteine-rich repeats of 
approximately 40 amino acids, wherein each repeat contains 6 cysteuies. which form 3 imra-repea, disulfide bonds, 
■niese unique smicmral fean^s provide the LDL receptor with its ability co specifically interact with apo B-lOO and 
apoE thereby allowing for transport of diese lipoprotein particles across cellular membranes and meubolism of their 
components. Soluble fragments containing the extracellular domain of dte LDL receptor have been shown to retain 
Oic ability to intent with its specific lipoprotein ligands (Simmons et al.. ) Piftl.Ctem. 272:25531-25536 (1997)). 
25 LDLrecep.orsaiefiuiherdescribedinJavit..EASEai..9(l3):1378-l38l (1995) and Herz and Willnow. AniL±JI 
AeuLSfii.. 737:14-19 (1994). Thus, proteins having sequence identity with LDL receptors are of interest. 

More generally, all membrane-bound proteins and receptors can play an miportant role in the formation, 
differemiation and maintenance of multicellular organisms. The fate of many individual cells, e.g.. proliferation, 
migration, differemiation. or interaction with other cells, is typically governed by information received from other 
cells and/or the immediate environment. TMs information is often transmined by secreted polypeptides (for instance, 
mitogenic factors, survival factors. cytott,xic factors, differentiation factors, neuropeptides, and hormones) which are. 
in nirn, received and interpreted by diverse cell receptors or membrane-bound proteins. Such membrane-bound 
proteins and ceU receptors inchide. but are not limited to, cytokine receptors, receptor kmases. receptor phosphatases . 
receptors involved in cell<eU interactions, and cellular adhesin molecules like selecims and integrins. For instance. 
35 tnuBduction of signals that regulate cell growth and differentiation is regulated m par. by phosphorylation of various 
cellular proteins. P«>tein tyrosine kfaiases, enzymes that catalyze that process, can also act as growth factor 
receptors. E«mples include fibroblast growth factor receptor and nerve grow* factor receptor. Of particular 
interest are membrane bound proteins that have type II transmembrane domains. 
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Membfane-bound proteins and receptor molecules have various induS applications, including as 
phannaceudcal and diag«»uc agems Receptor mununoadhesins. for .nsunce. can be employed as a,erapcu.ic agents 
«) block receptor-ligand .nteracuon. The membrane-bound proteins can also be employed for screening of pocemtal 
pepude or small molecule inhibitors of ihe relevant receptor/ligand mteraction. 

Efforts are Hm being undenaken by both industry and academia to identify new. native protems. particularly 
membrane bound proteins including type 11 transmembrane bound proteins. Many efforu are focused on the 
screemng of mammalian recombmant DNA libraries to identify the coding sequences for novel receptor proteins 
The results of such efforu are provided herein. 

81. PR01(ffi3 

Of panicular interest are membrane bound proteins chat belong to the seven transmembrane (7TM) receptor 
superfamily. Examples of these receptors include G-protein coupled receptors such as ion receptors Another 
ex^iple of a 7TM receptor superfamily member is described in Os.erhoff. e, al.. DMAXsILBifll-. 16(4):379.389 

Membrane-bound proteins and receptor molecules have various industrial appUcations. including as 
pharmaceutical and diagnostic agents. Receptor immunoadhesins. for instance, can be employed as therapeutic agents 
.0 block receptor-ligand interaction. TT« membrane-bound proteins can also be employed for screening of potential 
pepude or small molecule inhibitors of the relevant receptor/ligand interaction. 

Efforts are being undenaJcen by both industry and academia to identify new. mitive receptor proteins Many 
efforts are focused on the screening of mammalian recombinant DNA libraries to id«,tiiy the coding seq«e«:es for 
novel receptor proteins. The results of such efforts are presemed herein. 

82. PRO200 

Polypeptides imroWed in survival, proliferation and/or differentiauon of cells are of interest. Polypeptides 
known to be imroNed in the survival, proliferation and/or differentiation of cells include VEGF and members of the 
25 bone morphogenetic protein family. TTtcrefore. novel polypeptides which are related to eiti«r VEGF or the bone 
morphogenetic piottin are of interest. 

The heparin-binding endothelial cell-growth factor. VEGF. was identified and purified from media 
condiuoned by bovine pimitary follicular or folIiculo-steUate cells over several year, ago. See Ferrara er al. 
Byys. Res. Corn.. 161, 851 (.989). VEGF is . naturally occurring compoumi that is produ«d in follicular oi 
30 folhculo-stellate cells (FC). a morphologically well characterized population of gramalar ceUs. TTte FC are stellate 
cells that send cytoplasmic processes between secretory ceUs. 

VEGF is expressed in a variety ofussues as multiple homodimeric forms (121. 165. 189 and 206 amino 
acds per mo«,mer) resulting from alternative RNA splicing. VEGF,,, is a soluble mitogen that does not bind 
hepann; the lo,«er forms of VEGF bind heparin with progressively higher affinity. The heparin-binding forms of 
VEGF can be cleaved in the carboxy termimis by plasmin to release (a) diffusible form(s) of VEGF Amino acid 
^^'^ of .he carboxy tenninal peptide identified after plasmin cleavage is Arg.,.-Ala.... Amino terminal "core" 
Piotm. VEGF (1-1 10) isolated as a homodimer. binds neutializing monoctonal antibodies (4.6. 1 and 2E3) and soluble 
forms of FMS-like tyn«ine kinase (FLT-1). kinase domain region (KDR) and fetal liver kinase (FIX) receptors wuh 
simlar affinity compared to the intact VEGF,», homodimer. 
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As noted. VEoHraains two domains that are responsible respectively for binding to the KDR and FLT-1 
receptois. Tliesc receptors exist only on endothelial (vascular) cells. As cells become depleted in oxygen, because 
of trauma and the like. VEGF production increases in such cells which then bind to the respective receptors in order 
10 signal ultimate biological effect. The signal then increases vascular permeability and the cells divide and expand 
to form new vascular pathways - vasculogenesis and angiogenesis. 

Thus. VEGF is useful for treating conditions in which a selected action on the vascular endothelial cells, 
in the absence'of excessive tissue growth, is important, for example, diabetic ulcers and vascular injuries resulting 
ftom trauma such as subcutaneous wounds. Being a vascular (artery and venus) endothelial cell growth factor. VEGF 
restores cells that are damaged, a process referred to as vasculogenesis. and stimulates the formulation of new 
vessels, a process referred to as angiogenesis. 

VEGF would also find use in the restoration of vasculature after a myocardial infarct, as well as other uses 
to can be deduced. In this regard, inhibitors of VEGF are sometimes desirable, particularly to mitigate processes 
such as angiogenesis and vasculogenesis in cancerous cells . 

Regarding the bone morphogenetic protein famUy. members of this family have been reported as being 
involved in the differentiation of cartilage and the promotion of vascularization and osteoinduction m preformed 
15 hydroxyapatitc. Zou. et al.. fioifisJJsv. (U.S.). ll(17):219l (1997); Uvine. et al.. Ann ?\m . Sur g- 39(2):158 
(1997). A number of related bone morphogenetic proteins have been identified, all members of die bone 
morphogenetic protein (BMP) family. Bone morphogenetic native and mutant proteins, nucleic acids encodiiMS 
Aercfor related compounds including receptors, host cells and uses are fiirther described in at least: U.S. Patent Nos. 
5.670.338; 5.454.419; 5,661.007; 5.637.480; 5.631.142; 5.166.058; 5.620.867; 5.543.394; 4.877.864; 5.013.649; 
55.106.748: and 5.399.677. Of particular interest are proteins having homology witii bone morphogenetic protein 
I, a procollagen C-proteinase that plays key roles in regulating matrix deposition. 

m present invention is predicated upon research intended to identify novel polypeptides which are reUued 
m VEGF and the BMP family, and in particular, polypeptides which have a role in the survival, proliferation and/or 
differentiation of cells. While the novel polypeptides are not expected to have biological activity identical to the 
known polypeptides to which they have homology. ti« known polypeptide biological activities can be used to 
deitrmine the relative biotogical activities of tiw novel polypeptides. In particular, the novel polypeptides described 
herein can be used in assays which are intended to determine the abiUty of a polypeptide to induce survival. 
proUferation or differentiation of cells. In turn, tite results of Uiesc assays can be used accordingly, for diagnostic 
and therapeutic purposes. The results of such research is tite subject of die present invention. 

30 

83. pBQMS ntwl Plt0286 

The cloning of die Toll gene of Drosophila, a maternal effect gene titat plays a central role in the 
esublishment of the embryonic dorsal-venlral pattern, has been reported by Hashimoto « a/.. CfiU 52, 269-279 
(1988). The Drosophila ToH gene encodes an integral membrane protein wiUi an extracytoplasmic domain of 803 
35 amiiK) acids and a cytoplasmic domain of 269 amino acids. Tim extracytoplasmic domain has a potential membrane- 
spanning segment, and contains multiple copies of a leucine-rich segmem. a structural motif found in many 
transmembrane proteins. THe Toll protein controls dorsal-ventral patterning in Drosophila embryos and activates 
the transcription factor Dorsal upon binding to its Ugand Spatzle. (Morisato and Anderson. CfiU 26. 677-688 (1994).) 
InaduhZ)«wc^ii. the ToU/Dorsal signaling patiiway participates in the anti-fungal immune response. (Lenaiire et 
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al.. CfiJl M. 97^983 (1996).) 1^ 

A human homologue of the Drosophila Toll protein has been described by Medzhitov et al.. Hm^ M 
394-397 (1997). TOs hunum Toll. jus. as Drosophila Toll, is a type 1 transmembrane protein, with an exoacellular 
domain consisting of 21 tamlemly repeated leucine-rich morifs (leucine-rich regton - LRR). separated by a non-UR 
region, and a cytoplasmic domain homologous to the cytoplasmic domam of the human interleukin-l OL-l) receptor 
5 A constinitively active mutant of the human Toll transfected into human cell lines was shown to be able to induce 
Ae acdvadon of NT-kB and the expression of NF-KBH»„,rolled genes for die inflammatory cytokines IL- 1 IL-6 and 
IL-8. as wen as the expression of the constimulatory molecule B7.1. which is required for the activation ofnative 
T cells, h has been suggested that ToU functions in venebrates as a no„<lonal receptor of the immune system, which 
can induce signals for activating both an innate and an adaptive immune response .n venebrates. Tie hmnan Toll 

10 gene reported by Medzhitov e, al., supro was most strongly expressed in spleen and peripheral blood leukocytes 
(PEL), and the authors suggested that tts expression m other tissues may be due to the presence of macrophages and 
dendntK cells, in which it couW act as an early-waning system for infecuon. Tl>e public GenBank daubase contains 
the following ToU sequences: ToUl (DNAM HSU88540-1. which is identical with the random sequenced full-length 
CDNA #HUMRSC786-1): TolI2 (DNAX# HSU88878-1): Toll3 (DNAX# HSU88879-1): and Toll4 (DNAX# 

15 HSU88880-1. which is identical wid, the DNA sequence reponed by Medzhitov e. al.. supra) A pantal Toll 
sequence (Toll5) is available from GenBank under DNAX# HSU88881-1. 

Further human homologues of the Drosophila Toll protein, designated as TolHike receptors (huTLRsl-5) 
were recentfy cloned and shown to minor die topographic strucnire of the Drosophila co«nten,an (Rock « filfifi. 
Nat l , Acad..S,i 25. 588-593 [1998]). Overexpression of a consdtu.ively active mutant of one human TLR 

20 (ToU-protein homologue - Medzhitov « ./.. supra: TLR4 - Rock « al.. supra) leads to Ac activation of NF-kB and 
mdttction of the inflammatory cytokines and constimulatory molecules. Medzhitov et al., supra. 

PR02I3-1. PR0133ft «nH PBnitf? 

Cancer is characterized by the increase in die number of abnormal, or neoplastic, cells derived from a 
25 normal tissue which prolifcmte to fomi a tumor mass, the invasion of adjacem assues by d«se neoplastic tumor cells 
and the gei«ratio„ of malignant cells which evennuUly spread via the blood or lymphatic system to regional lympli 
nodes and to distam sites (metastasis). In a cancerous state a cell proliferates under conditions in which nonnal cells 
would not grow. Cancer manifests iuelf in a wide variety of forms, characterized by different degrees of 
invasiveness and aggressiveness. 

30 Alteration of gene expression is intimately related to the uncontrolled cell growth and de-differentiation 

which are a common feamre of all cancers. TTie genomes of certain well studied tumors have been found to show 

decreased expression of recessive genes, usually refened to as mmor suppression ge«s. which would normally 

lUnctum to prevent malignant ceU growU,. and/or overexpression of cenain dominant genes, such as oncogenes that 

act to promote maHgnamg^wA. Eachof d«se genetic changes appears to be responsible for importing some of die 

35 «n«Bihat.maggregatc.representd«fullneoplasticphenotype(Hunter.Cell64. 1129 f 19911: Bishop CeU64 235 
248 (19911). 

A wcU known mechanism of gene (e.g. oncogene) oveitxpresskm m cancer cells is gene amplification TOs 
» a process where in Uk chn,mosome of the ancestral ceU multiple copies of a particular gene are produced The 
process mvolves unscheduled replicauon of the region of chromosome comprising the gene, followed by 
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recombination of the replicated segments back into the chromosome (Alitalo et al.. Adv. Cancer Res. 47. 235-281 
[19861). It is believed that the overexprcssion of the gene parallels gene amplification, i.e. is proportionate to the 
number of copies made. 

ProtOK>ncogenes that encode growth factors and growth factor receptors have been identified to play 
important roles in the pathogenesis of various human malignancies, including breast cancer. For example, it has been 
5 found thaiihe human ErbB2 gene (erbB2. also known as her2. or c<rbB-2), which encodes a 185-kd transmembrane 
glycoprotein .tceptor (pl85HER2: HER2) related to die epidermal growth factor receptor (EGFR). is overexpressed 
in about 25% to 30% of human breast cancer (Slamon et al.. Science 235:177-182 (19871; Slamon e. al.. Science 
244:707-712 (19891). 

It has been reponed that gene amplification of a protooncogene is an event typically involved in the more 
10 malignant forms of cancer, and could act as a predictor of clinical outcome (Schwab et al.. Genes Chromosomes 
Cancer 1. 181-193 (1990]; Alitalo et al., supra). THus. crbB2 overexpression is commonly regarded as a prcdiaor 
of a poor prognosis, especially in paiienu with primary disease that involves axillary lymph nodes (Slamon et al.. 
[19871 and [1989|. supra; Ravdin and Chamness. Gene 159:19-27 (19951; and Hynes and Stem. Biochem Biophys 
Acta 1198: 165-184 (19941). and has been linked to sensitivity and/or resistance to hormone therapy and 
15 chemotherapeutic regmiens, including CMF (cyclophosphamide, methotrexate, and fhwruracil) and anthracydines 
(Baselga et al.. Oncology 1 1 (3 Suppl I): 43-48 (19971). However, despite the association of erbB2 overexpression 
with poor prognosis, the odds of HER2-positive patients responding clinically to treamiem with taxanes were greater 
than three times those of HER2-negative patients (Ibid). A recombinant humanized anti-ErbB2 (anti-HER2) 
monoclonal antibody (a humanized version of the murine anti-ErbB2 antibody 4D5. referred to as rhuM Ab HER2 
20 orHerceptin7d)has been cUnically active in patients widi ErbB2-overexpressing metasutic breast cancers that had 
received extensive prior anticancer therapy. (Baselga et al.. J. Clin. Oncol. 14:737-744 (19961). 

The protein Notch and its homologues are key regulatory receptors in detctmining the ceU fate in various 
development processes. The pro«in Notch^. also known as int-3 oncogene, was originally idemified as a frequent 
urget in mouse mammary mmor virus (MMVS). Notch^ is believed to be a transgene which affects the 
25 differmiation capacity of stem cells and leads to neoplastic proliferation in epithelial cells. Shirayoshi et al.. Genes 
Cells 2(3): 213-224 (1997). During embryogencsis. the expression of Notch-l was detected in endothelial cells of 
blood vessels forming tissue, siKh as the donal aona. intersegmental vessels, yolk sac vessels, cephalic vessels, 
heart, vessels in branchial arches, and capillary plexuses. Notch.4 expression in these tissues was also associated 
with fik-1. the major regulatory gene of vasculogenesis and angiogenesis. Notch^ is also upregulated in vitto during 
30 dte differentiation of endothelial stem cell. The endothelial cell specific expression pattern of Notch^, as well as its 
strucmral similarity to Notch suggest that Notch-4 is an endotitelial cell specific homologue of Notch and that it may 
play a role in vaculogenesis and angiogenesis. 

85. EEQ22S 

35 Effortsansbeii^undenakenbybothindustryandacademiaw Many 

efforts art focused on ihe screening of mammalian recombinant DNA libraries to identify the coding sequences for 
novel receptor proteins. We herein describe the identification and characterization of novel transmembrane 
polypeptides, designated herein as PR0298 polypeptides. 
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86. PRQ337 

Neuronal dcvelopmem in higher vencbra.es is characterized by processes a.at must successfully navigate 
d^anc. cellular environmem en route to their synapdc u.gets. Tl, result is a foncnonaliy precise formation of neural 
crcutts. The precision is believed to result form mechanisms that regulate growth cone pathfmding and target 
recpgnmon. followed by latter refinement a«l remodeling of such projections by events that require neuronal activity 
Goodman and Shatz. Cell/Neuron JSuppl.1 72(10): 77-98 (1993). is further evident that different neurons extend 
nerve fibe„ to are biochemically distinct and rcfy on specific guidance cues provided by cell-cell, cell-matru and 
chemotrophic interactions to reach their appropriate synaptic targets, Goodman et al.. supra. 

One particular means by which diversity of the neuronal ceU surface may be generated is through differenUal 
express,on of cell surface proteins referred to as ceU adhesion molecules (CAMs). Neuronally expressed CAM, have 
been unpUcated in diverse developmental processes, including migration of neurons along radial glial cells, providing 
perm.ssive or repulsive substrates for ncuri.e extension, and in promo.mg the selecdve fasciculation of axons in 
projectional pathways. Jessel, Neuron I: 3-13 (1988): Edelman and Crossin. Annu. Rev. Biochem. 60- 155-190 
(1991). Intentctions between CAlVls presem on the growd, cone membrane and molecules on opposing cell 
membranes or in the extracellular matnx are thought to provide the specify guidance cues that direct nerve fiber 
ou^rowth along appropriate projectional pathways. Such interactions are likely to resuh in the acUvauon of various 

second messenger systems within the growth cone that regulate neurite outgrowth. Doherty and Walsh. Curr Opin 
Neurobiol. 2: 595-601 (1992). 

In higher ver^bratc,. most neural CAMs have been found to be members of three major smtcmral families 
of proteins: the integrins. the cadh«ms. and the immmioglobuUn gene superfamily OgSF). Jessel. supra.; Takelchi 
Anm.. Rev. Biochem. 59: 237-252 (1990): Reichard. and Tomaselli. Annu. Rev. Neurosd. 14: 531-570 (1991) Celi 
adhesion molecules of the IgSF (or Ig-CAMs). in parucular. constitute a large family of proteins frequently implicated 
m neural cell interactions and nerve fiber outgrowth during development. Salzer a«l Colman. Dev. Neurosd 11- 
377-390 (1989): Bnimmendorf and Rathjen. J. Neurochem. 61: 1207-1219 (1993). However, the majority of 
nuumnalian Ig-CAMs appear to be too widely expressed to specify navigational pathways or synaptic targets 
suggesting dtat other CAMs. yet to be idemified. have role in mese more selective interactions of neurons 

Many of the known neural Ig-CAMs have been found to be atutched to the plasma membrane via a 
glycosylphosphaiittylinositol (GPI) anchor. Additionally, many studies have implicated GPl-anchored proteins in 
pnwKhng specific guidance cues during the outgrowth on neurons in specific pathways . In smdies of the grasshopper 
nervous system, treatment of embryos with phosphatidyli«,sitol-specific phopholipase C (PIPLC), which selectively 
removes GPI-anchored proteins from the surfaces of ceUs. resulted in misdirection and faulty navigation among 
subsets of pioneering growth cones, as well as inhibited migratory patterns of a subset of early neurons. Chang et 
al..Devel. ,14:507-519(1992). The projection of redad fibers «, .he optic tecmm appears to depend, in pare on 

a33kDaGPI.anchon«lproidn,however.thcp,ecisenamreof,hispm,einisunknown. Stahletal Ncuron5-735- 
743 (1990). 

•n« expression of various GPl-anchored proteins has been characterized amongst the differem populations 
of pnmary ra, neurons amongst dorsal root ganglion, sympathetic neurons of the cervical ganglion. sympati«,ic 
neurons of the superior cervical ganglion, and cerebellar granule neurons. Rosen ei al.. J. CeU Biol. 1 17- 617-627 
(19W). in comras. to the similar panem of touU membrane protein expression by these different types of neurons 
smlong differences were obsenred in the expression of GPI-anchored prateins ben^een these neurons. Recemly. a 
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65 kDa protein band known as ncurotriniin was discovered and found to be differentially expressed by prin«r>- 
neurons (Rosen c. al.. supra), and restricted to *e nervous system and found to be .he most abundant and earhest 
expressed of the GPl-anchored species in the CNS. Stn^yk et al.. J. Neuroscie«:e 15(3): 2141-2156 (1995). n^e 
discovety of neurotiimin has further lead to the identification of a family of IgSF members, each conutimng three Ig- 
Uke domains that share sigi^an. amino acid identity. .k,w tem«d IgLON. Stmyk et al., supra: Pimenu ct al.. Gene 
170(2): 189-95 (1996). 

Additional members of the IgLON subfamily include opiate binding cell adhesion molecule (OBCAM). 
Schofield ec al.. EMBO J. 8: 489495 (1989); limbic associated membrane protein (LAMP). Pimenta et al.. supra: 
CEPU-l: GP55. Wilson et al.. J. CeB Sci. 109: 3129-3138 (1996): Eur. J. Neurosci. 9(2): 33441 (1997); and 
AvGp50, Hancox et al.. Brain Res. Mol. Brain Res. 44(2): 273-85 (1997). 

While the expression of neurotrimin appears to be widespread, it does appear to correlated with .he 
development of several neural crcuits. For example, between E.8 and PIO. neurotimin mRNA expression withm 
Ute forebrain is maintained at high levels in neurons of the developing thalamus, cortical subplate. and cortex, 
particularly laminae V and VI (with less intense expression in 11. IL and IV. and minimal expression in lamina I). 
Cortical subplate neurons may provide an early, temporary scaffold for the ingrowing thalamic affe^nts en route to 
their final synaptic targets in the cortex. Allendoerfer and Shau. Anm.. Rev. Neurosci. 17: 185-218 (1994). 
Conversely, subplate netm,ns have been suggested to be required for cortical neurons from layer V to select VI to 
grow into the thalamus, and neurons from layer V to select their targets in dte collicuhis. pons, and spinal cord 
(McConnelletal.. J. Neurosci. 14: 1892-19(^(1994). TTKhighlevelexpressionofneurotrimininmany of these 
projections suggests that it cotild be involved in their development. 
20 inthe Wndbrain. high levels of neurotrimin message expression were observed within the pontine mK:leus 

and by the internal granule cells and Puricinje cells of the cerebellum. The pontine nucleus received afferent .nput 
from a variety of sources including corticcpondne fibers of layer V. and is a major source of affcrem input, via mossy 
fibers to the granule cells which, in turn, are a major source of afferent input via parallel fibers to Purldnjc cells. 
fPaUyandChan-Palay.Tl,ecerebellarcortex:cytologyandorgamzation. New York: Springer (19741. High level 
25 expression of neurotrimin these neurons again suggests potemial involvement in the estabUshmem of these circotts. 

NeuHXriminalsoexhibi«tagnKWexprtssionpa..ernintheearly^^ Increased neurotrimm 

expression is ft«nd t^^rlying the dorsolaieral sniatum of the rat. while lesser hybridization intensity is seen overling 
the venaomedial striatum. Strttyk e. al.. supra. TOs region of higher neurotrimin hybridization intensity does not 
correspond to a cytoarchitecmrally differemiable region. raO^er it corresponds to the primary area of affcrem mpm 
30 from layer VI of the contralateral sensorimotor cortex (Gerfen. Nature 311: 461464 (1984); Donoghue and 
Herkenham. Brain Res. 365: 397403 (1986)). m ventromedial striatum, by comrast. receives the majonty of ,ts 
afferent input from 0^ perirtunal and associadon cortex. It is noteworthy that a complemenuty gnuled panem of 
LAMP expression, has been observed wiOtin dte striatium. with highest expression in venuomedial regtons. and 
lowestexpressiondorsolaterally. Uvitt. Science 223: 299-301 (,985): Chesseletetal..Neu««cience 40: 725-733 

35 (1991). 



^^transmembrane proteins, also known as single pass transmembrane proteins have an N-terminal 
portion lodged in the cytoplasm while the C-terminal portion is exposed to extraceUtUar domain. 
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Endothelii, is a family of vasocons.nc.or pepude, abouc which much Sy has been focused bener 
undersund .« basic phannacological. biochemical ar.d molecuh. biological feamres. i™iud.ng .he presence ^ 
s.™c.re Of .sopep^des and *eir genes fe..o.e,i„-,. -2 ^ 03,, regulacion of gene expression. i„.race,l.ar 

TT "'"""^ """"'"^ '"^^^ ^■«>- ^-cellular sig^ 

transduction foUowmg receptor activation, etc . 

The endo.helin (Fr) family of peptides have potent vascular, cardiac and renal actions which may be of 
pathophys.o.og.ca, importance m many human disease states, ET-l is expressed as an inactive 212 amil acid 

Lys91 and Arg92 are tnmmed from d,e protem to generate the propeptide big ET-l 

cataW. dT?"" ' ^ -m 

cau.y«d by the ztnc meulloprotease. endothelin converting enzyme (ECE). ECE was recently Coned and it, 

structure was shown" to be a single pass transmembrane protein with a short intracellular N-terminal and a long 
ex.race lu.ar Cermina. d«. conuins the catalytic domain and numerous N-g.ycosylation sites, ECEs cleave d, 
et^othehn propeptide between Tn.73 and Va.74 producing the active peptide. ET. which appears to fi^nction as a 
0^ rather than a ctrculating hormone fRubanyi. CM. * Pololcoff. M.A.. Pharmachologica. Reviews 46. 325^15 
(1994). Thus ECE activity is a potendal site of regulation of endothelin production and a possible target for 
dterapeunc mtervendon in ^ endothelin syst«„. By hlocldng ECE activity, i. is possible s.op d.e production of ET-l 
by tnhibitmg d,e conversion of the relatively inactive precursor, big ET-l. to dte physiologically active fonn 

Endo*elins may phy roles in the pathophysiology of a number of disease su.es including: I, cardiovascular 
diseases vasospasm, hypenension. myocardial ischemia: reperfUsion injury and acute myochardial infarction. s„ote 
(cerebri .schemia). congestive hean failure, shock, atherosclerosis, vascular duclcening); 2) kidney disease (acute 
^ chron^ renal feyure. glomen^onephritis. c^osis); 3) lung disease (bronchial asthma, pulmonary hypertension, 
)gas«u«esanald.sorders(gastric^r.^.ory bowel diseases): 5)^^^^^^^ 
dysmenorhea, preeclampsia) and 6) carcinogenesis. Rubanyi & Polokoff. supra. 

SUMMARY 9F THF I^PTM^^a^^ 

1. PR02I 3 

Applicants have identified a cDNA clone .ha. encodes a novel polypeptide, wherein tite polypeptide is 
designated m the presem application as •PR02 13", 

in one embodiment, dte invention provides an isolated nucleic acid molecule comprising DNA encoding a 
PR0213 polypepude. In one aspect, dte isolated nucleic acid comprises DNA encodmg .he PR02I3 polypeptide 
tav^add resi^^s I . 295 of Pi^re 2 (SEQ ID N0:2). or is complem^^^^ 
sequence and remamssublybou«.«.i,mKler a. least moderate..^ 

'n anoUKrembodimem. die invemion provides isolated PR02I3pol^^^^^^ In parncular. the invention 
P^vides tsolned^tive sequence PR02I3 polypeptide, which in one embodiment, includes an ammo acid sequence 
con^mg residues 1 to 295 of Figure 2 (SEQ ID N0:2). 
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2. PR0274 

Applicants have identified a cDNA clone that encodes a novel polypeptide, wherein the polypeptide is 
designated in the present application as " PR0274" 

In one embodiment, die mvention provides an isolated nucleic acid molecule comprising DNA encoding a 
PR0274 polypeptide. In one aspect, the isolated nucleic acid comprises DNA encoding the PR0274 polypeptide 
5 having amino acid residues I to 492 of Figure 4 (SEQ ID N0:7). or is complementary to such encodmg nucleic acid 
sequence, and remains stably bound to it under at least moderate, and optionally, under high stringency conditions. 
TTk isolated nucleic acid sequence may comprise die cDNA insert of die DNA39987.1 184 vector deposited on April 
21. 1998 as ATCC 209786 which includes the nucleotide sequence encoding PR0274. 

In anodier embodiment, the mvention provides isolated PR0274 polypeptide. In particular, die invention 
10 provides isolated nadve sequence PR0274 polypeptide, which in one embodiment, includes an amino acid sequence 
comprising residues 1 to 492 of Figure 4 (SEQ ID N0:7). An additional embodiment of the present invention is 
directed to an isolated extracellular domain of a PR0274 polypeptide. Optionally, die PR0274 polypeptide is 
obtained or is obtainable by expressing die polypeptide encoded by die cDNA insert of the DNA39987.1 184 vector 
deposited on April 21. 1998 as ATCC 209786. 
15 In another embodmiem. the mvention provides diree expressed sequence ugs (EST) comprising the 

nucleotide sequences of SEQ ID N0:8 (herein designated as DNA17873). SEQ ID N0:9 (herein designated as 
DNA36157) and SEQ ID NO: 10 (herein designated as DNA28929) (see Figure 5-7. respactively). 

3. £Bi22fifi 

20 Applicants have identified a cDNA clone diat encodes a novel polypeptide, wherein die polypeptide is 

designated in die present application as "PRO300". 

In one embodiment, the invemion provides an isolated nucleic acid molecule comprising DNA encoding a 
PRO300 polypeptide. In one aspect, die isolated nucleic acid comprises DNA encoding the PRO300 polypeptide 
having amino acid residues 1 to 457 of Figure 9 (SEQ ID N0:19). or is complementary to such encoding nucleic acid 
25 sequence, and remains stably bound lo it under at least moderate, and optionally, under high stringency conditions, 
•me isolated nucleic acid sequence may comprise die cDNA insert of die DNA40625-1 189 vector deposited on April 
21. 1998 as ATCC 209788 which includes die nucleotide sequence encoding PRO300. 

In anodicr embodiment, die invention provides isolated PRO300 polypepude. In particular, die invcnuon 
provides isotated native sequence PRO300 polypeptide, which in one embodiment, includes an amino acid sequence 
30 comprising residues 1 to 457 of Figure 9 (SEQ ID N0:19). An additional embodimem of die presem invention is 
directed to an isolated extracellular domain of a PRO300 polypepude. Optionally, die PRO300 polypeptide is 
obtained or is obtainable by expressing die polypeptide encoded by die cDNA insert of die DNA40625-1 189 vector 
deposited on April 21. 1998 as ATCC 209788. 

35 4. 

Applicants have idemified a cDNA done diat encodes a novel transmembrane polypeptide, wherein die 
polypeptide is designated in die presem application as "PR0284". 

In one en*odimem. die invention provides an isolated nucleic acid molecule comprising DNA encoding a 
PR0284 polypeptide. In one aspect, die isolated raicleic acid comprises DNA encoding the PR0284 polypeptide 
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havu,ganu„o«.d«sidues . «, 285 of Figure .1 (SEQ ID NO:28). or is con,p.c2Ly .o such encod.ng „uc,e.c 
aod sequence, and ren.u.s subiy bo^d .0 ic under a. ,eas. n,odera,e. and opuonaliy. under h.h stringency 
condu,ons. In other aspecu, .he isolated nucleic acid comprues DNA encoding the PR0284 polypeptide having 
ajntno acd residues about 25 to 285 of Ftgure . , (SEQ ID NO:28) or , or about 25 to X of Ftgure 1 1 (SEQ ID 
NO:28). Where X is any amino acid fro. 71 to 80 of Figure M {SEQ ID NO:28). or ts complementarv to such 
encoding nucie.c acid sequence, and renins stably bound to it under a. leas, tnoderate, and opuonaliy. under high 
strtngency condiuons. TT,e isolated nucleic acid sequence may comprise *e cDNA insert of the DNA233.8-.2. 1 
vector deposited on April 21. ,998 as ATCC 209787 which includes the nucleotide sequence encodmg PR0284 

In another embodiment, the invention provides isolated PR0284 polypeptide. In particular, the invenUon 
provtdes isolated «.ave sequence PR0284 polypeptide, which in one embodiment, includes an amino ac.d sequence 
•0 compnstng residues . to 285 of Figure 1 , (SEQ ID NO:28). .Additional embodiments of the present inventton are 
ducted to isolated PR0284 polypeptides comprising amino acids about 25 to 285 of Figure 1 1 (SEQ ID NO-28) or 
1 or about 25 to X of Figure I, (SEQ ID NO:28). where X is any amino acid from 11 .0 80 of Figure U (SEQ ID 
NO:28)^ OpdonaUy the PR0284 polypeptide is obtained or is obtainable by expressing the polypept.de encoded by 
ti.ecDNA.nsenoftheDNA233I8-121I vectordepositedon April 21. 1998 as ATCC 209787 

uNAii982 which comprises che nucleotide sequence of SEQ ID NO:29. 
UINAI5886 which comprises the nucleotide sequence of SEQ ID NO:30. 
20 5. PR0296 

Applicants have identified a cDNA Cone dut encodes a novel polypeptide having homology to the sarcoma- 
amphfKd protem SAS. wherein the polypeptide is designated m me presem application as -PR0296 ■ 

oon,n. ""^'""^ =°™P^'^i"8 I^NA encoding a 

PR0296 polypeptide. In one aspect, the isolated ^^leic acid comprises DNA encoding the PR0296 polypeptide 
havmg amino acid residues I to 204 of Figu« 15 (SEQ ID NO:36), or is complememary .0 such encoding ™,cleic 
acKl sequerKe. and renuins subly bound u, it u«.er a. least moderate, and optionally, under high stringency 
condtttons. In other aspects, the isolated nucleic acid comprises DNA encoding the PR0296 polypeptide having 
anuno acid residues about 35 .0 204 of Figure 15 (SEQ ID NO:36) or amino acid 1 or about 35 ,0 X of Figure 15 
(SEQ ID NO:36). where X is any amino acid from 42 .0 5 1 of F.gure 15 (SEQ ID NO:36). or is complememary .0 
«.c encoding nucleic acid sequence, and remains stably bound to it under at least moderate, and op.io«tlly under 
htgh stnnjency conditions, m isolated nucleic acd sequence may comprise Ote cDNA insen of the DNA39979. 
1213 vector deposited on April 21. 1998 as ATCC 20^89 which includes the nucleotide sequence encoding PR0296 
In anoth« embodiment, tite invention provides isolated PR0296 polypeptide. In particular, the mvention 
Picvtdes isolated native sequence PR0296 polypeptide, which m one embodiment, includes an amino acid sequence 
comprising residues 1 .0 204 of Figure 15 (SEQ ID NO:36). Additional embodiments of ti,e present invention are 
directed .0 PR0296 polypeptides comprising amino acids about 35 to 204 of Figure 15 (SEQ ID NO:36) or amino 
««! I or about 35 .0 X of Figu« 15 (SEQ ID NO:36). where X is any ami«> acid frmn 42 .0 51 of Figure 15 (SEQ 
ID NO:36). Optionally, the PRQ296 polypeptide is obtained or is obtainable by expressing dK polypeptide encoded 
by die CDNA insen of the DNA39979.1213 vector deposued on April 21. 1998 as ATCC 209789. 
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In another embodiment, the invention provides an expressed sequence tag iEST) designated herein as 
DNA23020 comprising the nucleotide sequence of SEQ ID NO-.37. 

In another embodiment, the invention provides an expressed sequence tag (EST) designated herein as 
DNA21971 comprising the nucleotide sequence of SEQ ID NO:38 

In another embodiment, the invention provides an expressed sequence tag (EST) designated herein as 
DNA29037 comprising the nucleotide sequence of SEQ ID NO:39. 

6. £B£2222 

AppUcants have identified a cDNA clone that encodes a novel polypeptide having homology to a high affinity 
immunoglobulin F. receptor, wherein the polypeptide is designated in the present application as ••PR0329-. 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a 
PR0329 polypeptide. In one aspect, the isolated nucleic acid comprises DNA encoding the PR0329 polypeptide 
having amino acid residues 1 to 359 of Figure 20 (SEQ ID NO:45). or is complememary to such encoding nucleic 
acid sequence, and remains su.bly bound to it under at least moderate, and opuonally. under high stringency 
conditions. The isolated nucletc acid sequence may comprise the cDNA insert of the DNA40594-1233 vector 
deposited on Febniaty 5. 1998 as ATCC 209617 which includes the nucleotide sequence encoding PR0329. 

in another embodiment, the invention provides isolated PR0329 polypeptide. In particular, the invention 
provides isolated nadve sequence PR0329 polypeptide, which in one embodiment, includes an amino acid sequence 
comprising residues I to 359 of Figure 20 (SEQ ID NO:45). Optionally, the PR0329 polypeptide is obtained or is 
obtainable by expressing the polypeptide encoded by the cDNA insert of the DNA40594-1233 vector deposited on 
20 February 5, 1998 as ATCC 209617. 

7. SBQ2SI 

AppUcants have identified a cDNA clone that encodes a novel polypeptide having homology to A33 anugen 
a«l HCAR membrane-bound protein, wherein the polypeptide is designated in the present application as "PR0362". 

in one embodiment, the invenuon provides an isolated nucleic acid molecule comprising DNA encoding a 
PR0362 polypeptide. In one aspect, the isolated nucleic acid comprises DNA encoding the PR0362 polypeptide 
having amino acid residues 1 to 321 of Figure 22 (SEQ ID NO:52). or ts complementary to such encoding nucletc 
acid sequence, and remains stably bound to I. under at least moderate, and optionally, under high stnngency 
conditions. In another aspect, the isolated nucleic acid comprises DNA encoding the PR0362 polypeptide havmg 
amino acid residues 1 to X of Figure 22 (SEQ ID NO:52) where X is any amino acid from amino acid 27 1 to 280. 
or is complemeniaiy to such encoding nucleic acid sequence, and remains stably bound to it under at least moderate. 
^ opdonally. under high stringency conditions. TTie isolated nucleic acid sequence may comprise the cDN A msert 
of the DNA45416-1251 vector deposited on February 5, 1998 as ATCC 209620 which Includes the nucleotide 

sequence encoding PR0362. 

In another embodiment, the invenuon provides isolated PR0362 polypeptide. In particular, the invention 
provides isolated native sequence PR0362 polypeptide, which in one embodimem. mcludes an amino acid sequence 
convrising widues 1 to 321 of Figure 22 (SEQ ID NO:52). An addioonal embodiment of the present invention is 
directed to an isolated extracellular domam of a PR0362 polypeptide compnsmg ammo acids I to X of the amino 
acid sequence shown in Figure 22 (SEQ ID NO:52). wherein X is any amino acid from amino acid 271 to 280. 
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OpdanaUy.a«PR0362po.ypepdde»obu««dorisobuu»b.c5ye^ 
uBcn of Ac DNA45416-125I vector deposited on Febnuuy 5. 1998 as ATCC 209620. 

Applicants have identified a cDNA clone that e™:odes a novel polypepdde having homology to the cell 
5 surface receptor protein HCAR. wherein the polypeptide is designated in the present appUcation as -PR0363- 

In one cbodi.ent. the invention provides an isolated nucleic acid nK,lecule con^risi^g DNA encodmg a 
PR0363 polypepude. .„ one aspect, the isolated nucleic acid comprises DNA encoding the FR0363 polypeptide 
havmg anuno acid res.dues 1 to 373 of Figure 24 (SEQ ID NO:59). or is contplententaty to such encoding mlleic 

co^huons. h another aspect, the isolated nucleic acid cotnprises DNA eroding a PR0363 extracellular domain 
po^de having aminoacid residues 1 toX of Figure 24 (SEQ ID NO:59) where X is any amino acid from amino 
aodZie toammoacid225. or is complementary tosuch encoding nucleic acid sequence, and remains stablv bou«. 
«. .t under at leas, moderate, and optionally, under high stringency conditions, m isolated nucleic acid siquence 
may comprise the cDNA insea of the DNA45419-,252 vector deposited on February 5. ,998 as ATCC 209616 

15 Which uiciudes the nucleotide sequence encoding PR0363. 

In another embodiment, the invention provides isolated PR0363 polypeptide. In particular, the .nvenUon 
provides isolated native se<,tence PR0363 polypeptide, which in one embodiment, includes an amino acid se<,ue«. 
composing residues 1 .o 373 of Figure 24 (SEQ ,D NO:59). An additional embodiment of the prescminvemion is 
d.rected to an isolated extracellular domain of a PR0363 polypepude. wherein that extracellular domain may 

20 compraseaminoacids 1 -oX of the sequence shown in figure 24 (SEQ ,D NO:59). whc« X U any «nmo acid J 

anuno acid 216 to 225. Optionally, the PR0363 polypeptide is obtained or is obtainable by expressit^ *e 

polypeptide encoded by the cDNA insert of the DNA45419 i^s^ a u r< u 

' "'^ u[NA*»D4i»^-1252 vector deposited on February 5. 1998 as ATCC 

209616. 



25 9. PR086y 

Applicams have identified a cDNA clone that encodes a «,vel polypeptide having homology to mmor 
«cros.s factor receptor, wherein the polypepdde is designated in the present application as -PR0868" 

in one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a 
PR0868 polypeptide. In one aspect, the isolated nucleic acid comprises DNA encoding ^ PRO868 polypepUdc 

30 havmg amino acid residues 1 - 655 of Figure 26 (SEQ ID NO:64). or is complementaty ,0 such encoding nucteic 
ac^ sequence, a^l remains stably bound to it under a. leas, moderate, and opuonally. u«ler high stringency 
condtnons. In another aspect, the isolated nucleic acid comprises DNA encoding the PR0868 polypeptide havi^r 
anuno acid residues 1 to X of Figure 26 (SEQ ,D NO:64). where X is any amino acid from amino acid 343 to 352 
Of *e sequence Shown in figure 26 (SEQ ID NO:64). or is complementary .0 such encoding nucleic acid sequence 

»5 and remains stably bound to it under at leas, moderate, and optionally, under high stringency condtdons. In yet' 
a»oU>er aspec., ^ iso,a.ed m«:leic acid comprises DNA encoding the PR0868 polypepdde having amino acd 
rcstdues X to 655 of Figure 26 (SEQ ID NO:64). where X is any amino acid from amino acid 37, to 380 of the 
sequence shown in Figure 26 (SEQ ,D NO:64,. or is compiemenutry to such encoding nuCetc acid sequence and 
remams stably bound .0 u«ter a, leas, moderate, and opnonaJly. under high stringency conditions. Tl.e .so'lated 
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™clc.c acid sequence nuy comprise *e cDNA insen of ^ DNA52594-1270 vector deposHed on March 17. 1998 
as ATCC 209679 whkh includes the nucleotide sequence encoding PR0868. 

to another embodimem. the invention provides isolated PR0868 polypepade. In particular, the invention 
provides isolated nanve sequence PR0868 polypeptide, which in one en.bodin«:nt. includes an amino acid sequence 
comprising residues 1 to 655 of Figure 26 (SEQ ID NO:64). In another aspect, the isolated PR0868 polypeptide 
5 comprises amino acid residues 1 to X of Figure 26 (SEQ ID NO:64). where X is any amino aod from amino acid 
343 to 352 of the sequence shown in Figure 26 (SEQ ID NO;64). In yet another aspect, the PR0868 polypeptide 
comprises amino acid residues X to 655 of Figure 26 (SEQ ID NO:64). where X is any ammo acid from amino acid 
371 u, 380 of the sequence shown in Figure 26 (SEQ ID NO:64). Optionally, the PR0868 polypeptide is obiamed 
or is obtainable by expressing the polypeptide encoded by the cDNA insert of the DNA52594-1270 vector deposited 
10 on March 17. 1998 as ATCC 209679. 

10. P«0382 

Applicants have identified a cDNA clone that encodes a novel polypeptide having homology to sermc 
proteases, wherein the polypeptide is designated in the present appUcation as "PR0382". 

to one embodimem. the invention provides an isolated nucleic acid molecule comprismg DNA encoding a 
PR0382 polypeptide. In one aspect, the isolated nucleic acid comprises DNA encoding the PR0382 polypeptide 
having amino acid residues 1 to 453 of Figure 28 (SEQ ID NO:69). or is complemenury to such encoding nucleic 
acid sequence, and remains subly bound to it under a. least moderate, and optionally, under high stnngency 
conditions. IT* isolau^d nucleic acid sequence may comprise the cDNA insert of the DNA45234.1277 vector 
deposited on March 5. 1998 as ATCC 209654 which includes the nucleoude sequence encodmg PR0382. 

to another embodimem. the invention provides isolated PR0382 polypeptide. In particular, the mvendon 
provides isolated nadve sequence PR0382 polypeptide, which in one embodiment, includes an amino acid sequence 
comprising residues 1 to 453 of Figure 28 (SEQ ID NO:69). An additional embodimem of the presem invcnuon ts 
di^cted to an isolated extraceUular domato of a PR0382 polypeptide, wim or without the signal peptide. Optionally, 
the PR0382 polypeptide is obtatoed or is obtamable by expressing the polypeptide em»ded by the cDNA msen of 
the DNA45234-1277 vector deposited on March 5. 1998 as ATCC 209654. 
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11. 

Applicams have idemified a cDNA clone that encodes a novel polypeptide having homology to meltrm. 
30 wherem the polypeptide is designated in the presem application as •PR0545 " 

to one embodimem. the invention provides an isolated nucleic acid molecule comprising DNA encodmg a 
PR0545 polypeptide, to one aspect, the isolated nucleic acid comprises DNA encodmg the PR0545 polypeptide 
having ammo acid residues 1 to 735 of Figure 30 (SEQ ID NO:74). or is complememary to such em^dmg micletc 
acid sequence, and remams stably bound to it under at leas, moderate, and optionally, under high stnngency 
35 conditions. The isolated nucleic acid sequence may comprise the cDNA msert of the vector deposited on March 5. 
1998 as ATCC 209655 which includes the nucleotide sequence encodmg PR0545. 

toanother embodimem. the invention provides isolated PR0545 polypeptide. In particular, the invemion 
provides isolated native sequence PR0545 polypeptide, which to one embodtoiem. mcludes an ammo a«d sequence 
compristog residues 1 to 735 of Figure 30 (SEQ ID NO:74) . An additional embodimem of ti« presem mvermon » 
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polypes. |.d.,i,„,^i„ 4. p„,„,^^„^^^.^^^^,. "»IW»CDa4. 

»^«-~.,.^„„^.^^„„^„^^^,,^ ^^^^^^ J^^ 
5. ISM as ATCC 209656 .tod, ,„tod= ^iMi* «,„to e.codln, PRO617 

P»«^»«1».n».„^™.PR06,,p„,,p,«*.«.»^^.,^^^^^ J^^ 

compnsingresidues I to67ofFiBure33fSEOinMn.B<v « ■ ,. ^m"c"cc 
„K„i„,K. K ^ °P""^"y. the PR0617 polypeptide is obtained or is 

13. PRO70P 

p™». ,»„^. ^ p,^^ „ ^^^^ ^ ^^^^ ^^^^^^^ ^ " 

ISW as ATCC 209721 wl«h ,™i„d.s mdad* so^, encodirj PROTOO 

h ™,„.ta.„, i„„uo, provides p„o,00 p,„p.p«.. partclT. i„™a.„ 

»»pn™« ■ „ 4,2 or Fi^ 35 <SEQ ,0 NO:,0, to p^l. „ 

Bolaied PROTOO polypeptide absent the signal sequence which inrh,rf« =n , . f es an 

ftnn.»K- .,0 ,r an ammo acid sequence comprising residues 

ftomahout34.o432ofFigu,e35(SEQIDNO:90). Optionally ihePRO700nniv„.n,iM • k . ■ 

. ^ . ^""^y- *^'*^^00 polypeptide IS obtamed or is obtainable 

oy expressing the polypeptide encoded bv the cDNa mc.« .u 

t~ ncooeo oy tDe cDNA insen of the vector deposited on March 31. 1998 as ATCC 
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14. PRO702 

Applicants have identified a cDNA clone that encodes a novel polypeptide having homology to conglutinm. 
wherein the polypeptide is designated in the present application as "PRO702". 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a 
PRO702 polypeptide. In one aspect, the isolated nucleic acid comprises DNA encoding the PRO702 polypeptide 
5 having amino acid residues 1 to 277 of Figure 37 (SEQ ID NO:97). or is complementary to such encoding nucleic 
acid sequence, and remains stably bound to it under at least moderate, and optionally, under high stringency 
conditions. In another aspect, the isolated nucleic acid comprises DNA encoding the PRO702 polypeptide having 
amino acid residues 26 to 277 of Figure 37 (SEQ ID NO:97). or is complementary to such encoding nucleic acid 
sequence, and remains stably bound to it under at least moderate, and optionally, under high stringency condinons. 
10 Tlie isolated nucleic acid sequence may comprise the cDNA insert of the DNA50980-1286 vector deposited on March 
31, 1998 as ATCC 209717 which includes the nucleotide sequence encoding PRO702. 

In another embodiment, the invention provides isolated PRO702 polypeptide. In particular, the invention 
provides isolated native sequence PRO702 polypeptide, which m one embodiment, includes an amino acid sequence 
comprising residues 1 to 277 of Figure 37 (SEQ ID NO:97). An additional embodiment of the present invemion is 
15 directed to an isolated PRO702 polypeptide comprising amino acid residues 26 to 277 of Figure 37 (SEQ ID NO:97). 
Optionally, the PR07(K polypeptide is obtained or is obtainable by expressing the polypeptide encoded by the cDNA 
insert of the DNA50980-1286 vector deposited on March 31, 1998 as ATCC 209717. 

15. PltO703 

20 Applicants have identified a cDNA clone thai encodes a novel polypeptide having sequence similarity to 

VLCAS. wherein the polypeptide is designated in the present application as "PRO703". 

In one embodiment, die invention provides an isolated nucleic acid molecule comprising DNA encoding a 
PRO703 polypeptide. In one aspect, the isolated nucleic acid comprises DNA encoding the PRO703 polypeptide 
having amino acid residues 1 to 730 of Figure 39 (SEQ ID NO:102), or is complementary to such encoding nucleic 
25 acid sequence, and remains stably bound to it under at least moderate, and optionally, under high stringency 
conditions. In another aspect, the isolated nucleic acid comprises DNA encoding the PRO703 polypeptide having 
amino acid residues from about 43 to 730 of Figure 39 (SEQ ID NO:102). or is complcmemaiy to such encoding 
nucleic acid sequence, and remains subly bound to it under at least moderate, and optionally, under high stringency 
conditions. The isolated nucleic acid sequence may comprise the cDNA insert of the DNA50913-1287 vector 
30 deposited on March 31. 1998 as ATCC 209716 which includes the Ducleotidc sequence encoding PRO703. 

In another embodiment, the invention provides isolated PRO703 polypeptide. In panicular. the invention 
provides isolated native sequence PRO703 polypeptide, which in one embodimem, includes an ammo acid sequence 
comprising residues 1 to 730 of Figure 39 (SEQ ID NO: 102). In another embodimem. tiic invention provides an 
isolated PRO703 polypeptide absent the signal sequence, which includes an amino acid sequence comprising residues 
35 from about 43 to 730 of Figure 30 (SEQ ID NO: 102). Optionally, the PRO730 polypeptide is obtained or is 
obuunable by expressing the polypeptide encoded by the cDNA insert of the DNA50913-1287 vector deposited on 
March 31, 1998 as ATCC 209716. 
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16. PRQTqg 

Applicanis have identified a cDNA clone thai encodes a novel polypeptide hav.ng homology ,o K-glyp.can. 
wherein the polypeptide is designated in the present application as "PROVOS" . 

In one embodiment, the mveniion provides an isolated nucleic acd molecule comprising DNA encoding a 
PRO705 polypeptide. In one aspect, the isolated nucleic add comprises DNA encodmg the PRO705 polypeptide 
havu« amino acid «sidues 1 to555of Figu«41 (SEQ ID NO: 109). or is compiemenun, to such encoding nucleic 
acd sequence, and remains subly bound to it under a. leas, moderate, and optionally, under high stringency 
conditions. In another aspect, the isolated nucleic acid comprises DNA encoding the PRO705 polypeptide having 
ammo acid residues about 24 ,o 555 of Figure 41 (SEQ ID NO: 109), or is complemeataty to such encoding mjcleic 
acd sequence, and remains stably bound to it under a, least moderate, and optionally, muter high stringency 
conditions. n,e isolated nucleic acid sequence may comprise the cDNA insert of the DNA50914-1289 vector 
deposited on March 31. 1998 as ATCC 209722 which includes nucleotide sequence encodmg PRO705. 

in another embodiment, the invention provides isolated PRO705 polypeptide. In particular, the invention 
Pi^vides isolated native sequence PRO705 polypeptide, which in one embodiment, includes an amino acid seque«:e 
comprising residues 1 to 555 of Figure 41 (SEQ ID NO:,09). An additional embodmien, of the presem invention 
IS directed to an isolated PRO705 polypeptide comprising amino acid residues about 24 to 555 of Figure 41 (SEQ 
ID NO:109). Optionally, the PRO705 polypeptide is obtained or is obtainable by expressing the polypeptide encoded 
by die CDNA insert of the DNA50914-1289 vector deposited on March 31 . 1998 as ATCC 209722. 

17. PR07(M^ 

20 Applicants have identified a cDNA clone that encodes a novel polypeptide having homology to the aryl 

sulfatases. wherem the polypeptide is designated in the present application as •pRO708". 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a 
PRO708 polypeptide. In one aspect. ti,e isolated nucleic acid comprises DNA encoding tfie PRO708 polypeptide 
havn« amino acid residues 1 to 515 of Figure 43 (SEQ ID NO: 1 .4), or is complememary to such encoding nucleic 
acd sequence. ^ remains stably bound to it under at least moderate, and optionally, under high stringency 
condiuons. Tlie isolated micleic acid sequence may comprise die cDNA insert of the DNA48296-129^ vector 
deposited on March 11. 1998 as ATCC 209668 which includes Uk nucleotide sequence encoding PRO708. 

In another embodiment, die invention provides isolated PRO708 polypeptide. In particular, the invention 
pwvides isolated lative sequence PRO708 polypeptide, which in one embodiment, includes an amino acid sequence 

30 comprising residues 1 to 515 of Figure 43 (SEQ ID N0:114). Another embodiment is directed to a PRO708 
polypepnde comprising residues 38-515 of the amino acid sequence shown in Figure 43 (SEQ ID N0114) 
Opoonally. ^ PRO708 polypeptide is obtained or is obtainable by expressing U« polypeptide encoded by U« cDNA 
insert of die DNA48296-1292 vector deposited on March 1 1 . 1998 as ATCC 209668. 

35 18. PR03M 

Applicants have identified a cDNA clone that encodes a novel polypepnde having homology to fibulin. 
wherem die polypeptide is designated in die presem application as "PRO320". 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a 
PRO320 polypeptide, to one aspect, die isolated micleic acid comprises DNA encoding die PRO320 polypeptide 
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having amino acid residues 1 to 338 of Figure 45 (SEQ ID N0:1 19). or is complemenuty .0 such etKoding nuclcc 
acid sequence, and remains sably bound to i. under at least moderate, and opUonaUy. under high strmgcncy 
conditions. Tl.e rotated nucleic acid sequence may comprise the cDNA iiBcrt of the vector deposited on March 11. 
1998 as ATCC 209670 which includes the tmcleotide sequence encoding PRO320. 

In another embodimcm. the invention provides isolated PRO320 polypepude. to particular, the mvemion 
5 provides isoUued nadve sequence PRO320 polypepdde. which in one embodiment, includes an amino acid sequence 
comprising residues 1 lo 338 of Figure 45 (SEQ ID N0:1 19). Optionally, the PRO320 polypeptide is obuined or 
is obuunable by expressing .he polypeptide encoded by the cDNA tnsert of the vector deposited on March 11 . 1998 
as ATCC 20%70. 

10 19. EB£^ 

Applicants have identified a cDNA clone that encodes a novel polypeptide having homology lo 
oxidoreductases. wherein the polypeptide is designated in the present application as "PR0324-. 

to one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a 
PR0324 polypeptide. In one aspect, the isolated nucleic acid comprises DNA encoding the PR0324 polypeptide 
15 havingaminoacidresidues I » 289 of Figure 47 (SEQ ID NO: 124). or is complementary to such encoding nucleic 
acid sequence, and remains stably bound to U under at least moderau^. and optionally, under high stnngency 
conditions. In other aspects, the isolated mKleic acid comprises DNA encoding the PR0324 polypept,de having 
amino acid residues 1 or about 32 to X of Figure 47 (SEQ ID NO: 124). where X is any amino acid from 131 to 140. 
or is complemenmry to such encoding nucleic acid sequence, and remains stably bound to it under a. least moderate. 
20 and optionally, under high stringency conditions, m isolated nucleic acid sequence may compnse the cDNA mser. 
of the DNA36343.1310 vector deposited on March 30. 1998 as ATCC 209718 which includes the nucleotide sequence 
encoding PR0324. 

,n another embodiment, the invention provides isolated PR0324 polypeptide. In particular, the tnvenuon 
provides isolated native sequence PR0324 polypeptide, which in one embodiment, inchules an amino acid sequence 
25 comprising residues 1 to 289 of Figure 47 (SEQ ID NO:124). The invemion also provides isoUted PR0324 
polypeptide comprising residues 1 or about 32 to X of Figure 47 (SEQ ID NO:124), wherein X is any amino acid 
ftomabout 131-140. OptionaUy. ti« PR0324 polypeptide is obtained or is obtainable by expressing the polypepode 
encoded by the cDNA insert of the DNA36343-1310 vector deposited on March 30. 1998 as ATCC 209718. 

30 20. tBsasi . . 

Applicants have identified a cDNA clone that encodes a novel polypeptide having sequence smulaniy to 
prostasin. wherein the polypeptide is designated in the present application as •PR035r. 

in one embodiment. ti« invention provides an isolated nucleic acid molecule comprising DNA encodmg a 
PR035I polypeptide. In one aspect, the isolated nucleic acid comprises DNA encoding the PR0351 polypeptide 
35 having amino acid residues 1 to 571 of Hgure 49 (SEQ ID NO:132). or is complementary to such encoding nucleic 
acid sequence, and remains stably bound to it under a. least moderate, and optionally, under high strmgency 
conditions. In another aspect, the isolated nucleic acid comprises DNA encoding the PR0351 polypeptide havmg 
amino add residues about 16 to 571 of Figure 49 (SEQ ID NO:132). or is complememary to such encodmg nucle.c 
add sequence, and remains stably bound to h under a. least moderate, and optionally, under high strmgency 
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conditions. The isolated nucieic acid sequence may comprise U>e cDNA imen of the DNA4057M315 vector 
deposited on April 21 . 1998 as ATCC 209784 which includes the nucleotide sequence encoding PR035 1 . 

In another embodimem. the invention provides isolated PR0351 polypept.de. In parucular, the invention 
provides isolated nadve sequence PR0351 polypeptide, which in one embodiment, .ndudes an amino acid sequence 
comprising residues 1 to 57, of Figure 49 (SEQ ID NO: 132). In another embodimem. the invention provides an 
.solated PR0351 polypepdde absem the sig«U sequence, which includes an amino acd sequence comprising residues 
from about 16 to 571 of Figure 49 (SEQ ID NO:132). Optionally, the PR0351 polypeptide is obtained or is 
obtainable by expressing die polypeptide encoded by the cDNA insen of the DNA4057I-I315 vec«,r deposited on 
April 21. 1998 as ATCC 209784. 

21. EEQ3S2 

Applicants have identified a cDNA clone that encodes a novel polypeptide having homology to butyrophilin. 
wheretn die polypeptide is designated in the presem application as •PR0352". 

In one embodimem. the invemion provides an isolated nucleic acid molecule comprising DNA encoding a 
PR0352 polypeptide. In one aspect. d>e isolated raicleic acid compnses DNA encoding the PR0352 polypeptide 
hav„,g amino acid residues 1 to 316 of Figure 5 1 (SEQ ID N0:137). or is complementary to such encoding mtcleic 
acd sequence, and remains subly bound to .t u«|er at leas, moderate, and optionally, under high stringency 
condutons. In other aspects, the isolated nucleic acid compr«es DNA encoding the PR0352 polypeptide having 
ammo acid residues of about 29 to 316 of Figure 51 (SEQ ID NO:I37). or 1 or about 29 ,o X of Figure 51 where 
X >s any amino acid from 246 to 255. or is complementary to such em:oding m^leie acid sequence. ^ renuUn, 
stably bound to i, umler at least moderate, and optionally, under high strir^ency conditions. TTie isola«d ««:Ieic acid 
sequence may comprise the cDNA insert of the DNA41386-1316 vector deposited on March 26. 1998 as ATCC 
209703 which includes the nucleotide sequence encoding PR0352. 

In anod«r embodiment, die invemion provides isolated PR0352 polypeptide. In particular, die invention 
provides isolated native sequence PR0352 polypeptide, which in one embodimem. includes an amim, acid sequence 
comprising residues 1 to 316of Figure 51 (SEQ ID NO:137). In other embodiments, the invention provides isolated 
PR0352 polypepndc comprising residues about 29 to 316 of Figure 51 (SEQ ID NO:137) and 1 or about 29 to X of 
Figure 51 (SEQ ID NO:137). wherein X is any amim, acid from 246 to 255. Optionally, the PR0352 polypeptide 
a obtami or is obtainable by expressing d« polypeptide emroded by d« cDNA insert of the DNA41386-1316 vecuir 
deposited on March 26. 1998 as ATCC 209703. 

22. £KQ281 

Applicams have idemified a cDNA clone dia. encodes a novel polypeptide having homology «, inununophilin 
protems. wherein the polypeptide is desigmiied in die presem application as -PR0381" 

In one embodimem. the invemion provides an isolated nucleic acid molecule comprising DNA encoding a 
PR0381 polypeptide, to one aspect, the isolated nucleic acid comprises DNA encoding die PR0381 polypeptide 
having amim, acid residues 1 to 21 , of Figure 53 (SEQ ID NO: 145). or .s complementary to such encoding nucleic 
acKl sequence, ami remains stably bound to i, umler a. leas, moderate, ami optionally, under high stringency 
combuons. In another aspect. d« isolated nucleic acid comprises DNA emroding die PR0381 polypeptide having 
anum, acid residues abou. 21 to 21 1 of Figure 53 (SEQ ID NO:145). or is complementary to such encoding mieleic 
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acid sequence, and remains stably bound lo it under at least moderate, and opdonally. under high stringency 
conditions, "me .solated nucleic acid sequence nuty con.pr.se the cDNA insert of .he DNA44194-1317 vector 
deposited on April 28. 1998 as ATCC 209808 which includes the nucleot.de sequence encoding PR0381 . 

In another embodiment, the invention provides isolated PR038I polypeptide. In panicular. the invention 
provides isolated nauve sequence PR0381 polypeptide, which in one embodiment, mcludes an amino acid sequence 
5 comprising residues 1 to 211 of Figure 53 (SEQ ID NO:145). Another embodiment is directed to a PR0381 
plypeptide comprising amino acids about 21 to 211 of Figure 53 (SEQ ID NO:145). Optionally. a.e PR0381 
polypeptide is obtained or is obtainable by expressing the polypeptide encoded by the cDNA insert of the DNA44194- 
1317 veaor deposited on April 28. 1998 as ATCC 209808. 

10 23. mm 

Applicants have iden.if.ed a cDNA clone that encodes a novel polypeptide having homology to the beu.2 
subunit of a sodium channel, wherein the polypeptide is designated in the present application as •PR0386- . 

In one embodiment, the invention provides an isoteted nucleic acid molecule comprising DNA cncodmg a 
PR0386 polypeptide. In one aspect, the isolated nucleic acid comprises DNA encoding the PR0386 polypeptide 
15 havi.v5 3n>i«> acid residues I ,o 215 of Figure 55 (SEQ ID NO: 150). or is complementary to such encoding nocle.c 
acid sequence, and remains stably bound to it under at least moderate, and optionally, under high stringency 
conditions In another aspect, the isolated mKleic acid comprises DNA encoding the PR0386 polypeptide having 
amino acid residues about 21 to 215 of Figure 55 (SEQ ID NO: 150) or 1 or about 21 to X. where X is any ammo 
acid from 156 to 165 of Figure 55 (SEQ ID NO:150). or is complementary to such encoding nucleic acid sequence. 
20 and remains stably bouml to it under a. least moderate, and opdonally, under high stringency conditions. THe isolated 
OKleic acid sequence may comprise the cDNA insert of the DNA4541M318 vector deposited on April 28. 1998 as 
ATCC 209810 which includes the nucleotide sequence encoding PR0386. 

in another embodiment, the invention provides isolated PR0386 polypeptide. In particular, the invention 
provides isolated native sequence PR0386 polypeptide, which in one embodiment, includes an amino acid sequence 
25 comprising residues I to 215 of Figure 55 (SEQ ID NO:150). Other embodiments of the present invention are 
directed to PR0386 polypeptides comprising amino acids about 21 to 215 of Figure 55 (SEQ ID NO:150) and 1 or 
about 21 to X of Figure 55 (SEQ ID NO:150). wherein X is any amino acid from 156 to 165 of Figure 55 (SEQ ID 
NO:150). Optionally, the PR0386 polypeptide is obtained or is obtamable by expressing the polypeptide encoded 
by the cDNA insert of the DNA454 15-1318 vector deposited on April 28. 1998 as ATCC 209810. 
30 In anodier embodiment, .he invention provides an expressed sequence tag (EST) comprising the nucleotide 

sequence of SEQ ID NO: 151 which corrsponds to an EST designated herein as DNA23350. 

In another embodiment, the invention provides an expressed sequence tag (EST) comprising the mideotide 
sequence of SEQ ID NO: 152 which corrsponds to an EST designated herein as DNA23536. 

35 24. ERQ542 

Applicants have .denufted a cDNA clone that encodes a novel polypeptide having sequence smularity to 
LCAT wherein the polypeptide is designated in the present application as "PRO540" . 

' In one embodm«m. the invention provides an isolated nucleic ac.d molecule comprising DNA encoding a 
PRO540 polypeptide. In one aspect, die isolated m^leic acid comprises DNA encoding the PRO540 polypepude 

61 



BNSOOCIO- <W0 994628 1A2.L> 



wo 99/46281 



10 



PCT/US99/0S028 

having an^no acid residues 1^5,2 of Figure 59 (SEQ ID NO:157). or is compllary .o such encoding „ucl«c 
acd sequence, and ren^ subiy bound .o u under a. leas. n»dera.e. and opUonaily. under high stridency 
condmons. In another aspec, ^ isolated nucleic acid con^rises DNA encoding *e PRO540 polypepUde having 
an^o acid residues about 29 to 412 of Figure 59 (SEQ ID NO: ,57). or is con.p,en,enury to such encoding nucleic 
acd sequence, and regains stably bound .o it under at leas, moderate, and optionally, under high stringency 
condtttons. The isolated nucleic acid sequence may comprise the cDNA insen of the DNA44I89.1322 vector 
deposited on March 26. 1998 as ATCC 209699 which mcludes the nucleotide sequence encodutg PRO540 

In anod«r embodiment, the invention provides isolated PRO540 polypeptide. In pamcular. U« invention 
P««..des .solated nadve sequence PRO540 polypeptide. wh.ch in one embodiment, mcludes an amtno acid sequence 
comprtstng residues , ,o 412 of Figure 59 (SEQ ID NO:157). TT,e invention also provides isolated PRO540 
polypeptide, which in one embodiment, i^ludes an amtno acid sequence comprismg residues about 29 to 412 of 
Ftgure 59 (SEQ ID NO:I57). OptionaUy. dte PRO540 polypeptide is obtained or is obta.nable by expressing the 
polypepude encoded by the cDNA insert of the DKA44189-1322 vector deposited on March 26. 1998 as ATCC 
209699. "«rkiv-v. 
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15 2S. PRQ61? 

Applicants have identified a cDNA clone that encodes a novel polypeptide having sequence similarity to 
synaptogyrm. wherein the polypeptide is designated in the present application as -PR0615". 

In one embodiment, the invention provides an isolated nucleic acid molecule comprisi,« DNA encoding a 
PR0615 polypeptide. In one aspect, the isolated nucleic acid comprises DNA encoding the PR0615 polypeptide 

20 hav^ ammo acid ^idues I to 224 of Figure 61 (SEQ ID NO:162), or is complementary to such encoding nucleic 
actd sequence. a«l remains stably bourn, to i, under at least moderate, and optionally, under high stringency 
condtuons. In a„od,er aspect, the isolated nucleic acid comprtses DNA encoding .he PR06.5 polypepdde having 
anuno acid residues X to 224 of Figure 6, (SEQ ID NO:,62,. where X is any amino acid from 157 to ,66 or is 
complementary to encoding ™tcleic acid sequence, and remains stably bou«, to it under a. least modera.^. and 

25 opnonally. under high stringency conditions. Tl« isolated nucleic acid sequence may comprise the cDNA insen of 
the DNA48304-I323 vector deposited on AprU 28. 1998 a. ATCC 20981 , which mcludes the nucleotide sequence 
encoding PR0615. ^ 

in another embodiment, the invention provides isolated PR06,5 polypeptide. In particular, tite invention 
provides isolated native sequence PR0615 polypeptide, which in one embodiment, include, an amino acid sequence 
comprising residues 1 to 224 of Figure 61 (SEQ ID NO: 162). An additional embodiment of the present u^ntion 
» dtrec^d to an isolated extraceUular domain of a PR0615 polypeptide which comprises amino acid residues X «, 
^4 of Ftgure 61 (SEQ ID NO:162). where X is any amino acid from 157 to 166 of Figure 6, (SEQ ID NO-,62) 
Qptio^tity. the PR06I5 polypeptide is obtained or is obtainable by expressing the polypeptide encoded by *e cDNA 
insen of tiie DNA48304-1323 vector deposited on April 28, 1998 as ATCC 20981 1. 

26. PR06H 

Applicants have identified a cDNA Cone U«t e«:odes a novel polyp^tide having sequence simitarity to 
enteropepudase. wherein titt poiypeptide is designated in ti« presem application as "PROeiS". 

In one embodiment, the invention provides an tsotated nucleic acid molecule comprising DNA encoding a 
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PR0618 polypeptide. In one aspect, the isotated Mclcic acid comprtscs DNA encoding the PR0618 polj-pcptide 
having ammo acid residues I to 802 of Figure 63 (SEQ ID NO:169). or .s complementary to such encoduig t«Klc.c 
acid sequence, and remains stably bound to ii under at least moderate, and optionally, under high stringency 
conditions. In another aspect, the isolated nucleic acid comprises DNA encoding an isolated extracellular domam 
of a PR06I8 polypeptide havuig amino acid residues X to 802 of Figure 63 (SEQ ID NO- 169). where X is any ammo 
5 acid from 63 10 72 of Figure 63 {SEQ ID NO:169). or is complementary to such encoding nucleic acid sequence, and 
remains stably bound to it under at least moderate, and optionally, under high strmgency conditions. TTie isolated 
mKkic acid sequence may comprise the cDNA insert of the DNA49152-1324 vector deposited on April 28. 1998 as 
ATCC 209813 wluch includes the nucleotide sequence encoding PR0618. 

In another embodiment. *e invention provides isolated PR0618 polypeptide. In particular, the invention 
10 provides isolated native sequence PR0618 polypeptide, which in one embodiment, includes an amino acid sequence 
comprising residues I to 802 of Figure 63 (SEQ ID NO: 169). An additional embodimem of the present invention 
is directed to an isolated extracellular domain of a PR0618 polypeptide comprising amino acid X to 802 where X 
is any amino acid from 63 to 72 of Figure 63 (SEQ ID N0:169). Optionally, the PR0618 polypeptide is obtained 
or is obuiinable by expressing the polypeptide encoded by the cDNA insert of the DNA4915M324 vector deposited 

15 on April 28. 1998 as ATCC 209813 

to another embodiment, the invenuon provides an expressed sequence tag (EST) comprising the nucleotide 
sequence of SEQ ID N0:170. designated herein as DNA35597 (sec Figure 64). 



20 



27. ESQ212 

Applicants have identified a cDNA clone that encodes a novel polypeptide having homology to lipoprotem 
lipase H wherein the polypeptide is designated in the present application as ■PR0719". 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a 
PR0719 polypeptide. In one aspect, the isolated nucleic acid comprises DNA encoding the PR0719 polypeptide 
havii* amino acid residues 1 to 354 of Figure 66 (SEQ ID NO. 178). or is complementary to such encoding nucleic 
25 acid sequence, and remains subly bound to it under at least moderate, and optionally, under high stringency 
conditions. In another aspect, the isolated nucleic acid comprises DNA encoding the PR0719 polypeptide havmg 
amino acid residues about 17 to 354 of Figure 66 (SEQ ID NO:178). or is complementary to such encoding nucleic 
acid sequence, and remains stably bound to it under at leas, moderate, and optionally, under high stringency 
conditions. THe isolated nucleic acid sequence may comprise the cDNA insen of the DNA49646-1327 vector 
30 deposited on March 26. 1998 as ATCC 209705 which includes the nucleotide sequence encoding PR0719. 

In anoUier embodiment, the invention provides isolated PR0719 polypeptide. In particular, the invention 
provides isolated native sequence PR07 19 polypeptide, which in one embodimem. inchides an amino acid sequence 
comprising residues l to 354 of Figure 66 (SEQ ID NO: 178). In anotiier embodimem. die imrenrion provides isolated 
PR0719 polypeptide which comprises residues abou, 17 to 354 of Figure 66 (SEQ ID NO:178). Optionally. Oie 
35 PR0719 polypeptide is obtained or is obtainable by expressing the polypeptide encoded by tiw cDNA insert of tiie 
DNA49646.1327 vector deposited on March 26, 1998 as ATCC 209705. 
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Applicant have identifed a cDNA clone *a. encodes a novel polypeptide having homology .o U,e LDL 
receptor, wherem the polypeptide is designated in the present application as "PR0724-. 

In one embodiment, the invendon provides an isolated nucleic acid molecule con,prismg DNA encoding a 
PR0724 polypepUde. In one aspect, the isolated nucleic acid comprises DNA encoding the PR0724 polypeptide 
hav«,g amino acid residues 1 to 713 of Figure 68 (SEQ ID N0183,. or U complementary to such encoding nucleic 
a.d sequence, and remains stably bound to it under at least moderate, and opnonaJly. u«.er high stringency 
condmons. In another aspect, the isolated nucleic acid compnses DNA encoding a soluble PR0724 polypeptide 
ha^ ^0 acid residues 1 to X of Figure 68 (SEQ ID NO:,83) where X is any am.no acid from amino acid 437 
.0 446. or is complementary to such encoding nucleic acid sequence, and remains stably bound to it under at leas, 
.noderate. and optio^Uly. under high stringency conditions. T.e above two polypeptides may either possess or not 
possess Ae signal peptide. TT* isolated nucleic acid sequence may comprise the cDNA insert of the DNA4963I 1328 
vector deposited on April 28. ,998 as ATCC 209806 which includes the nucleotide sequence encoding PR0724 

In another embodiment, .he invention provides isolated PR0724 polypeptide. In panicular. the invention 
pnwules isolated native sequence PR0724 polypeptide, which in one embodiment, includes an amino acid sequence 

"Tppntr'"'"'''*'"''^''^"''^'''^- ntionprovidesisolated 
soluble PR0724 polypeptide. In panicular. ti.e invention provides isolated soluble PR0724 polypeptide which in 

one embodm«nt. includes an amino acid sequence comprising residues 1 to X of Figure 68 (SEQ ID NO-,83) where 
X.sanyaminoacidfrom437to446ofthesequenceshowninFigu,e68(SEQ,DNO:183). OptionaUy .bePR0724 
pobrepode is Obtained or is obtainable by expressing the polypeptide encoded by d« cDNA inscn of the DNA4963,- 
20 1328 vector deposited on April 28. 1998 as ATCC 209806. 

29. PR0772 

APP-^J^ve identified a CDNA clone dutt encodes a novel polypeptide havi^^ 
wherem the polypeptide is desigmtted in the presem application as *PR0772". 

25 to one embodiment. ti.e invention p«vides an isolated m^leic acid molecule comprising DNA encoding a 

PR0772 polypeptide. In one aspect, the isolated nucteic acid comprises DNA encoding Ok PR0772 polypeptide 
havmg amino acid residues 1 to 152 of Figure 70 (SEQ ID NO:,90). or is complementary to s«:h encoding nucleic 
acd sequence, and remains subly bound to it under at leas, moderate, and optionally, under high strmgency 
condutons. In other aspects. U,e isolated nucleic acid comprises DNA encoding ti« PR0772 polypeptide having 

(SEQ ID NO: 190). or « complemematy to such encoding nucleic acid sequence, and remains subly bound to U 
a. leas, moderate, and optionally, under high stringency conditions. Tl.e isolated nucleic acid sequence may comprise 
*e CDNA insert of tite DNA49645-1347 vector deposited on April 28. 1998 as ATCC 209809 which include, U.e 
nucleotide 5cquencc encoding PR0772. 

35 In another embodiment, tiie invention provides isolated PR0772 polypeptide. In particular. ti,e invemion 

pnnodes isolated native sequence PR0772 polypeptide, which in one embodimem. i«=ludes an amino acid sequence 
conipHsi^g residues 1 to 152 of Figure 70 (SEQ ID NO:190,. Additional embodiments of ti,e presem mvention 
directed to PR0772 polypeptides comprising amino acids 1 to X of Rgure 70 (SEQ ID NO:190) where X is 
anunoacidfrom2. to 30 of Figure 70 (SEQ ,D NO:, 90). Optio^tily. the PR0772 polypeptide is obt««d 
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obuunablc by expressing the polypeptide encoded by .he cDNA insert of the DNA49645-1347 vector deposited on 

April 28. 1998 as ATCC 209809. 

in another embodiment, the invention provides an expressed sequence tag (EST> designated herein as 

DNA43509 comprising the nucleotide sequence of SEQ ID NO: 191 (Figure 71). 
5 30. EBQ8S2 

Applicants have identified a cDNA clone that encodes a novel polypeptide having homology to vanous 
protease enzymes, wherein the polypeptide is designated in the present application as -PR0852-. 

in one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encodmg a 
PR0852 polypeptide. In one aspect, the .solated nucleic acid comprises DNA encoding the PR0852 polypeptide 
having amino acid residues 1 to 518 of Figure 73 (SEQ ID NO-.196). or is complementary to such encoding nucleic 
acid sequence, and remains subly bound to .. under at least moderate, and optionally, under high stringency 
conditions. In other aspects, the isolated nucleic acid comprises DNA encoding the PR0852 polypeptide havmg 
anuno acid residues about 21 to 518 of Figure 73 (SEQ ID NO:196) or I or about 21 to X of Figure 73 (SEQ ID 
N0196) where X is any amino acid from ammo acid 461 to amino acid 470 of Figure 73 (SEQ ID NO:196). or is 
15 complementary to such encoding nucleic acid sequence, and remains sutbly bound to « under a. least moderate, and 
optionally under high stringency conditions. ITte isolated nucleic acid sequence may comprise die cDNA tnsen of 
fl« DNA45493-1349 vector deposiu«l on April 28. 1998 as ATCC 209805 which includes the nucleotide sequence 
encoding PR0852. 

in anodier embodiment, die invention provides isolated PR0852 polypeptide. In particular. d« mventum 
20 provides isolated native sequence PR0852 polypeptide, which in one embodiment, includes an amino acid sequence 
comprising residues I to 518 of Figure 73 (SEQ ID N0:196). In odier embodimems. the PR0852 comprises amino 
acids about 21 to amino acid 518 of Figure 73 (SEQ ID NO:196) or amino acids 1 or about 21 to X of Figure 73 
(SEQ ID N0:196). where X is any amino acid from amino acid 461 to amino acid 470 of Figure 73 (SEQ ID 
Nb:l96). Optionally. dK PR0852 polypeptide is obtained or « obtainable by expressing Uk polypeptide encoded 
by die CDNA insert of die DNA45493-1349 vector deposited on April 28. 1998 as ATCC 209805. 
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31. EBQ8S2 

Applicants have identified a cDNA clone toat encodes a novel polypeptide having sequence smulanty to 
reductase, wherein die polypeptide is designated in da present application as -PR0853". 

in one crabodimem. dK invention provides an isolated nucleic acid molecule comprising DNA encoding a 
PR0853 polypeptide. In one aspect, die isolated nucleic acid comprises DNA encoding d« PR0853 polypeptide 
having amino acid residues 1 to 377 of Figure 75 (SEQ ID NO:206). or is complememary to such encoding mideic 
acid sequence, and remains subly bound to it under at least moderate, and optionally, under high stringency 
conditions. In anodier aspect, a« isolated micleic acid comprises DNA encoding *e PR0853 polypeptide having 
amino add residues about 17 to 377 of Figure 75 (SEQ ID N0:206). or is complemenury to such encoding nucleic 
acid sequence, and remains subly bound to it under at least moderate, and optionally, under high stringency 
conditions. The isolated nucleic acid sequence may comprise die cDNA insert of *e DNA48227-1350 vector 
deposited on AprU 28. 1998 as ATCC 209812 which includes die nucleotide sequence encoding PR0853. 

In anodier embodiment, die invention provides Uolaied PR0853 polypeptide. In particular, die invemion 
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P«wides isola^d nadvc sequence PR0853 polypepddc. which in one embodimemlludes an ammo acid sequence 
comprising residues I ,o 377 of Figure 75 (SEQ ID NO:206). anod.r embodimem. ^ invention provdes an 
™0853po.ypepddeahsc«U«signa,se.^„ce. Which includes an 

from about ,7 to 377 of Figure 75 (SEQ ID NO:206). Optionally, the PR0853 polypeptide is obtained or is 
obtamable by expressing the polyp^tide encoded by the cDNA insen of d« DNA48227-1350 vector deposited on 
April 28. 1998 as ATCC 209812. 



.1 



32. EBOm 

Applicants have .dcntified a cDNA clone that encodes a novel polypeptide having sequence similarity to 
neurofascm. wherein the polypeptide is designated in the present application as "PRO860". 

" D...^'""" """"""" ""^'^'^ '''' DNA encoding a 

PRO860 polypeptide. 1„ one aspect, the isolated nucleic acid comprises DNA encoding the PRO860 polypepude 
having ami™, acid residues . to 985 of Figure 77 (SEQ ID N0:2. 1). or is complemeaury to such encoding nucleic 
acd sequence, and remains stably bound to it under a. least moderate, and optionally, under high stringency 
condtuons. In another aspect, the isolated nucleic acid comprises DNA encoding the PRO860 polypeptide having 

15 amino acid residues 1 to X of Figure 77 (SEQ ID N0:21 ,, where X is any amino acid from 443-452 of F.gure 7^7 
(SEQ ID N0:21 1). or is complementa^. to such encoding nucleic acid sequence, and remai.« stably bound to it under 
a. leas, moderate, and opdonally. under high stringency condiuons. -n.e .solated mtcleic acid sequence may comprise 
U>e CDNA insert of the DNA4.404-,352 vector deposited on May 6. 1998 as ATCC 209844 which include, the 
nucleotide sequence encoding PRO860. 

20 In m^cr embodiment. Ae invention provides isolated PRO860 polypeptide. In panicular. me invention 

pn^vides isolated native sequence PRO860 polypeptide, which in one embodiment, includes an amino acid sequence 
comprising residues 1 to 985 of Figure 77 (SEQ ID N0:2n). m another embodiment, the invention provides an 
related PRO860pobwtide Which ii«h,des an amim, acid sequence comprising ^^^^^^^ 1 to X of Figure 77 (SEQ 
ID N0:2I1). Where X is any amino acid residue from 443 to 452 of Figure 77 (SEQ ID N0:21 1). Opuonally the 
PRO860 polypeptide is obtained or is obudnable by expressing the polypeptide encoded by d,e cDNA msert of the 
DNA41404-1352 vector deposited on May 6. 1998 as ATCC 209844. 

33. PROg46 

30 CM^.'T'" " '''''' ' P'"^'^''' «»"ri.y to 

CMRF35, wherem the polypeptide is designated in the prescm application as •PR0846". 

m one embodiment, the invemion provides an isolated nucleic acid molecule comprismg DNA encoding a 

PR0846 polypeptide. In on, aspect, the isolated nucleic acid comprises DNA encoding the PR0846 polypeptide 

having aminoadd residues I to 332 of Figure 79 (SEQ ID NO:216). or is complementary to such encoding nucleic 

acd sequence, and remains stably bound to it under at least moderate, and optionally, under high stringency 

condinons. In odter aspecu. d,e isolated nucleic acid comprises DNA encoding the PR0846 polypeptide having 

ammo acid residues about 18 to 332 of Figure 79 (SEQ ID NO:216) or 1 or abou. 18 to X of SEQ ID NO 21 6 where 

XBanyami„oacidfh»n243to252ofFigure79(SEQIDNO:2,6).oriscomplememary.osuchenc^^^ 

acd sequence, and remains subly bound to it u«.er ai leas, moderate, and optionally, under high stringency 

conditions. isohued nucleic acid sequence may comprise d« cDNA imcr. of the DNA44I96-I353 vector 
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ATCC 209847 which includes the nucleotide sequence encodmg PR0846 
In another embodiment, the invention provides isolated PR0846 polypeptide. In particular, the .nvcntion 
provides isolated native sequence PR0846 polypeptide, which in one embodiment, includes an amino acid sequence 
comprismg residues 1 to 332 of Figure 79 (SEQ ID NO:216). In other embodiments, the invention provides an 
isolated PR0846 polypeptide absent the signal sequence, which mcludes an amino acid sequence comprising residues 
from about 18 to 332 of Figure 79 (SEQ ID NO:216). Additional embodiments of the present invention are directed 
.0 an ^olated PR0846 polypeptide comprising ammo acid 1 or about 18 to X of Figure 79 (SEQ ID NO:216). where 
X is any amino acid from 243 to 252 of Figure 79 (SEQ ID NO:216). Optionally, the PR0846 polypeptide is 
ob^ or is obtainable by expressing the polypeptide encoded by the cDNA insert of the DNA44196-1353 vector 
deposited on May 6. 1998 as ATCC 209847. 

34. mossi 

Applicants have identified a cDNA clone that encodes a novel polypeptide having sequence stmilanty lo 
lysozymc. wherein the polypeptide is designated in the presem application as "PR0862" 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a 
PR0862 polypeptide. In one aspect, tiie isolated nucleic acid comprises DNA encoding the PR0862 polypeptide 
having amino acid residues I to 146 of Figure 81 (SEQ ID NO:221). or is complementary to such encoding nucleic 
acid sequence, and remains subly bound to it under a. least moderate, and optionally, under high stringency 
conditions. In another aspect, die isolated nucleic acid comprises DNA encoding die PR0862 polypeptide havmg 
amino acid residues about 19 to 146 of Figure 81 (SEQ ID NO:221). or is complementary lo such encodmg nudeK: 
acid sequence, and remains stably bound to it under at least moderate, and optionally, under high stringency 
conditions. The isolated nucleic acid sequence may comprise Uk cDNA inser. of die DNA52187-1354 vector 
deposited on May 6. 1998 as ATCC 209845 which includes the raicleotidc sequence encoding PR0862. 

in another embodiment, .he invention provides isolated PR0862 polypeptide. In particular. ti,e invention 
provides isolated native sequence PR0862 polypeptide, which in one embodiment, includes an amino acid sequence 
comprising residues I to 146 of Figure 81 (SEQ ID NO:221). In another embodiment, die invenaon provides an 
isolated PR0862 polypeptide absent the signal sequence, which includes an amino acid sequence comprumg residues 
from about 19 to 146 of Figure 81 (SEQ ID NO:22l). Optionally, tiie PR0862 polypeptide is obtained or is 
obtainable by expressing die polypeptide encoded by tiie cDNA insert of die DNA52187.1354 vector deposited on 
May 6. 1998 as ATCC 209845. 
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35. EB£2Sjid 

Applicams have identified a cDNA clone d,at encodes a novel polypeptide havmg sequence similanty to 
Wnt-4 wherein the polypeptide is designated in die presem application as "PR0864-. 

In one embodiment, the invention provides an isolated micleic acid molecule comprising DNA encoding a 
PR0864 polypeptide, ta one aspect, die isolated nucleic acid comprises DNA encoding Uie PR0864 polypeptide 
having amino add residues 1 to 351 of Figure 83 (SEQ ID NO:226). or is complementary to such encodmg nucleic 
acid sequence, and remains subly bound to it under at leas, moderate, and optionally, under high stnngency 
conditions. In another aspect, the isolated nucleic acid comprises DNA encoding the PR0864 polypeptide having 
™ add residues about 23 to 351 of Figure 83 (SEQ ID NO:226). or is complememary to such encodmg nucleic 
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acid scque^^c. and rc„«i« subly bound .o i. under a. Ica., moderare. and optionally, under high stringency 
condmons. The isolated mtcleic acd sequence n«y comprise the cDNA insen of the DNA48328-1355 vector 
deposited on IMay 6. 1998 as ATCC 209843 which tnchides the nucleotide sequence encoding PR0864 

In another embodiment, the invention provides isolated PR0864 polypeptide In particular, the invenuon 
provides isolated nadve sequence PR0864 polypeptide, which in one embodiment. IncM^s an amino acid sequence 
comprising residues 1 to 35. of Figure 83 (SEQ ID NO:226). ,n another embodiment, die invention provides an 
uoUtedPR0864 polypeptide absent .he s^nai sequence. Which includes an 

from about 23 to 351 of Figure 83 (SEQ ID NO:226). Optionally, the PR0864 polypepdde U obained or is 
obtainable by expressing the polypeptide encoded by the cDNA insert of the DNA48328-I355 vector deposited on 
May 6. 1 998 as ATCC 209843 



36. PRQ792 

Applicants have identified a cDNA clone that encodes a novel polypepUde having homology u, CD.3 
whercm the polypeptide is designated in the present application as ••PR0792" . 

In one embodiment, me invendon provides an isolated mtcleic acd molecule comprising DNA encoding a 
PR0792 polypeptide. In one aspect. d,e isolated nucleic acid comprises DNA encoding the PR0792 polypeptide 
havu^amino acid residues 1 «> 293 of Figure 85 (SEQ ID NO:23.). or is complemenury to such encoding nucleic 
acd sequence, and remains stably bound .0 it under a. leas. modera«. and opdonally. under high stringency 
condtoons. In anodter aspect. d« isola«d m«Ieic acid comprises DNA encoding the PR0792 polypepdde havmg 
amino acid «sidues X .0 293 of Figure 85 (SEQ ID NO:231) where X is any amino acid from 50 «, 59 of Figure 85 
(SEQ ID NO:231). or is complementary to such encoding nndeic acid sequence, and remains stably bo««. .0 i. under 
a. leas, moderate, and optionally, under high stringency conditions. The isolated nucleic acid sequence may comprise 
.he CDNA insen of the DNA56352..358 vector deposited on May 6. 1998 as ATCC 209846 which i«:.udes Uk 
nucleotide sequence encoding PR0792. 

In anodKr embodimem. d,e invenUon provides isolated PR0792 polypeptide. In particular, the invention 
provides isolated nadve sequence PR0792 polypeptide, which m one embodiment, includes an amino acid sequence 
comprising residues 1 .0 293 of Figure 85 (SEQ ID NO:231). An additional embodimem of die present invention 
B dueced ,0 PRa792 polypeptide comprising amino acids X .0 293 of Ftgure 85 (SEQ ID NO:231). where X is any 
annno «nd from 50 .0 59 of Figure 85 (SEQ ID NO:231). OpnonaUy. U.e PR0792 polypeptide is obtained or is 
obtamable by expressing *e polypeptide encoded by *e cDNA insert of dte DNA56352-1358 vector deposited on 
30 May 6, 1998 as ATCC 209846. 



35 



37. tEOm 

AppUcants have identified a cDNA clone *at encodes a novel polypeptide having homology «, mi„din ami 
spondin prottms. wherein die polypeptide is designated in die present application as "PR0866-. 

Inoneembodimem, die invention provides an isolated nucleic acid molecule comprising DNA encodmg a 
PR0866 polypeptide. In one aspect, the isoUted nucleic acid comprises DNA encoding die PR0866 polypeptide 
havrngaminoadd «,idues 1 «, 331 of Figure 87 (SEQ ID NO:236). or is complemenury .0 such encoding nucleic 
acul sequence, and remains stably bound .0 h under a. least moderate, and optionally, under high stringency 
condmons. In anod^r aspect, die isolated micleic acid comprises DNA encoding the PR0866 polypeptide havmg 
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amino acd residues about27.o 229 of Figure 87 (SBQ ID NO-236). or .s complementary to such encoding nuclc.c 
acid sequence and remains stably bound .0 u under at least moderate, and optionally, under high stringency 
conditions. The isolated nucle.c acid sequence may compr.se the cDNA msen of the DNA5397M359 vector 
deposited on April 7. 1998 as ATCC 209750 which includes the nucleotide sequence encoding PR0866 

In another embodiment, the inventton provides .solaied PR0866 polypeptide. In partKular. the invention 
provides isolated nanve sequence PR0866 polypeptide, which m one embodiment, includes an ammo acid sequence 
comprtsine residues 1 to 331 of Figure 87 (SEQ ID NO:236». Another embodiment of the presem invention is 
directed to PR0866 polypeptides comprising amino acds about 27 to 331 of Figure 87 (SEQ ID NO:236). 
Optionally, the PR0866 polypeptide is obtained or is obuinable by expressing the polypeptide encoded by the cDNA 
insert of the DNA53971.1359 vector deposited on April 7. 1998 as ATCC 209750. 



10 



38. PR0871 

Applicants have idenufied a cDN A clone thai encodes a novel polypeptide having homology to CyP-60. 
wherein die polypeptide is designated in die present application as "PROS?!". 

in one embodiment, die invention provides an isolated nucleic acid molecule comprising DNA encoding a 
15 PR0871 polypeptide. In one aspect, the isolated nucleic acid compnses DNA encoding the PR0871 polypepude 
having amino acid residues I to 472 of Figure 89 (SEQ ID NO:245). or is compleme«ary to such encoding nucleic 
acid sequence, and remains subly bound to i. under a. least moderate, and optionally, under high stringency 
condiuons. In other aspecu. .he isolated mideic acid comprises DNA encoding die PR0871 polypeptide having 
a^ino acid residues about 22 to 472 of Figure 89 (SEQ ID NO:245). or is complementary ,0 such encodmg nucleic 
-H) acid sequence, and remains stably bound .0 it under a. least moderate, and optionally, under high stringency 
condiUons. The isolated nucleic acid sequence may comprise the cDNA insert of the DNA50919-1361 vector 
deposited on May 6 1998 as ATCC 209848 which includes the nucleotide sequence encoding PR0871. 

h, anodier embodiment, the invention provides isolated PR0871 polypeptide. In particular, the invention 
provides isolated native sequence PR0871 polypeptide, which in one embodimem. includes an amino acid sequence 
comprising residues 1 to 472 of Figure 89 (SEQ ID NO:245). An additional embodimem of the presem mvenuon 
is directed .0 FR0871 polypeptides comprising amino acids about 22 to 472 of Figure 89 (SEQ ID NO:245). 
Optionally, the PR0871 polypeptide is obtained or is obtainable by expressing the polypepude encoded by the cDN A 
insen of the DNA50919-1361 vector deposited on May 6. 1998 as ATCC 209848. 
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30 39. EBQS72 . w . . 

Applicants have identified a cDNA clone that encodes a novel polypeptide having homology to 

carboxylesterase, wherein the polypeptide is designated in die presem application as -PR0873". 

in one embodimem. the invention provides an isolated nucleic acid molecule comprising DNA encodmg a 
PR0873 polypeptide. In one aspect, the isolated nucleic acid comprises DNA encoding the PR0873 polypepude 
35 having amino acid residues 1 to 545 of Figure 91 (SEQ ID N0:254). or is complemenu^' » such encoding nucleic 
acid sequence, and remains subly bound .0 i. under at least moderate, and optionally, under high sumgency 
conditions. In other aspect, the isolated nucleic acid compnses DNA encodmg the PR0873 polypeptide having 
amino acid residues about 30 to abou. 545 of Figure 91 (SEQ ID N0:254). or is complemenury to such encodmg 
nucleic acid sequence, and remains subly bound to it under at least moderate, and opuonally. umler high stringency 

69 



BNSOOCtO- <W0_9946281A2J_> 



wo 99/46281 ^ ^ PCT/US99/05028 

conditions. The isolated nucleic acid sequence may comprise the cDNA insen of the DNA44179-1362 vector 
deposited on May 6. 1998 as ATCC 209851 which includes the nucleotide sequence encoding PR0873. 

In another embodiment, the invention provides isolated PR0873 polypeptide. In particular, the invention 
provides isolated native sequence PR0873 polypeptide, which m one embodiment, mcludes an amino acid sequence 
comprising residues 1 to 545 of Figure 91 (SEQ ID N0.254). Addiuonal embodiments of the presem invemion arc 
directed to PR0873 polypeptides comprising amino acids about 30 to about 545 of Figure 91 (SEQ ID NO:254) 
OpdonaUy. the PR0873 polypeptide is obtained or is obtainable by expressing the polypepudc encoded by the cDNA 
insert of the DNA44I79-1362 vector deposited on May 6. 1998 as ATCC 209851 . 

40. PRO940 

Applicants have identified a cDNA clone (hat encodes a novel polypeptide havii% homology to GD33 and 
OB binding protein-2. wherein the polypeptide is designated in the present applicanon as "PRO940-. 

In one embodiment, the invemion provides an isolated nucleic acid molecule comprising DNA encoding a 
PRO940 polypeptide. In one aspect, the isolated nucleic acid comprises DNA encoding the PRO940 polypeptide 
having amino acid residues 1 to 544 of Figure 93 (SEQ ID N0:259). or is complementary' to such encoding nucleic 
acid sequence, and remains stably bound to it under at least moderate, and optionally, under high stringency 
conditions. In other aspects, the isolated nucleic acid comprises DNA encoding die PRO940 polypeptide having 
amino acid residues about 16 to 544 of Figure 93 (SEQ ID N0:259) or 1 or about 16 to X of Figure 93 (SEQ ID 
N0:259). wheit X is aiy amino acid from 394 to 403 of Figure 93 (SEQ ID NO:259). or U complcmcniary to such 
encoding nucleic acid sequence, and remains stably bound to it under at least moderate, and optionally, under high 
stringency conditions. The isolated nucleic acid sequence may comprise the cDNA msert of the DNA54002-I3S7 
vector deposited on April 7. 1998 as ATCC 209754 which includes the nucleotide sequence encoding PRO940. 

In another embodiment, die invention provides isolated PRO940 polypeptide. In particular, the invention 
provides isolated native sequence PRO940 polypeptide, which in one embodiment, includes an amino acid sequence 
comprising residues 1 to 544 of Figure 93 (SEQ ID NO:259). Other embodiments of the present invention are 
directed to PRO940 polypeptides comprising amino acids about 16 to 544 of Figure 93 (SEQ ID NO:259) or 1 or 
about 16 to X of Figure 93 (SEQ ID NO:259). where X is any amino acid from 394 to 403 of Figure 93 (SEQ ID 
NO:259). Optionally, the PRO940 polypeptide is obtained or is obtainable by expressing the polypeptide encoded 
by the cDNA insen of die DNA54002-1367 vector deposited on April 7. 1998 as ATCC 209754. 

41. PR0941 

Applicants have identified a cDNA clone that encodes a novel polypeptide having homology to a cadherin 
protein, wherein the polypeptide is designated in the present application as "PR0941 " . 

In one embodiment, die invention provides an isolated nucleic acid molecule comprising DNA eiKodiiig a 
PR094I polypeptide. In one aspect, the isolated nucleic acid comprises DNA encoding the PR094I polypeptide 
having amino acid residues I to 772 of Figure 95 (SEQ ID NO:264). or is complementary to such encoding micleic 
acid sequence, and remains stably bound to it under at least moderate, and optionally, under high sttingency 
conditions. In other aspects, die isolated raicleic acid comprises DNA encoding the PR094I polypeptide having 
amino acid residues about 22 to 772 of Figure 95 (SEQ ID NO:264) or I or about 22 to X of Figure 95 (SEQ ID 
NO:264). where X is any amino acid from 592 to 601 of Figure 95 (SEQ ID NO:264). or is complementary to such 
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encodi,^ nucleic acid sequence, and remains stably bound to it under a. leas, moderate, and optionally, under high 
stringency condmons. n.e isolated nucleic acid sequence may comprise the cDNA insert of the DNA53906.1368 
vector deposited on April 7. 1998 as ATCC 209747 which includes the nucleotide sequence encoding PR0941 

In anoOier embodiment, the invention provides isolated PR0941 polypeptide. In particular, the invenuon 
provides isolated native sequence PR0941 polypeptide, which m one embodimem. mcludes an amino acid sequence 
comprising residues 1 to 772 of Figure 95 (SEQ ID NO:264). Additional embodiment of the present invention are 
directed to PR0941 polypeptides which comprise ammo acid about 21 to 772 of Figure 95 (SEQ ID NO:264) or 1 
or about 22 to X of Figure 95 (SEQ ID NO:264). where X is any amino acid from 592 to 601 of Figure 95 (SEQ ID 
NO:264). Optionally, the PR0941 polypeptide is obtained or .s obtainable by expressing the polypeptide encoded 
by the cDNA insert of the DNA53906-1368 vector deposited on April 7. 1998 as ATCC 209747. 

In another embodiment, die invention provides an expressed sequence tag (EST) designated hcrem as 
DNA6415 comprising the nucleotide sequence of Figure 96 (SEQ ID NO:265). 



42. PR0944 

Applicants have identified a cDNA clone that encodes a novel polypeptide having homology to Clostrtdmm 
15 perfringens enterotoxin receptor (CPE-R). wherein the polypeptide is designated in the present application as 
"PR0944" 

In" one embodimem. the im^ention provides an isolated nucle.c acid molecule comprising DNA encoding a 
PR0944 polypeptide. In one aspect, .he isolated nucleic add comprises DNA encoding d.e PR0944 polypeptide 
having amino acid residues 1 to 21 1 of Figure 98 (SEQ ID NO:270). or is complementary .o such encoding mKletc 
20 acid sequence, and remains stably bound to it under at least moderate, and optionally, under high strmgency 
condi.ions. In other aspects, the isolated nucleic acid comprises DNA encoding die PR0944 polypeptide having 
amino acid residues about 22 to 229 of Figure 98 (SEQ ID NO:270) or amino acid 1 or about 22 to X of Figure 98 
(SEQ ID NO 270) where X is any amino acid from 77 .o 80 of Figure 98 (SEQ ID NO:270). or U complememary 
to such encoding nucleic acid sequence, and remains sutbly bound to i, under at least moderate, and optionally . under 
^5 high stringency condmons. The isolated m«:leic acid sequence may comprise the cDNA inser, of *e DNAS2185- 
1370 yccm deposited on May 14, 1998 as ATCC 209861 which includes the nucleotide sequence encodmg PR0944. 

to anodier embodiment. ti« invention provides isolated PR0944 polypeptide. In particular, the invenuon 
p„^ isolated native sequence PR0944 polypeptide, which in one embodiment, includes an amino acid sequence 
comprising residues 1 to 21 1 of Figure 98 (SEQ ID NO:270). Additional embodiments of die presem mventton are 
30 direoed » PR0944 polypeptides comprising amino acids about 22 ,o 2 1 1 of Figure 98 (SEQ ID NO:270, or ami«, 
acid 1 or about 22 U) X of Figure 98 (SEQ ID NO:270). where X is any amino acid from 77 to 86 of F.gurt 98 (SEQ 
ID NO-270). Optionally, the PR0944 polypeptide is obutned or is obtainable by expressing tite polypepade encoded 
by ti« CDNA insert of Ote DNA52185-1370 vector deposited on May 14. 1998 as ATCC 209861. 

,„ another embodimem, *e Invemion provides an expressed sequence u.g (EST) designa«d herein as 
35 DNA14O07 comprising tfte nucleotide sequence of Figure 99 (SEQ ID NO:27I). 

In another embodimem, *e invention provides an expressed sequence ug (EST) designated herem as 
DNA12733 comprising the nucleotide sequence of Figure 100 (SEQ ID NO:272). 

In anoAer embodiment, .he invention provides an expressed sequence tag (EST) designated herem as 
DNA12746 comprising the micleoride sequence of Figure 101 (SEQ ID NO:273). 
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In another embodiment, the invenuon provides an expressed sequencSf (EST) designated herein as 
DNA12834 comprising the nucleotide sequence of Figure 102 (SEQ ID NO:274). 

In another embodiment, the invention provides an expressed sequence tag (EST) designated herein as 
DNA12846 comprising the nucleotide sequence of Fiugure 103 (SEQ ID NO:275). 

In another embodiment, the .nvemion provides an expressed sequence tag (EST) designated herein as 
i DNA13 104 comprising the nucleotide sequence of Figure 104 (SEQ ID NO:276). 

In another embodiment, the invenuon provides an expressed sequence tag (EST) designated herein as 
DNA13259 comprising the nucleotide sequence of Figure 105 (SEQ ID NO:277). 

In another embodiment, the invention provides an expressed sequence tag (EST) designated heitin as 
DNA13959 comprising the nucleotide sequence of Figure 106 (SEQ ID NO:278). 

HM. . J" ~ """'^ herein as 

DNA13961 compnsing the nucleotide sequence of Figure 107 (SEQ ID NO:279). 

43. PR09«3 

Applicants have identified a cDNA clone that encodes a novel polypeptide having homology to a vcsicte 
15 associated protein. VAP-33. wherein the polypeptide is designated in the present application as "PR0983- 

in one embodimem. the invention provides an isolated m«leic acid molecule comprising DNA encoding a 
PR0983 polypepdde. In one aspect, the isolated nucleic acid comprises DNA encoding the PR0983 polypeptide 
having ami™, acid residues 1 «, 243 of Figure 109 (SEQ ID NO:284). or is complementaty to such encoding nucleic 
acd sequence, and remains stably bound to it under at .east moderate, and optionally, under high stringency 
condtuons. In other aspects, the isolated nucleic acid comprises DNA encoding the PR0983 polypeptide havin. 
anuno acid residue 1 to X of Figure 109 (SEQ ID NO:284) where X is any ami™, acid from 2,9 to 228 of Figure 
109 (SEQ ID NO:284). or is complemenuiry to such encoding nucleic acid sequence, and remains stably bound to 
« under at leas. moden«. and optionally, under high stringency conditions, nie isolated nucleic acid sequence may 
comprise the cDNA insert of the DNA53977-137. vector deposited on May 14. 1998 as ATCC 209862 which 
-5 ulcludes die nucleotide sequence encoding PR0983. 

in anodier embodiment, the invention provides isolated PR0983 polypeptide. In panicular. the invention 
P«,vides isolated nadve sequence PR0983 polypeptide, which in one embodiment, includes an ami«, acid sequence 
comprising .endues 1 to 243 of Figure 109 (SEQ ID NO:284). Additional embodiments of the present invention are 
directed to PR0983 polypepudes comprising amino acid 1 to X of Figure 109 (SEQ ID NO:284). where X is any 
30 atnino acid from 219 to 228 of Figure .09 (SEQ ID NO:284). Optionally, the PR0983 polypeptide is obtained or 
« Obtainable by expressing the potypepude encoded by the cDNA insert of the DNA53977.I371 vector deposited on 
May 14. 1998 as ATCC 209862. 

nM.,„!! """"^ """"^'^ ''''""^ ^■'^ ''"*^««' herein as 

ONA17130 compnsing the nucleotide sequence of Figure 1 10 (SEQ ID NO:285). 

35 In another embodimem. the invennon provides an expressed sequence tag (EST) designated herein as 

DNA23466 comprising the nudeodde sequence of Figure 111 (SEQ ID NO:286). 

In another embodimem. the invenuon provides an expressed sequence tag (EST) designated herein as 
DNA26818 comprising die nucleotide sequence of Figure 112 (SEQ ID NO:287). 

In another embodimem. U« invennon provides an expressed sequence uig (EST) designated herein as 
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DNA37618 comprising ihe nucleotide sequence of Figure 113 (SEQ ID NO:288). 

in ancher embodimem. O^e mvemion provides ar> expressed sequence «g (EST) designated herein as 
DNA41732 comprising the nucleotide sequence of Figure 114 (SEQ ID NO:289). 

In another embodiment, the invention provides an expressed sequence tag (EST) designated herein as 
DNA45980 comprising the nucleotide sequence of Figure 1 15 (SEQ ID NO:290). 
5 in another embodiment, the invention provides an expressed sequence lag (EST) designated herein as 

DNA46372 comprising the nucleotide sequence of Figure 1 16 (SEQ ID NO:29l). 

44. EBiUfiS 

Applicants have idennfed a cDNA clone that encodes a novel polypeptide having homology to proteases. 
10 wherein the polypeptide is designated in the present application as -PRO1057-. 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encodmg a 
PRO1057 polypeptide. In one aspect, the isolated nucleic add comprises DNA encoding the PRO1057 polypeptide 
having amino acid residues 1 to 413 of Figure 118 (SEQ ID NO:296). or is complemeniary to such encodtng nucletc 
acid sequence, and remains stably bound to it under at leas, moderate, and optionally, under high stnngency 
15 conditions. In other aspects, the isolated nucleic acid comprises DNA encoding the PROI057 polypeptide having 
amino acid residues about 17 to 413 of Figure 1 18 (SEQ ID NO:296). or is complementary to such encodmg nucletc 
acid sequence, and remains stably bound to it under a. least moderate, and optionally. u«ler high stringency 
conditions, m isolated nucleic acid sequence may comprise the cDNA insert of the DNA57253-.382 vector 
deposited on May 14. 1998 as ATCC 209867 which includes the nucleoude sequence encodmg PRO1057. 

another embodiment, the invention provides isolated PRO1057 polypeptide. In particular, the invcmton 
provides isolated native sequence PRO1057 polypeptide, which in one embodiment, includes an amino acid sequence 
comprising residues 1 to 413 of Figure 1 18 (SEQ ID NO:296). Additional embodiments of the present invention a., 
directed to PR0.057 polypeptides comprising amino acids about 17 to 413 of Figure 118 (SEQ ID NO:296V 
Optionally, the PRO1057 polypeptide is obtained or is obtainable by expressing the polypeptide encoded by the cDNA 
25 insen of the DNA57253-1382 vector deposited on May 14. 1998 as ATCC 209867. 
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45. PRO1071 

Applicams have identified a cDNA clone that encodes a novel polypeptide having homology to 
thrombospondin. wherein the polypeptide is designated in the present application as •PR01071". 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA en«Ki.ng a 
PRO1071 polypeptide. In one aspect, the isolated nucleic acid comprises DNA encoding the PRO1071 polypepude 
having amino acid residues I to 525 of Figure 120 (SEQ ID NO:301). or .s complementary to stKh encoding nucletc 
acid sequence, and remains subly bound to it under at leas, moderate. a«l optionally, under high stringency 
conditions In other aspects, the isoUted nucleic acid comprises DNA encoding the PRO1071 polypeptide havng 
amino acid residues about 26 to 525 of Figure 120 (SEQ ID NO:301). or is complementary to such encoding nucletc 
acid sequence, and remains stably bou«l to i. under a. leas, moderate, and optionally, under high stringency 
conditions. 1^ isola,ed nucleic acid sequence may comprise the cDNA insen of ti« DNA58847-1383 vector 
deposited on May 20, 1998 as ATCC 209879 which includes the nucleotide sequence encodmg PR01071 . 

Inanother embodiment, the invemion provides isolated PRO 1071 polypeptide. In particular, tite mventton 
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pn,vidcs isolated nadvc sequence PR0107. polypeptide. wh.ch in one embodin«m.1ldes an anu.o acid sequence 
«resKlu«Uo525ofFisu«120(SEQIDNO:30l). Addiuona. en,bodi™en« of U,e present .nvenUon « 
d.rected ,o PROI07. polypeptides comprising amino acds about 26 to 525 of Figure .20 (SEQ ID NO-301) 
Opdonally. the PRO.07, polypepd^e is obtained or .s obtainable by expressing the polypeptide e™:oded by the cDNA 
insen of the DNA58847.1383 vector deposited on May 20. 1998 as ATCC 209879 

5 

46. EESUm 

Applicants have identified a cDNA clone that encodes a novel polypeptide hav.ng honK.logy to reducme 
protems. wherein the polypeptide is designated in the present application as ■PRO1072" 

10 PR0.072'" r !r"r* " "^'"^ "^^'""'^ ^^^^^^^^"^^ --"^ ^ 

PRO1072 polypeptKie. ,n one aspect, the isolated nucleic acid comprises DNA encoding d,e PRO1072 polypeptide 
having amino acid residues 1 to 336 of Figure 122 (SEQ .D NO:303). or is complemenury to such encoding nucleic 
acd sequence, and remains stably bound .0 it under at least moderate, and optionally, under hieh stringency 
condtnons. In other aspects, the isolated nucleic acid comprises DNA encoding the PROI072 polypeptide having 
--°-'»-^<^-sabout22.o336ofFigureI22(SEQIDNO:303).oriscomplemen^^ 
acd sequence^and remains stably bound to U under at leas, moderate, and optionally, under high stringency 
condutons. m isolated nucleic acid sequence may comprise the cDNA inser. of the DNA58747-.384 vector 
deposited on May 14. 1998 as ATCC 209868 which includes me nucleotide sequence encoding PRO1072 

'nanoU^rembodiment. the invenuon provides isolated PRO.072 polypeptide. In particular, the invcndon 

P«>VKles isolated native sequence PRO.072 polypeptide, which in one embodiment, includes an amino acid seq„e«e 
20 ^rcsidue...^ Additional embodiments of the present invenul are 

directed .0 PRO1072 polypepUdes comprising ami«> acids about 22 to 336 of Figure ,22 (SEQ ID NO-303) 

Omn^. the PRO1072 polypeptide is obtai«d or is obtainable by expressing the po.ypeptide encoded by the cDNA 

insert of the DNA58747-1384 vector deposited on May 14. 1998 as ATCC 209868. 

25 DNAan,!n """^ " "^"^ '^^^^ - 

DNA4021O comprising Uic nucleotide sequence of Figure 123 (SEQ ID NO:304). 

<7. PRO107S 

Applicants have identified a cDNA clone that encodes a novel polj^ptide having homology .0 p^tein 
disulfide Bomerase. wherein the polypeptide is designated in the present application as "PROWS" 

PRo,«,.'""""^'"'"* """"" " ""'^'^ ^'^'^"'^ --"^-"g DNA encoding a 

PROKr75 polypeptide. In one aspect, the isolated nucleic acid comprises DNA encoding the PRO1075 polypeptide 

havmga^inoacid residues I .o406of Figure 125 (SEQ ID NO:309). or is complememary to such encoding nucleic 
ac^sequen^. and remains subly bound to U under a. leas, moderate, and optiorully. under high stringency 
condtao^. In other aspects. d« isolated nucleic acid comprises DNA encoding the PRO1075 po.ypeptide havmg 
ammoac«„esiduesabou.30to406of Figure 125 (SEQ ID NO:309,. or is complementary .0 such encoding nucleic 
acK, sequence, and remains subly bound to it under at least moderate, and optionally, under high stringency 
condmons. m isolated nucleic acid sequence may comprise the cDNA inser, of the DNA57689.,385 vector 
depostted on May ,4. 1998 as ATCC 209869 which inch«les the nucleotide seque«:e encoding PROICHS 

Ina««herembodime«. die invention provides isolated PRO1075 polypeptide. In particular, the invemion 
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provides isolated thrive sequence PRO1075 polypeptide, which in one embodin«nt. .ncMes an amino acid seqt«nce 
comprising residues I to 406 of Fieure 125 (SEQ ID NO:309). Additional embodtments of .he present invenuon are 
directed to PRO1075 polypeptides cotnprising amino acids about 30 to 406 of Ftgure 125 (SEQ ID NO:309). 
Opdonally. the PRO1075 polypepdde is obtained or is obtainable by expressing the polypeptide encoded by the cDNA 
insert of the DNA57689.1385 vector deposited on May 14. 1998 as ATCC 209869. 
5 in another embodiment, the invention provides an expressed sequence tag (EST) designated herein as 

DNA 13059 comprising the nucleotide sequence of Figure 126 (SEQ ID N0:310) 

in another embodiment, the invenuon provides an expressed sequence tag (EST) designated herein as 
DNA19463 comprising the nucleotide sequence of Figure 127 (SEQ ID N0:31 1). 

10 48. musi 

AppUcants have identified a cDN A clone that encodes a novel polypeptide having homology to the comichon 
protein, wherein the polypeptide is designated in the present application as "PROlSr . 

in one embodiment, the mvention provides an isolated nucleic acid molecule comprising DNA encodmg a 
PR0181 polypepdde. In one aspect, the isolated nucleic acid comprises DNA encoding the PR0181 polypeptide 
15 having ammo acid residues 1 to 144 of figure 129 (SEQ ID NO:322). or is complementary to such encoding nudetc 
acid sequence, and remains stably bound to it under at least moderate, and optionally, under high strtngency 
conditions. In other aspects, dte Uolated nucleic acid comprises DNA encoding dte PR0181 polypeptide having 
anuno acid residues about 21 to 144 of Rgure 129 (SEQ ID NO:322) or amino acid 1 or about 21 to X of Figure 129 
(SEQ ID NO:322) where X is any amino acid from 52 to 61 of Figure 129 (SEQ ID NO:322). or is complementary 
20 to such encoding nucleic acid sequence, and remains stably bound to it under at leas, moderate, and optionally, under 
high stringency conditions. The isolated nucleic acid sequence may comprise the cDNA insert of the DNA23330. 
1390 vector deposited on April 14. 1998 as ATCC 209775 which includes the nucleotide sequence encodmg PR0181. 

to anodter embodimem. tite invemion provides isolated PR0181 polypeptide, to particular, the invenuon 
provides isolated native sequence PR0181 polypeptide, which in one embodiment. includ« an amino acid sequence 
.5 comprising residues 1 to 144 of Figure 129 (SEQ ID NO:322). Additional embodimems of dte presem invenuon are 
direc,ed.oPR0181 polypeptides comprising aminoacids about21 to 144 of Figure 129 (SEQ ID NO:322) oramino 
acid 1 or about 21 to X of Figure 129 (SEQ ID NO:322). where X is any amino acid from 52 to 61 of Figure 129 
(SHJ ID NO:322). Optionally, .he PR0181 polypeptide is obuuned or is obtainable by expressing die polypepude 
encoded by *e cDNA insert of die DNA23330-1390 vector deposited on April 14. 1998 as ATCC 209775. 

in anotiter embodiment, Oie invention provides an expressed sequence ug (EST) designated herein as 
DNA13242 comprising die nucleotide sequence of Figure 130 (SEQ ID NO:323). 
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49. ERQ12S . ^ 

AppUcants have identiHed a cDNA clone dia. encodes a novel transmembrane polypepude. wherem die 

35 polypeptide is designated in die presem application as -PR0195" . 

in one embodiment, die invemion provides an isoUted nucleic acid molecule comprising DNA encodmg a 
PR0195 polypepude. In one aspect, die isolated nucleic acid comprises DNA encoding die PR0195 polypeptide 
having amino add residues 1 to 323 of Figure 132 (SEQ ID NO:330). or is complementary to such encoding nucleic 
acid sequence, and remains suibly bound to h under at least moderate, and optionally, under high stringency 
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condUions. In o.her aspects. a« iso.a«d nucleic acid comprises DNA «Kodi Jff PR0195 polypcpude having 
an.no acid res.dues abou. 32 «, 323 of Rgu« 132 (SEQ ID NO.330, or anuno acid 1 or about 32 to X of Ftgurc 132 
(SEQ ID NO:330) where X is any a,m«> acid ftom 236 to 245 of Figure 132 (SEQ ID NO:330). or is complcmemary 
u, such encoding nucleic acid sequence, and remains subly bound to it under at least moderate, and optionally under 
h.gh stnngency conditions. •n,e isolated nucleic acid sequence may comprise the cDNA insert of the DNAi6847. 
5 1395 vector deposited on April 14. 1998 as ATCC 209772 which includes the „«deodde sequence encodi,^ PRO,95 
In another embodiment, the invention provides isolated PR0195 polypeptide. In particular, the mvention 
P«>vides isolated native sequence PR0195 polypeptide, which in one embodiment, includes an amino acid sequence 
compnsing residues 1 to 323 of Figure 132 (SEQ ,D NO:330). Addinona. embodiments of the presem invention are 
<i«ctedtoPR0195polypep,idescomprisingaminoacidsabout32.o323 ofFigure 132 (SEQ ID NO:330) or amino 
10 actd 1 or about 32 to X of Fig«e ,32 (SEQ ID NO:330). where X is any amino acid from 236 to 245 of Figure 132 
(SEQ NO:330). Optio^Oly. .he PR0195 polypeptide Is obuined or is obtai«.ble by expressing the polypepdde 
encoded by the cDNA insert of the DNA26847-I395 vector deposited on April 14. 1998 as ATCC 20977'' 

m another embodimem. the invention provides an expressed sequence Ug (EST) comprising the mtcleotide 
sequence of Figure 133 (SEQ ID NO:331). herein designated DNA15062. 
'5 In another embodiment, the invention provides an expressed sequence tag (EST) comprising the mtcleotide 

sequence of Figure 134 (SEQ ID NO:332). herein designated DNA13199. 

50. EEOm 

^'^'^""^•--^"--DNAclonetha.encodesanovelsecretedpolypepUd^^ 
2U IS designated in the presem appUcatum as "PR0865" . 

in one embodimem. the im^endon ptovides an Isolated nucleic acid molecule comprising DNA encoding a 
PR0865 polypepdde. In one aspect, the Isolated nucleic acid comprises DNA encoding u« pro865 polypeptide 
havmg amino acid residues 1 .0468 of Figure .36 (SEQ ID NO:337). or is complementary to such encoding nucleic 
acd sequence, and remains stably bound to it under at least moderate, and optionally, under high stringem:y 
condmons. In other aspects, the isolated ««leic acid compnses DNA encoding the PR0865 polypeptide having 
ammo acid residues about 24 to 229 of Figure 136 (SEQ ID NO:337). or is complementary to such encoding m«:,eic 
acd sequence, and remains stably bou«l to i. under a. teast moderate, and optionally, under high stringency 
condtfons. TT« isolated nucleic acid sequence may compnse d,e cDNA insert of the DNA53974-I401 vector 
deposited on April 14. 1998 as ATCC 209774 which includes the nucleotide sequence encoding PR0865. 

In another embodimem, the invention provides isolated PR0865 polypeptide. In particular. Che invcmion 
provides isototed native sequence PR0865 polypeptide, which in one embodimem, includes an amino acid sequence 
comprising residues 1 to 468 of Figure 136 (SEQ ID NO:337). An additional embodimem of tite presem invention 
IS directed to a PR086S polypeptide comprising amim, acids abom 24 to 468 of Figure 136 (SEQ ID NO-337) 
OpdonaDy. the PR0865 polypeptide is obtained or is obuunable by expressing tite polypeptide encoded by tite cDNA 
insert of tite DNA53974-1401 vector deposited on April 14. 1998 as ATCC 209774. 

In ano^er embodiment, the invention provides an expressed sequence tag (EST) comprising tite nucleotide 
sequence of Figure 137 (SEQ ID NO:338). herein designated as DNA37642. 
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^« have idenuficd a cDNA clone *a. encodes a novel polypepude havng homology .o imegrin 
proteu«. wherein *e polypepude is designated m the present application as "PR0827 " 

in one en^ditnent. the invention provtdes an isolated nucleic acid .olccule co.pr.s.ng DNA encodtng a 
PR0827 polypeptide. In one aspect, ^e isolated nucleic ac.d composes DNA encod^g .e PR0827 polypepude 
^vingalacidresidueslto 124 of F.gurel39(SEQ ID NO:346).o..s complementary to suche.^^ 

Xt^nce. and remai. sta.W .und to under at least moderate, and opt.ona.ly. under .gh strtngency 
oli ns. in o.er aspec.. dte isolated nucleic acid comprises DNA encoding .e PR0827 po.ypept.de havutg 
3^acid residues about23tol24ofPigure 139 (SEQ ID NO:346).ortscom,emcn.ry to such«^ 

Tse^uence. and rem^ su.y .und to n under at leas, moderate, and optionally. ^^^^^^^ 
::lls. isolated nucleic acid sequence n.y comprise .e cDNA .ert of ^ DNA5 ^9.^2 vector 
deposited on April 14. 1998 as ATCC 209777 which includes the nucleotide sequence encodmg PR0827. 

U> another embodiment, the invention provides isolated PR0827 polypepdde. In partkuUr. the tnvemton 
provides isolated nadve sequence PR0827 polypeptide, which in one embodiment, includes an amino ac.d sequence 
o„.ris^«siduesUol24orFigurel39(SEQlDNO:346>.Anadditio,.lembodimemo^^^^^^^^^ 
. dtrected to a PR0827 polypeptide compns.ng amino acids about 23 to 124 of F.gure 1 39 (SEQ D N0.346, 
optionally, the PR0827 polypepdde is obtained or is obtainable by expressmg dte polypepude encoded by the cDNA 
imert of the DNA57039-1402 vector deposited on AprU 14. 1998 as ATCC 209777. 
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52 PR01114 

Applicants have identified a cDNA clone that encodes a novel polypeptide having homology to cytokine 
receptor family-4 proteins, wherein the polypeptide is designated in the presem application as -PROIIW. 

one embodimem. the invemion provides an isolated nucletc acid molecule comprising DNA encodmg a 
PR01114 pol^Tcpdde. In one aspect, the .solated m.deic acid comprises DNA encoding the PROUU polypepdde 
having amino acid residues I to 3. 1 of Figure .42 (SEQ ID NO:352). or is complementary to such encoding nucte.c 
acid sequence, and remains subly bound to it under a. least moderate, and optionally, under high stringency 
conditions. In oUter aspects, the isoUted nucleic acid comprises DNA encoding the PR0.114 polypeptide having 
amino acid residues about 30 to 31 1 of Figure 142 (SEQ ID NO:352) or amino acid 1 or about 30 to X of F.gure 142 
(SEQ ID NO:352). where X is any amino acid from 225 to 234 of Figure 142 (SEQ ID NO:352). or .s 
complementary to such encoding nucleic acid sequence, and remains stably bound to it under a. teast moderate, and 
opnonally. under high stringency condiuons. The isolated nucleic acid sequence may comprise the cDNA msen of 
dte DNA57033-.403 vector deposited on May 27. 1998 as ATCC 209905 which includes the nucleodde sequence 
encoding PROl 114. 

in another embodiment, the im^endon provides isolated PROI 1 14 polypeptide. In particular, the mvenuon 
pn^vides isolated native sequence PROI 1 14 polypeptide, which in one embodiment, includes an amino acid sequence 
comprising tesidues I to 311 of Figure 142 (SEQ ID NO:352). Addidonal embodiments of the presem invention are 
directed to PROI 1.4 polypeptides comprising amino acids about 30 to 31 1 of Figure 142 (SEQ ID NO:352) or amino 
acid 1 or about 30 to X of Figure 142 (SEQ ID NO:352). where X is any amino acid from 225 to 234 of Ftgure 142 
(SEQ ID NO-352) Optional^, the PROI 114 polypeptide is obtained or is obtainable by expressing die polypepude 
encoded by the cDNA insert of the DNA57033-1403 vector deposited on May 27. 1998 as ATCC 209905. 
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DMA.*.!! ^ """^"^^ as 

DNA48466 compnsing the nucleotide sequence of Figure 143 (SEQ ID NO-353) 

A CDNA cto^ (DNA57033-.403) has been identified U«. encodes a nove. interferon receptor polypeptide 
designated in the present application as "PR01114 interferon receptor". 

5 P«n, . '""^^ " "^'^^"'^ -"P"*^ ^^'A encodtng a 
:> FROl 1 14 interferon receptor polypeptide. 

In one aspect, the isolated nucleic acid con^rises DNA having a. least about 80% sequence identity 
preferably at .east about 85% sequence identity. n.o.. preferably at leas, about 90% sequence identity, 
preferably a. least about 95% sequence identity to (a) a DNA molecule encoding a PR0U14 interferon receptor 

m J^Tnl rnT* °' ' °^ ^ -'""ve of Figure 

10 142 (SEQ ID NO:352). or (b) the complement of the DNA molecule of (a). 

In another aspect, U« invention cotKems an isolated nucleic acid molecule encoding a PROl 114 interferon 
^or polypeptide comprising DNA hybridizing to d,e complement of the nucleic acid between about nuc.eot.des 
250or about 337 and about 1182. inclusive, of figure 141 (SEQ ID NO:35.). Preferably, hybridization occurs under 
Stringent hybridization and wash condiiions. 

3^,8or'^T''^""^~"'""'°''^""''"*''™'"^^ 

about 80% sequence .denoty. preferably a. least about 85% sequence identity, more preferably a. leas, about 90% 
sequel, identity, most preferably at least about 95% sequence identity to (a) a DNA molecule e«:odi„g the same 
n-ature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 209905 (DNA57033.1403) or (b) 
^ c^nplement of d,e nucleic acid molecule of (a), b, a preferred embodiment, the m«,eic acid comprises a DNA 

'^oZm '"^ '""^ ""^^ 

In still a further aspect, the invemion concent an isolated nucleic acid molecule comprising (a) DNA 
cncodn, a polypeptide having at teas, about 80% sequence identity, preferably at least about 85% sequence identity 
more preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 
sequence of amino ackl residues 1 or about 30 to about 311. inclusive of Figure 142 (SEQ ID NO:352) or'(b) tite 
complement of the DNA of (a). 

In a ftmher aspect, the invention concerns an isolated nucleic acid molecule having a. least 10 mtdeotides 
^ produced by hybridizing a test DNA molecule u«ler stringem conditions with (a) a DNA molecule e«:odi«g a 
PR01114 mterferon receptor polypeptide having the sequence of amino acid residues from 1 or about 30 to about 
311. mchtsive Of figure 142 (SEQ ID NO:352). or (b) the complement of the DNA molecule of (a), and, if the DNA 
motecule has a. least about an 80 % sequence identity, prefereably a. least about an 85% sequence identity, more 
P«fer.bly at least about a 90% sequence identic, most preferably at leas, about a 95% sequence Identity to (a) or 
(0). isolating the test DNA molecule. 

pon. . " ""^'^'^ """'"^^ ^<"»P^«i^ DNA encodi,^ a 

PROnM imerferon receptor polypeptide. «ith or without the N-terminal signal sequence and/or the initiaung 
and ns sohible. i.e.. transmembrane domain deleted or inactivated variams. or is complememary to such 
c.«Hlmg nucleic acid molecule. 1^ signal peptide has been tenutively Identified as exte^l^g from about amino 
ac.dpo,mo„,«,abo«aminoacidposinon29inthesequenceofFigurel42(SEQlDNO:^^^^^ 
<iomam has been tentative^ identified as extending from about amim, acid position 230 to about amim, acid posiuon 
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255 in eie PROl 1 14 interieron receptor amino acid sequence (Figure 142. SEQ ID N0.352) 

anoa«r aspect. Oie mvention concerns an isolated nudeu: acid molecule compr«mg (a) DNA encoding 
a polypepdde scoring at leas, about 80% positives, preferably at leas, about 85% positives, more preferably a. leas, 
about 90% posidves. most preferably a. leas, about 95% positives when compared wtth the amino acid sequence of 
residues I or about 30 to about 311. inclusive of Figure 142 {SEQ ID NO:352). or (b) the complement of the DNA 

Another embodimen. is directed to fragments of a PROl 1 14 interferon receptor polypept.de coding sequence 
^ may find use as hybridizadon probes. Such nucleic acid fragments are from about 20 to abou. ^ nucleoti<., 
in lengl. preferably from about 20 to about 60 nuCeoades in length, more preferably from abou. 20 «. about 50 
nucleotides in length and most preferably from abou. 20 to abou. 40 nucleotides in length a«l may be denved from 
10 the nucleoude sequence shown in Figure 141 (SEQ ID NO:351). 

in another embodiment, the invention provides a vector comprising DNA encoding PROl 1 14 mterferon 
receptor or its variants. ^ vector may compnse any of the isoUited nucleic acid molecules hereinabove idendfied. 

A host ceU comprising such a vector b also provided. By way of example, the host cells may be CHO cells. 
E. coli or yeas.. A process for producmg PROllM imerferon recepror polypep.ides is furUter provided and 
,5 comprisesculturinghos.cellsundercondinonssuiublef^^ 

PROl 1 14 inttrfcron reccpror from die cell culture. 

,na««her embodiment, die invemton provides isolated PR01114 interferon receptor polypepade encoded 
by any of the isolated nucleic acid sequences hereinabove identified. 

m a specific aspect, the invention provides isolated narive sequence PR01114 m«rf.ro« recep^ 
20 po.ypep.ide. which in certain embodiments, includes an amino acid sequence comprising residues . or about 30 «, 
aboul 31 1 ot Fiont 142 (SEQ ID NO J52). 

to ~«.n„ » .»!•«<"■'«>"" '^'^ 
„^..«^ta*,»l««^»«>«^'*»^-'«'=*"'^"^"*"''^^ 

to.«-b»«P».*c.n«»«c«««»»^.»=..PR01114to„..«™,.»P»r,.^ 

„ 1^, ^ 90* P-ef-'.'y ^' ""P-" "* ^ 

.^,„<*,^i«»sl»*«»»>b«»3l'»'-™<''f*'"»^''^""'°^'"'- 
30 »«1«, «P«.. 

P„„.„b„. *. PROl 1 14 ■ ""^ of . »«. I^OIU. 

interferon receptor polypeptide. ^ ^ • . hna 

• 35 .„ a still fiirther aspect, the invenuon provides a polypepdde produced by (0 hybndumg a test DNA 

molec^e ^ stringer cor^lUor- .i* (a) a DNA n»lecule e^od^g a PROn 14 interferon ^ 

^^vingd^sec^nceofaminoacid residues from aboutlor about 30 toaboutSH.^ 

N0T52). or (b>dKComp^ofd« DNA molec.e of (a),and if die test DNA molecule has a. least ab«^ 

s«n«nceidendty.prefe,ablya.,eas.abou.an85%sequenceidenU.y.more preferably a. l«^^ 



79 



e^S0OCI0• <W0_9946W1A2J.> 



]0 



15 



WO 99/46281 

PCT/US99/05028 

In ancher cmbod.n,em. .nvenuon provides chin«„c molecules compr«u.g a PROM 14 interferon 
recep.orpo.pep..e.se..oa..^^^^^ 

n.olecu,c compnses a PROl 1 .4 interferon receptor polypeptide fused to an epitope ug seZnce or a Fc r 
an immunoglobulin. sequence or a Fc region of 

In another embodiment, the mvennon provides an antibody which specincally binds to a PROIl 14 in.erfc«.n 
receptor polypepUde. Optionally, the antibody is a monoclonal antibody. 

tayctanotherembodiment, the invemion concerns agomsts and antagonists of a native PR01114 inierferon 
««r«»P0lypepude. .naparticu^rem.^^^^^^ 

candtdate molecule and monitormg a biological activity mediated by said polypeptide. 

In a still fur^er embodmient. .he invention concerns a composition comprismg a PR01114 interferon 
.^^.^.de.oranago.stor.^^^^^^ 
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IZ II!T' " °" '""P"- ■'NA ft, P,o:37 polvpeX 

..po..^ . „ „ce «^ Wu*. ^ 

'»»«te™b«lta.„,ft.i„.,top^itoi^,^pR0237pol„cpM.. I.p.nicab,.ft.|„™i™ 
.--»»---„«^™02,,p,,^p,^.«^,,..^^„,^,^^^-^^^ 

» FRTOT p.*^ „^ .cii. .b». 24 „ 328 ,f ,« eEQ ,D N0:3«, „ ^ 

.-o^..««.DNA^„o,ft.DNA34353.,42..««*p„,^„„,„2.,^„,^^f'°* 
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54 PR0541 

Applicants have identified a cDNA clone that encodes a novel polypeptide having homology to a trypsin 
irtdbitor protein, wherein the polypeptide is designated m the present application as "PR0541" 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a 
PR0541 polypeptide. In one aspect, .he isolated nucleic acid comprises DNA encoding the PR0541 polypeptide 
having amino acid residues 1 to 500 of Figure 147 (SEQ ID NO:363). or is complementary to such encoding nucleic 
acid sequence, and remains subly bound to it under at least moderate, and optionally, under high stringency 
conditions. In other aspects, the isolated nucleic acid comprises DNA encoding the PR0541 polypeptide having 
^ acid residues about 21 to 500 of Figure 147 (SEQ ID NO:363). or is complementary to such encoding nucleic 
acid sequence and remains stably bound to it under at least moderate, and optionally, under high stringency 
conditions. The isolated nucleic acd sequence may comprise the cDNA insert of the DNA454,7-.432 vccu.r 
deposited on Mav 27. 1998 as ATCC 209910 which includes the nucleotide sequence encodmg PR0541. 

in another embodiment, the invention provides isolated PR0541 polypeptide. In particular, the invention 
provides isolated native sequence PR0541 polypeptide, which m one embodiment, includes an amino acid sequence 
compr^ing residues 1 to 500 of Figure 147 (SEQ ID NO:363). Additional embodtments of the present invention are 
directed to PR0541 polypeptides comprising amino acids about 21 to 500 of Figure 147 (SEQ ID NO:363). 
Optionally. Ok PR0541 polypepdde is obtained or is obtainable by expressing the polypeptide encoded by the cDNA 
insert of the DNA45417-1432 vector deposited on May 27. 1998 as ATCC 209910. 
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55. EBQ2Q , . . . 

Applicants have identified a cDNA clone that encodes a novel polypeptide, wherein the polypepnde ts 

designated in the present application as "PR0273" 

m one embodimem. the invention provides an isolated nucleic acid molecule comprumg DNA encodmg a 
PR0273 polypeptide. In one aspect. Uk isolated nucleic acid comprises DNA encoding the PR0273 polypepode 
having ammo acid residues 1 through 111 of Figure 149 ,SEQ ID NO:370), or is complementary to such encodmg 
ti^clcic acid sequence, and remains stably bound to it under at least moderate, and optionally. u«ler high stringency 

comUtions. . . ^ 

in another embodiment, the invention provides isolated PR0273 polypeptide. In panicular. the invenuon 
provides isolated native sequence PR0273 polypeptide, which m one embodimem. includes an amino acid sequence 
comprising residues 1 through 111 of Figure 149 (SEQ ID NO:370). 

56. PWO701 

AppUcants have identified a cDNA clone that encodes a novel polypeptide having homology to neurohgms 
1 2. and 3. wherein the polypeptide is designated in the present application as -PRO70r. 

m one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encodmg a 
PRO701 polypeptide. In one aspect, the isolated nucleic acid comprises DNA encoding the PRO701 polypeptide 
having amino acid residues 1 through 816 of Figure 151 (SEQ ID NO:375), or Is complementary to such encodmg 
nucWc acid sequence, and remains stably bound to it under at least moderate, and optionally, under high stnngency 
co«iitions. isolated r^^ac acid seque«» may comprise d« cDNA insert of die vector deposited with the ATCC 
on March 31. 1998 which includes the nucleotide sequence encoding PRO701. 
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In anod.er cmbodimem. U,c invenuon provides .solattd PR070I polyp " ,„ particular. *e mvenuon 
provides isola^d nadve sequence PR0701 polypeptide, which in one embodimem. includes an amino acid sequence 
con,pnsing residues 1 duough 816 of Figure 151 (SEQ ID NO:375). An addiuonal en.bodin,em of U.e presem 
•nvenuon is directed to an isolated extracellular don^n of a PR0701 polypepude. Optionally, the PR0701 
polypepude is obtained or is obtainable by expressing the polypeptide encoded by the cDNA insen of ^ vector 
deposited with the ATCC on March 31, 1998. 

57. PRQ704 

Applicants have idemified a cDNA Cone that encodes a novel poiypep..<le having sequence idennty with 
V1P36. wherem the polypeptide is designated in the present application as "PRO704" 

In one embodiment, the invention provides an isoUted nucleic acid nK,lecule comprising DNA encoding a 
PRO704 polypepdde. In one aspect, the isolated nucleic acid comprises DNA encoding d,e PRO704 polypeptide 
havtng amino acid residues 1 through 348 of Figure 153 (SEQ ID NO:380,. or „ complementary to such encoding 
nuc^^ acid sequence, and remains stably bound to it under at least moderate, and optionally, under high stringency 
con^nons. iso.ted nucleic acid sequence may comprise the cDNA insen of the vector deposited on March 3 1 
1998 with the ATCC as DNA5091 1-1288. which includes the nucleotide sequence encoding PRO704 

in another embodiment, the invention provides isolated PRO704 polypeptide. In particular, the invention 
provides isolated nadve sequence PRO704 polypeptide, which in one embodiment, includes an amino acid sequence 
comprising residues I through 348 of Figure .53 (SEQ ,D NO:380). An additional embodiment of the present 
.nvennon is directed to an isolated extracellular domain of a PRO704 polypeptide. Optionally, the PRO704 
20 polypeptide is obuined or is obuinable by expressing the polypeptide encoded by the cDNA msert of the vector 
deposited on March 31. 1998 widj the ATCC as DNA5091 1-1288. 

58. iBsm 

Applicants have idendfied a cDNA clone dtat encodes a novel polypeptide havtng homology to prostatic acd 
.5 phosphatase precursor ^ lysosomal acid phosphatase ptecunor. wherein *e polypeptide is designated in the presem 
application as "FRO706". 

in one embodimem. the invention provides an isolated nucleic acid molecule comprising DNA encoding a 
PRO706 polypepUde. In one aspect, the isolated nucleic acid comprises DNA encoding the PRO706 polypeptide 
bavu^ amino acid residues 1 through 480 of Figure 155 (SEQ ID NO:385). or is complemcnuuy to such encodmg 
nt^letc acKi sequence, and remains stably bound to it under at least moderate, and optionally, under high stringency 
condtnons. TTk isolated ntKlcic acid sequence may comprise *e cDNA insen of the vector deposited on AprU 21 
1998 with die ATCC as DNA48329-I290 which includes dte nucleotide sequence encoding PRO706. 

In another embodimem. the invention provides isolated PRO706 polypeptide. In panicular. the invenuon 
P«,v«tes isolated native sequence PRO706 polypeptide, which in one embodiment, includes an amino acid sequence 
comprising residues 1 thnntgh 480 of Figu« 155 (SEQ ID NO:385). or comprismg residues 19 through 480 of Figure 
155 (SEQ ID NO:385). Optionally. d,e PRO706 polypeptide is obtained or is obtainable by expressing d,e 
polypeptide encoded by die cDNA insen of the vector deposited on April 21. 1998 with the ATCC as DNA48329- 
1290, 
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59. 

Applicaras have identified a cDNA clone that encodes a novel polypeptide having homology to cadherms. 
particularly cadherin FIB3. wherein the polypeptide is designated in the present application as -PRO707". 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encodmg a 
PRO707 polypeptide. In one aspect, the isolated nucleic acid comprises DNA encoding the PRO707 polypeptide 
having amino acid residues 1 to 916 of Figure 157 (SEQ ID NO:390). or is complementary to such encodmg nucleic 
acid sequence, and remains stably bound to it under a. least moderate, and opuonally. under high stringency 
conditions. The isolated nucleic acid sequence may comprise the cDNA insert of the vector deposited on May 27. 
1998 with the ATCC as DNA48306-1291 which includes the nucleotide sequence encoding PRO707. 

In another embodiment, the invendon provides isolated PRO707 polypeptide. In particular, the invemion 
provides isola«d native sequence PRO707 polypeptide, which in one embodiment, includes an amino acid sequence 
comprising residues 1 to 916 of Figure 157 (SEQ ID NO:390). An additional embodiment of the presem invemion 
is directed to an isolated extracellular domain of a PRO707 polypeptide. Optionally, the PRO707 polypeptide is 
obtained or is obuunable by expressing ti,e polypeptide encoded by the cDNA insert of the vector deposited on May 
27. 1998 wilh the ATCC as DNA48306-1291. 
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60. EBfi22Z 

AppBcaas have identified a cDNA clone that encodes a novel polypeptide having homology to neuropsm. 
wherein the polypeptide is designated in the presem application as -PR0322". 

In one embodimem. the invention provides an isolated nucleic acid molecule comprising DNA encoding a 
PR0322 polypeptide. In one aspect, the isolated nucleic acid comprises DNA encoding the PR0322 polypeptide 
having amino acid residues 1 or 24 through 260 of Figure 159 (SEQ ID N0:395). or is complememary to such 
encoding nucleic acid sequence, and remains stably bound to it umier at least moderate, and optionally, under togh 
stringency conditions. isolated mtcleic acid sequence may comprise the cDNA msert of the vector deposi««i on 
March 11 1998 as ATCC no. 209669 which includes the raicleotlde sequence encoding PR0322. 

in another embodiment, the invemion provides isolated PR0322 polypeptide. In particular, ti« invemion 
pnmdes isolated native sequence PR0322 polypeptide, which in one embodiment, includes an amino acid sequence 
comprising residues 1 or 24 through 260 of Rgure 159 (SEQ ID N0:395). An additional embodimem of the presem 
invemion is directed to an isobted extracellular domain of a PR0322 polypeptide. Optionally, the PR0322 
polypeptide U obtained or is obtainable by expressing the polypeptide encoded by tite cDNA insert of the vector 
30 deposited on Mardt 11. 1998 as ATCC no. 209669. 



25 
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61. miS2i . 

Applicams have identified a cDNA clone that encodes a novel polypeptide having sequence identity with 

ALS. wherein Uk polypeptide is dcsigmited in the presem application as "PR0526". 

in one embodimem. ti« invention provides an isolated nucleic acid molecule comprising DNA encoding a 
PR0526 polypeptide. In one aspect, the isolated nucleic acid comprises DNA encoding the PR0526 polypeptide 
having amino acid residues I to 473 of Figure 161 (SEQ ID NO:400). or is complememaiy to such encoding nucleic 
acid sequence, and remains suibly bound m it mider a, least moderate, and optionally, under high strmgency 
conditions. isolated nucleic acid sequence may comprise the cDNA insert of the vector deposited on March 26. 
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1998 witf, the ATCC as DNA44I84-1319 which includes the nucleoUde sequenc^odu-g PR0526. 

In another embodiment, die invention provides isolated PR0526 polypeptide. In panicular. dte invention 
provides isolated native sequence PR0526 polypeptide, which in ot„ embodiment, includes an ami™, acid sequence 
comprising residues 1 to 473 of Figure 161 (SEQ ID NO:400). Optionally, the PR0526 polypeptide is obtained or 
.s obtamable by expressing the polypeptide encoded by the cDNA insen of the vector deposited on March 26 1998 
with the ATCC as DNA44184-I319 which includes the ra^leotlde sequence encodmg PR0526. 

62. tBQSSl 

Applicants have identified a cDNA clone that encodes a novel polypeptide havtng sequence identity with 
protocadherins. wherein the polypeptide is designated in the present application as "PR053l». 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a 
PR0531 polypeptide. In one aspect, the .solated nucleic acid comprises DNA encoding die PR053I polypepUde 
havtng ami™, acid tesidues 1 to 789 of Ftgure 163 (SEQ ID NO:405). or is complemenuty to such encoding nucleic 
actd sequence, and remains subly bound to it under a. leas, moderate, and optionally, under high stringency 
condtnons. The isola«=d nucleic acid sequence may comprise the cDNA insen of dte vector deposited on March 26 
1998 as DNA48314-1320 which includes the nucleotide sequence encoding PR0531. 

In anoUter embodiment, the invention provides isolated PR053I polypeptide. In particular, die invention 
provides isola^d native sequence PR053 1 polypeptide, which in one embodiment, includes an amino acid sequence 
comprising residues 1 to 789 of Figure .63 (SEQ ID NO:405). An additiom,l embodimem of the present invention 
u dtreced to an isolated extracellular domain of a PR0531 polypeptide. Optionally, .he PR0531 polypeptide is 
obtamed or is obtainable by expressing *e polypepdde encoded by the cDNA insert of d« vector deposited on March 
26. 1998 as DNA48314-1320. 



63. PROS34 

Applicant, have identified a cDNA clone that encodes a novel polypeptide having sequence identity with 
disulfide tsomerase (sometimes referred to herem as protein disulfide isomerase). wherein die polypeptide is 
designated in the present application as ''PR0534". 

In one embodiment, die invemion provides an isolated nucleic acid molecule comprising DNA encoding a 
PR0534 polypeptide. In one aspect. d« isolated nucleic acid comprises DNA encoding ti«: PR0534 polypeptide 
havmg amino acid residues 1 to 360 of Figure 165 (SEQ ID NO:410). or is complememary to such encoding nucleic 
30 acd sequence, and remains stably bound to it under at least moderate, and optionally, under high stringency 
condiuons. TTie isolated micleic acid sequence may comprise the cDNA insen of the vector deposited on March 26. 
1998 as DNA48333-I321 which includes the nucleotide sequence encoding PR0534. 

In another embodimem. the invention provides isolated PR0534 polypeptide. In panicular. die invemion 
provmes isolated native sequence PR0534 polypeptide, which in one embodiment, includes an amino acid sequence 
35 comprising «sidues 1 to 360 of Figure ,65 (SEQ ID NO:410). An additional embodiment of die presem invention 
ts directed to an isolated extn«ellular domain of a PR0534 polypeptide. Optionally, die PR0534 polypeptide is 
Obtained or a obtainable by expressing d« polypeptide encoded by the cDN A imen of the vector deposited on March 
26. 1998 as DNA48333-1321. 
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64. PRO<i»7 ^ . 

Applicants have idemif.ed a cDNA clo« .hat encodes a novel polypeptide having sequence ,den„ty w„h 

sFRPs wherein the polypeptide is designated in the present application as "PR0697-. 

' in one embodiment, the invention provides an isolated nucleic ac.d molecule comprismg DNA encoding a 
PR0697 polypepiide. In one aspect, the isolated nucleic acid comprises DNA encoding the PR06^ polypeptide 
having amino acid residues 1 d^ough 295 of Figure 167 (SEQ ID NO:415). or is complemenury to such encoding 
nucleic acid sequence, and remains subly bound to it under a. least moderate, and optionally, under high stringency 
c^Klidons. 11. isolated nucleic acid sequence may comprise die cDN A insert of me vector deposited with the ATCC 
on March 26. 1998 as DNA50920-1325 «hich includes the nucleotide sequence encoding PR0697. 

to another embodiment, die invention provides isolated PR0697 polypeptide. In parocular. the invention 

provides isolated native sequence PR0697 polypeptide, which in one embodiment, includes an amino acid sequence 

comprising residues 1 through 295 of Figure 167 (SEQ ID NO:415). Optionally, the PR0697 polypeptide is ob^nc. 

0, is Obtainable by expressing die polypeptide encoded by the cDNA insert of the vector deposited with the ATCC 

on March 26. 1998 as DNA50920-1325. 



Applicants have «len.ified a cDNA clone that encodes a novel 12 transmembrane polypeptide, whcrem dK 
polypeptide is designated in die present application as ••PR0717- 

inoneembodiment. d« invention provides an isolated nucleic acid molecule comprising DNA encodmg a 
PR0717 polypeptide. In one aspect, die isolated nucleic acid comprises DNA encoding die PR0717 polypeptide 
.0 having amino acid residues I through 5«, of Figure .69 (SEQ ID NO:420). or is complementary to «Kh encodmg 
' „acleicacidsequence.andremainssub.yboundtoitunderat,eastmoderate.andop.ion^^^ 

condidons. The isolated nucleic acid sequence may comprise die cDNA insert of die vector deposited on Apr.. 28. 
,998 wid, die ATCC as DNA50988-1326 which includes die nucleotide sequence encodmg PR0717. 

toanodierembodimem. die invention provides isolated PR0717 polypeptide, to particular, d.e invention 
2S p„videsiso.a.edna.vesequencePR07.7polypepdde.whichinoneembodiment.^^^^^^ 

comprising residues 1 dirough 560 of Figure 169 (SEQ ID NO:420). An additional embodmient of die pre^n 
invention is directed to an isolated extracellular domain of a PR07.7 polypeptide. Optionally, die PR07n 
polypepdde is obtained or is obuiinable by expressing die polypeptide encoded by die cDNA i«en of ^ vector 
deposited on April 28. 1998 widi d« ATCC as DNA50988-1326. 



30 
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Applicams have idenufed a cDNA clone dui. encodes a novel polypeptide having sequence identity widi 
protocadherin 4. wherein die polypeptide te designated in Uie present application as -PR0731 " 

to one embodiment, die invention provides an isolated nucleic acid molecule comprising DNA encodmg a 
PR073, poiypeptide. In one aspect, die isolated ™.c.eic acid comprises DNA encoding die PR0731 polyp^ndc 
havtogaminoacid residues 1 duough 1.84 of Figure 171 (SEQ ID NO:425). or is complementary to such encodmg 
^Ceic add sequence, and remains subly bound to it under a. least moderate, and optionally. u«ier high «n„gency 
conditions. ^ isolaied nucleic acid sequence may comprise die cDNA inser. of dK vector deposited on March 3. . 
1998 wid, dK ATCC as DNA48331-1329 which includes dK nucleotide sequence encoding PR073.. 
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In another cmbodinKni. iht invention provides isolated PR073I polypeptide. In panicular. Oie invention 
provides isolated nadve sequence PR0731 polypeptide, which in one embodiment. uK:ludes an amino acid sequence 
comprising residues 1 through 11 84 of Figure 17, ,SEQ ID NO:425). An additional embodiment of the present 
mvention is directed to an isolated extracellular domain of a PR0731 polypeptide Optionally, the PR0731 
polypeptide is obtained or is obtainable by expressing the polypeptide encoded by the cDNA insen of the vector 
5 deposited on March 31 , 1998 with the ATCC as DNA48331-1329. 

«7. mm 

AppUcants have identified a cDNA Cone .hat encodes a novel multi-transmembrane protein having sequence 

10 ^'^ZT ' '"P''""^- " 

In one embodimem. the invention provides an isolated nucleic acid molecule comprising DNA encoding a 
PR0218 polypeptide. In one aspect, the isolated nucleic acid comprises DNA encoding the PR0218 polypeptide 
havuig amino acid residues I through 455 of Figure 173 (SEQ ID NO:430). or is complementary to such encoding 
nucleic acid sequence, and remains stably bound to it under a. least moderate, and optionally, under high stringency 
15 conditions. -Die isolated nucleic acid sequence may comprise the cDNA insen of the vector deposited on April 28 
1998 with the ATCC as DNA30867-1335 which includes the nucleotide sequence encodmg PR0218. 

In anoOier embodiment, the invendon provides isolated PR0218 polypeptide. In panicular. .he invendon 
provides isolated nadve sequence PR0218 polypeptide, which in one embodimem. includes an amino acid sequence 
comprising residues 1 through 455 of Figure 173 (SEQ ID NO:430). Optionally, .he PR0218 polypeptide is obtained 
or « obtamable by expressing d« polypeptide encoded by Ae cDNA insen of Uk vector deposited on April 28 1 998 
with the ATCC as DNA30867-1335. 

In another embodimem. the invemion provides an expressed sequence ug (EST) sequence comprising the 
raicleoade sequence of Figure 174 (SEQ ID NO:43I}. designated herem as DNA14472. 

In another embodimem. die invendon provides an expressed sequence ug (EST» sequence comprising the 
raicleonde sequence of Figure 175 (SEQ ID NO:432). designated herein as DNA15846. 

68. tEom 

Applicants have identified a cDNA clone that encodes a novel polypeptide having sequence identity with 
mtegnns. whercm die polypepdde is designated in the present application as "PR0768". 

In one embodimem. the invemion provides an isolated nucleic acid molecule comprising DNA encoding a 
PR0768 polypeptide. In one aspect, the isolated nucleic acid comprises DNA encoding *e PR0768 polypeptide 
havmg amino acid «sidues I through 1 141 of Figure 177 (SEQ ID NO:437). or is complementary to such encoding 
nucleic acid sequence, and remain stably bound to it umler at least moderate, and opuonaUy. under high stringency 
conditions. Tbc isolated mideic acid sequence may comprise die cDNA insen of die vector deposited on April 6 
35 1998 as DNA55737-1345 which inchides die raicleotide sequence encoding PR0768. 

In anodier embodimem. the invention provides isolated PR0768 polypeptide. In particular, d« invention 
provides isolated nadve sequem:e PR0768 polypeptide, which in one embodimem. includes an amim, acid sequence 
compnsing residues 1 d„ough 1 141 of Figure 177 (SEQ ID N0:437). An additional embodiment of the presem 
invention is directed to an isolated extracellular domain of a PR0768 polypeptide. Optionally, die PR0768 
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polypepudc is obuined or is obuinable by expressing U.e polypeptide encoded by d,e cDNA uaert of vector 
deposited on April 6. 1998 as DNA55737-1345. 

69. PR0771 

Applicants have identified a cDNA clone that encodes a novel polypeptide having sequence identity with 
tcstican. wherein the polypeptide is designated in the present application as "PR077r 

In one embodiment, die invention provides an isolated nucleic acid molecule comprising DNA encoding a 
PR0771 polypepUde. In one aspect, the isolated nucleic acid comprises DNA encoding the PR0771 polypeptide 
having anuno acid residues 1 through 436 of Figure 179 (SEQ ID NO:442). or is complementary to such encoding 
nucleic acid sequence, and remains stably bound to i. under a. least moderate, and optionally, under high stringency 
conditions. Hie isolated nucleic acd sequence may comprise die cDNA insert of die vector deposited on AprU 7. 
1998 with die ATCC as DNA49829-1346 which includes die nucleotide sequence encoding PR0771. 

in anodier embodiment, die invention provides isolated PR0771 polypeptide. In particular, the invention 
provides isolated nadve sequence PR077 1 polypeptide, which in one embodiment, includes an amino acid sequence 
comprising residues 1 dirough436 of Figure 179 (SEQ ID NO:442). Optionally, die PR0771 polypeptide is obuined 
or is obtainable by expressing die polypeptide encoded by die cDNA insert of die vector deposited on April 7. 1998 
wiUt the ATCC as DNA49829-1346. 

70. PRQ733 

Applicants have identified a cDNA clone diat encodes a novel polypeptide having sequence identity with 
Tlisn receptor binding protein, wherein die polypeptide is designated in die present application as -PR0733-. 
in one embodiment, die invention provides an isolated nucleic acid molecule comprising DNA encoding a 
PR0733 polypeptide. In one aspect, die isolated nucleic acid comprises DNA encoding the PR0733 polypeptide 
having amino acid residues 1 dirough 229 of Figure 181 (SEQ ID NO:447). or is complementary to such encodu« 
rwcleic add sequence, and remains stably bound to it under at least moderate, and optionally, under high smngency 
conditions. The isolated micleic acid sequence may comprise die cDNA insert of dK vector deposited on Apnl 7. 
1998 witii die ATCC as DNA52196-1348 which includes die nucleotide sequence encoding PR0733. 

b anodier embodiment, die invention provides isolated PR0733 polypeptide, ta panicular. die invemion 
provides isolated native sequence PR0733 polypeptide, which in one embodimem. includes an amino acid sequence 
comprising residues 1 dirough 229 of Figure 181 (SEQ ID NO:447). An additional embodiment of die presem 
invention is directed to an isolated extracellular domain of a PR0733 polypeptide. Optionally, die PR0733 
polypeptide is obtained or is obtainable by expressing die polypeptide encoded by d« cDNA inscr. of the vector 
deposited on April 7, 1998 as DNA52I96-1348. 
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71. £Bfil£Z 

Applicants have identified a cDNA clone duit encodes a novel polypeptide having sequence idenuty wid, 
pancreatitis-associated protein, wherein the polypeptide is designated in die present appUcation as -PR0162" 

In ot. embodimem, die invention provides an isolated nucleic acid molecule comprising DNA encoding a 
PR0162 polypeptide. In one aspect, die isolated nucleic acid comprises DNA encoding die PR0162 polypeptide 
having amino acid residues 1 duough 175 of Figure 183 (SEQ ID N0:4S2). or is complementary to such encoding 
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nudcc acid sequence, and renmns stably bound .0 i. u^ler a, least moderate, and op.io„ally. under h,gh s.r.n«.y 

~ J'lT'' """" ^^^^ or .e vector deposaed on May 6. 

1998 w.d. Che ATCC as DNA56965-1356 which includes U,e nucleo..de sequence encoding PR016. 

In another embodiment, the invention provides isolated PR0162 polypepnde. In particular, the invention 
provK.es .soUted nanve sequence PR0162 polypeptide, which .n one ena,odin«nt. includes an anino acid sequence 
comprising residues 1 duough 175 of Figure 183 (SEQ ,D NO:452,. Optionally, dte PR0.62 polypeptide .s obta^ed 
or . obt^unable by expressing d,e polypeptide encoded by *e cDNA insen of the vector deposited on May 6 1998 
With ihe ATCC as DNA56965-1356. ' 



72. PRQ788 



Apphcants have identified a cDNA clone that encodes a novel polypeptide having sequence identity with 
anu-neoplastK urinary protein, wherein the polypeptide is destgnated in the present application as "PROVSS- 

PRO,«« r T""™' " ""'^'^ ^-"--S -««n8 a 

PR0788 polypepnde. In one aspect. d,e isolated nucleic acid comprises DNA encoding the PR0788 polypeptide 

havmg anuno ..d residues 1 trough 125 of Figure 185 (SEQ ID NO:454,. or is complementary to such encoding 
15 --^^--^e^-andrenuunsstablyboundtoitunder 

con^tuons. The isolated nucleic acid sequence may comprise U,e cDNA insert of the vector deposited on May 6 

1998 wnh the ATCC as DNA5«05-1357 which includes the nucleotide sequence encoding PR0788 

m another embodia«m. the invention provides isolated PR0788 polypeptide. In particular, the invention 

provdcs «o..ed native sequence PR0788 polypepUde. which in one embodintcnt. includes an amino acid sequence 
.0 compnsmg residues 1 duough 125 of Figure 185 (SEQ ID NO:454). An additional embodiment of dte present 

.nvendon is directed to an isolated extracellular domain of a PR0788 polypeptide. Optionally. U« PR0788 

polypepnde .s obuined or is obuunab.e by expressmg d,e polypeptide encoded by the cDNA insert of the vector 

deposited on May 6. 1998 witfi dje ATCC as DNA56405-1357 

25 73. mam 

Applicant, have idendfied a cDNA clone dtat encodes a novel polypeptide having sequence identity with 
d.ckkopf-1 (dkk-1). Wherein the polypeptide is designated in die present application as -PROIOOS" 

P»n,rv»'""""^™"' " "^"''^ ""'""'^ DNA encoding a 

PRO1008 polypeptide. one aspect, the isolated nucleic acid comprises DNA encoding U,e PRO1008 polypeptide 
havmg ammo acid residues 1 through 266 of Figure 187 (SEQ ID NO:456). or is comptemcntaty to such encoding 
nuce. acid sequence, and rctnains stably bound toU 

-n^ons. m isolated nucleic acid seque«:e may comprise the cDNA insert of the vector deposited on May 20 
1998 wid, d.e ATCC as DNA57530-I375 which includes the nucleotide seque«:e encoding PROI008 

to anod^r embodiment, die invention provides isolated PRO1008 polypeptide. In particular, the invention 
provides isolated nadve sequence PR01(X)8 polypeptide, which in one embodiment, includes an amino acid sequence 
compnsmg residues 1 dirough 266 of Figure 187 (SEQ ID NO:456). Optionally, the PRO.008 polypeptide is 
^ta^or is Obtainable by expressing die pol>,eptide encoded by die cDNA insert of the vector deposited on May 
20. 1998 widi die ATCC as DNA57530-1375. 
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In another embodiment, the invent.on provides an expressed sequence tag (EST) designated herein as 
DNA16508 comprising the nucleotide sequence of Figure 188 (SEQ ID NO:457). 

74. £RQlfil2 

Applicants have identified a cDNA clone that encodes a novel polypeptide having sequence idenitty with 
5 disulfide isomcrase and pbospholipase C. wherein the polypeptide is designated in the present application as 
•PRO1012". 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encodmg a 
PR01012 polypeptide. In one aspect, the isolated nucleic acid comprises DNA encoding the PRO1012 polypeptide 
having amino add residues 1 through 747 of Figure 190 (SEQ ID NO:459). or is complemenury to such encoding 
10 nucleic acid sequence, and remains subly bound .o « under a, least moderate, and optionally, under high strmgency 
conditions. The isolated nucleic acid sequence may comprise the cDN A insert of the vector deposited on May 14. 
1998 with the ATCC as DNA56439-1376. which includes the nucleotide sequence encoding PR01012. 

In another embodiment, the invenuon provides isolated PRO1012 polypeptide. In particular, the inveKkm 
provides isolated native sequence PRO1012 polypeptide, which in one embodiment, includes an amino acid sequence 
15 comprising residues 1 through 747 of Figure 190 (SEQ ID NO:459). Optionally. ti« PRO1012 polypeptide is 
obtained or is obuunable by expressmg ti« polypeptide encoded by the cDN A insert of ti« vec«,r deposited on May 
14. 1998 witfi tiie ATCC as DNA56439-1376. 

75. SSSUSIA . „ 

20 Applicants have idemif.ed a cDNA clone that encodes a novel polypeptide having sequence .dennty wtth 

reductase, wherein ihc polypeptide is designated in the present application as "PROIOM". 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a 
PRO1014 polypeptide. In one aspect, the isolated nucleic acid comprises DNA encoding ti»e PROIOM polypepadc 
having amino acid residues 1 U^ough 300 of Figure 192 (SEQ ID NO:464). or is complememary u, such encoding 
.5 n«:leic acid sequence, and remains stably bound to it under a. leas, moderate, and optionally, under high stringency 
conditions. Tlte isolated nticleic acid sequence may comprise tite cDNA insert of tite vector deposited on May 20. 
1998 as DNA56409-1377 with the ATCC which includes ti« mtcleotide sequence encodmg PRO1014. 

In another embodiment, the invemion provides isolated PRO1014 polypeptide. In particular, ti^e invemion 
provides isolated native sequence PROIOM polypeptide, which in one embodimem. includes an amino acid sequence 
30 comprising residues I ti^ough 300 of Figure 192 (SEQ ID NO:464). Optionally. Ok PRO1014 polypeptide is 
obtained or is obtainable by expressing the polypeptide encoded by dte cDNA insert of tite vector deposited on May 
20. 1998 as DNA56409-1377 witii the ATCC. 

76. mum 

35 Applicants have identified a cDNA clone that encodes a novel polypeptide having sequence identity w«h 

HNK-1 sulfoiransferase. wherein tite polypeptide is designated in the present application as "PROlOn-. 

to one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding a 
PRO1017 polypeptide. In one aspect, tite isolated nucleic acid comprises DNA encoding .he PRO1017 polypeptide 
having amino acid residues 1 tiuough 414 of Figure 194 (SEQ ID NO:466). or is complementary to such encoding 
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nucleic acid sequence, and remains subly bound .o 1. under a, least moderate, and op,io„ally. under high stringency 
condiuons. n,e isolated nucleic acid sequence may comprise the cDNA insert of d,e vector deposited on M.y 20 
1998 wiih Ae ATCC as DNA56112-1379 which includes U,e nucleotide sequence encoding PRO10I7. 

In another embodunent. die invendon provides isolated PRO10I7 polypeptide. In panicuiar. the invention 
provides isolated nadve sequence PRO1017 polypeptide, which m one embodimem. i«:ludes an anuno acid sequence 
comprising residues 1 dirough 414 of Figure 194 (SEQ ID NO:466). OpUonally. d« PRO1017 polypeptide is 
obtained or is obtainable by expressing ^ polypeptide encoded by the cDNA insert of Uie vector deposited on May 
20, 1998 with die ATCC as DNA561 12-1379. 

77. PR0474 

Applicants have idennfied a cDNA clone that encodes a novel polypeptide having sequence identity with 
dehydrogenase, wherein the polypeptide is designated in die present application as "PR0474-. 

In one embodimem. die invention provides an isolated nucleic acid molecule comprising DNA encoding a 
PR0474 polypeptide. In one aspect, die isolated nucleic acid comprises DNA encoding die PR0474 polypeptide 
havmg amino acid residues 1 dirough 270 of Figure 196 (SEQ ID NO:468,. or is complementary to such encoding 
nucleic acid sequence, and remains stably bound to it under at leas, moderate, and optionally, under high stringency 
condihons. n« isolated nucleic acid sequence may comprise .he cDNA insert of *e vec.or deposi.ed on May 14 
1998 widi die ATCC as DNA56045-1380 which includes die nucleotide sequence encoding PR0474. 

In anodier embodiment, die invention provides Uolated PR0474 polypeptide. In particular, die invention 
provides isolated nadve sequence PR0474 polypeptide, which in one embodiment, include, an amino acid sequence 
20 con^rising residues 1 through 270 of Figure 196 (SEQ ID NO:468). Optionally. Uk PR0474 polypeptide is obtained 
or IS obtainable by expressing die polypeptide encoded by die cDNA insen of die vector deposited on May 14 1998 
widi die ATCC as DNA56045-I380. 

78. iBDmi 

25 AppUcaats have identified a cDNA clone diat encodes a novel polypeptide having sequence identiQ, widi 

IL-17. wherem die polypeptide is designated in die present application as "PRO103r . 

in one embodimem. die invention provides an isolated nucleic acid molecule comprising DNA encoding a 
PRO1031 polypeptide. In one aspect, die isolated raicleic acid composes DNA encodi,« dK PROI031 polypeptide 
havmg amino acid residues 1 duough 180 of Figure 198 (SEQ ID NO:470). or i, completncMaiy u> such encoding 

30 nucleic add sequence, and remains stably bound to it u«ler at least moderate. ^ optionally. u«ler high stringency 
condinons. Tlie isolated nucleic acid sequence may comprise die cDNA insen of d» vector deposited on May 14 
1998 wid, die ATCC as DNA59294-1381 which includes die nucleotide sequence encoding PRO1031. 

inanodierembodimem. die invention provides isolated PRO1031 polypeptide. In particular, die invention 
pn,vides isolated native sequence PRO1031 polypeptide, which in one embodimen.. includes an amino acid sequence 

35 comprising residues 1 dirough 180 of Figure 198 (SEQ ID NO:470). OpUonally. die PRO1031 polypep.ide is 
obtamed or b obai«rt,te by expressing die polypeptide encoded by die cDNA insen of die vector deposited on May 
14. 1998 widi die ATCC as DNA59294-138I. 



90 



eNsooao- <v*ro_994e2ei«j.> 



• PCT'i;S99 '05028 

79. ESQ228 

Applicants have identified a cDNA clone that encodes a novel polypeptide having sequence identity to 
protein disulfide isomcrase. wherein the polypeptide is designated in the present applicaaon as 'PROnV. 

In one embodimem. the invention provides an isolated nucleic acid molecule comprUing DNA encoding a 
PR0938 polypeptide. In one aspect, the isolated nucleic acid comprises DNA encoding the PR0938 polypeptide 
having amino acid residues 1 to 349 of Rgure 200 (SEQ ID NO:472). or is complementary to such encoding nucleic 
acid sequence, and remains stably bound to it under at least moderate, and optionally, under high stringency 
conditions. In other aspects, the isolated nucleic acid comprises DNA encoding the PR0938 polypeptide having 
amino acid residues about 23 to 349 of Figure 200 (SEQ ID NO:472) or amino acid I or about 23 to X of Figure 200 
(SEQ ID NO:472). where X is any amino acid from 186 to 195 of Figure 200 (SEQ ID NO:472). or is 
complementary to such encoding nucleic acid sequence, and remains stably bound to it under at least moderate, and 
oprionaUy. under high stringency conditions. The isolated nucleic acid sequence may comprise tiie cDNA insert of 
the DNA56433.1406 vector deposited on May 12. 1998. as ATCC Accession No. 209857 which inchides the 

nucleotide sequence encoding PR0938. 

In another embodimem. the invcmion provides isolated PR0938 polypeptide. In particular, the invention 
provides isolated native sequence PR0938 polypeptide, which in one embodimem. includes an amino acid sequence 
comprising residues 1 to 349 of Figure 200 (SEQ ID NO:472). Additional embodiments of the present invention are 
directed to PR0938 polypeptides con^rising amino acids about 23 to 349 of Figure 200 (SEQ ID NO:472) or amino 
acid 1 or about 23 to X of Figure 200 (SEQ ID NO:472). where X is any amino acid from 186 to 195 of Figure 200 
(SEQ ID NO:472). Optionally, tiic PR0938 polypeptide is obtained or is obtainable by expressing the polypeptide 
encoded by die cDNA insert of tiie DNA56433-1406 vector deposited on May 12. 1998. as ATCC Accession No. 
209857. 

80. mam 

Applicants have identified a cDNA clone tiiat encodes a novel polypeptide having sequence identity with 
a lectin-Iikc oxidized LDL receptor, wherein the polypeptide is designated in tije present application as •PRO1082-. 

In one embodimem. tiie invemion provides an isolated nucleic acid molecule comprising DNA encoding a 
PRO1082 polypeptide. In one aspect, the isolated nucleic acid comprises DNA encoding titc PRO1082 polypeptide 
having amim. acid residues 1 tiuough 201 of Figure 202 (SEQ ID NO:477), or is complementary to such encoding 
ncleic acid sequence, and remains stably bound to it under at least moderate, and optionally, under high stringency 
conditions. TTk isolated nucleic acid sequence may comprise tiie cDNA insert of die vector deposited on May 14. 
1998 witij tiw ATCC as DNA539I2-1457 which includes the nucleotide sequence encodmg PRO1082. 

in anodier embodimem. Uie invention provides isolated PRO1082 polypeptide. In particular. \bt invemion 
provides isolated native sequence PRO1082 polypeptide, which in one embodimem. includes an amino acid sequence 
comprising residues 1 tiirough 201 of Figure 202 (SEQ ID NO:477). An additional embodimem of the presem 
invention is directed to an isolated domain of a PROI082 polypeptide, excluding tiic transmembrane domain. 
Optionally, the PRO1082 polypeptide is obtained or is obtainable by expressing tiie polypeptide encoded by the cDNA 
insert of Uk vector deposited on May 14. 1998 witit the ATCC as DNA53912-1457. 
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Applicants have idcnuficd a cDNA clone du« encodes a novel polypeptide having sequence identic, with 
a 7TM receptor, latrophilin-related protein 1. and a macrophage restricted cell surface glycoprotein, wherein the 
polypepude is designated in the present application as -PROIOSS". 

In one embodiment, the invention provides an isolated nucleic ac,d molecule comprUing DNA encoding a 
5 PRO1083 polypeptide. In one aspect, the isolated nucleic acid comprises DNA encoding .he PRO1083 polypeptide 
havmg amino acid residues 1 through 693 of Figure 204 (SEQ ID NO:483). or is complementary to such em«xling 
nuclei add sequence, and remains stably bound to it under a, leas, moderate, and optionally, under high stringency 
conditions. Ute isolated nucleic acid sequence may comprise U.e cDNA insen of the vector deposited on May 12 
1998 with Ihe ATCC as DNA50921.1458 which includes the nucleotide sequence encoding PRO1083. 
10 In another embodiment, the invention provides isolated PRO1083 polypeptide. In particular, the invention 

p«wides isolated native sequence PRO1083 polypeptide, which in one embodimem. includes an amino acid sequence 
compnsing residues 1 through 693 of Figure 204 (SEQ ID NO:483). An additioma embodiment of the preset,, 
mvemton is directed to an isolated extracellular domain of a PRO1083 polypeptide. Optionally, die PRO1083 
polypeptide is obuuned or is obtainable by expressing d.e polypeptide encoded by d« cDNA insert of the vector 
15 deposited on May 12. 1 998 with the ATCC as DNA5092 1-1458. 

In another embodiment, the invemion provides an expressed sequence tag (EST) desigK„cd herein as 
DNA24256 which comprises the nucleotide sequence of Figure 205 (SEQ ID NO:484). 
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82. PRO20ff 

The objects of this invention, as defined generally ijufia. are achieved a. least in pan by die provision of 
a novel polypeptide. VEGF-E also herein designa.ed PRO200. (SEQ ID NO:488) and ti,e nucleic acid encoding 
therefor. SEQ ID NO:487. residues 259 through 1293. 

In one embodimem. the invention provides an isolated nucleic acid molecule comprising DNA encoding a 
VEGF-E polypeptide. In one aspect, die isolated nucleic acid composes DNA encoding VEGF-E polypepude 
having amino acid residues I duough 345 of Figure 207 (SEQ ID NO:488). or ts complementary to such encoding 
nucleic acid sequence, and remains stably bound to i. under low stringency conditions. In anotiier embodimem 
variant are provided wherein the VEGF-E m«leic acid has single or multiple deletions, substimtions. insertions.' 
truncations or combinations thereof. 

In another embodimem. die invemion provides isolated VEGF-E polypeptide. In panicular. U>e invemion 
30 provides an isolated native sequence VEGF-E polypeptide, which in one embodimem. includes an amino acid 
sequence compiling residues 1 duough 345 of Figure 207 (SEQ ID NO:488). In anotiier embodimem variants are 
provided Wherein U« VEGF-E polypeptide has single or multiple deletions, substimtions. insertions, truncations or 
combinations thereof. 

Inyetlimher embodimems. die presem invemion is directed .o compositions useful for treating indications 
35 where proliferation, survival ami/or differemiation of cells is desired, comprising a tiierapeutically effective amount 
of a VEGF-E polypeptide hereof in admixture witfi a phannaceutically acceptable carrier. 

m invemion fimher includes associated embodiments of VEGF-E such as modified VEGF-E polypeptides 
and modified variams which have die same biological application as VEGF-E. and pharmaceutical compositions 
mcorporatingsame. Inhibitors of VEGF-E are also provided. 
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83. gpn7«'; and PR0286 

Appbcanu have identified .wo novel cDNA clones Aai encode novel hunun Toll polypeptides, dcs.gna.ed 
.„ a,e presen. apphcauon as PR0285 (encoded by DNA40021-1 154) and PR0286 (encoded by DNA42663-1 154) 
In ox embodiment, die invention provides an .sola.ed nucle.c acid molecule comprising a DNA encoding 
a polypeptide hav.ng at leas, about 80% sequence identity, preferably at least about 85% sequence ident.ry. more 
5 preferably a. leas. abou. 90% sequence identity, most preferably at least about 95% sequence .dent.n, to (a) a DNA 
molecule encoding a PR0285 polypeptide having amtno acd restdues 27 .o 839 of Figure 209 (SEQ ID NO:496); 
or (b) «> a DNA molecule encodmg a PR0286 polypeptide hav.ng amino acd residues 27 to 825 of Figure 2 1 1 (SEQ 
ID NO-498) or (c) d« complement of the DNA molecule of (a) or (b). m complemenury DNA molecule preferably 
remains subly bound to such encoding nucleic acd sequence under a. least moderate, and op.,onally. under h.gh 

10 siringcncv conditions. 

in a further embodiment. d,e isolau.1 nucleic acd molecule comprises a polynucleo.ide *a. has a. leas. abou. 
90% preferably at least abou. 95% sequence identity mth a polynucleotide encoding a polypeptide comprtsing Ovc 
sequence of amino acids I to 839 of Figure 209 (SEQ ID NO:496); or a. least about 90% . preferably at leas. abou. 
95% sequence idcnuty with a polynucleotide encoding a polypept.de comprising the sequence of am.no ac.ds 1 to 

15 1041 of Figure 21 1 (SEQ ID NO:498). 

in a specific embodiment. Ok .nvemion provides an isolated nucleic acid molecule compris.ng DNA 
encoding nanve or variant PR0285 and PR0286 polypeptides, witi, or wid,out Ce N-.erminal signal sequence, and 
wid. or wid«ut d« oansmembrane regions of the respective fuU-.engO. sequences, in one aspect. a.e isolated rnKtoc 
acid comprises DNA encoding a mamre. full-length nat.ve PR0285 or PR0286 polypeptide havmg amino ac.d 

,0 residues I to .049 of Figure 209 (SEQ ID NO:496) and 1 to 1041 of Figure 211 (SEQ ID NO: 498). or is 
complemen,a...osuchencodingnudeicacidsequence. In another aspect. Ute invemton concerns an isola.ed miclcic 
add molecule d.at comprises DNA encoding a native PR0285 or PR0286 polypeptide wi*o«. an N-terminal signal 
sequence, or is complementary to such encoding mK:leic acd sequence. In ye. amHher embodmtem. the invention 
concent nudeic acid encoding «nsmembrane-domain deleted or inactivate, forms of Uk full-length native PR0285 

25 or PR0286 proteins. mMA^nn^i 
to anod«r embodiment. Ok invemion the isolated nucleic acid molecule compr>ses tite clone (DNA40021. 
1 ,54) deposited on October 17. 1997. under ATCC number 209389: or ti,e clone (DNA42663-1 154) deposi«d on 
Ocmbcr 17. 1997, under ATCC number 209386. 

,„ ye. another embodiment. Utc invendon provides a vector comprising DNA encoding PR0285 and PR0286 
30 polypeptides, or .heir variant. Tlius, ti« vector may comprise any of ti« isolated mtcleic acid molecules hereinabove 

defined. ^ . . . 

toanoti«rembodimcm.*einventionprovidesisola«dPR0285andPR0286 polypepudes. Inpamcular. 

the im^emion provides isola«d native sequence PR0285 and PR0286 polypeptides, which m one embodm^m. mdude 
.he ami™ acid sequences comprising residues 1 to 1049 and 1 to .041 of Figures 209 and 21 1 (SEQ ID NOS:496 
35 and498) «specnve.y. -n* invention a.so provides for variants of ti,e PR0285 and PR0286 polypeptides which are 
encoded by any of ti.e isolated nudeic acid molecules hereinabove defined. Specific variants include, but are not 
deletion .truncated) variants of .he fuU-lengti> native sequence PR0285 and PR0286 po,ypcp..des which 
lack ti,e respecnve N-terminal signal sequences and/or have ti^cr respective transmembrane and/or cytoplasms 
domains deleted or inactivated. 
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The mvemion specifically u«iudes anybodies wi* dual specificities.^bispeclfic annbodies binding 
n»re than one Toll polypeptide. 

In yet another embodintent. the invention concerns agonists and antagonists of the native PR0285 and 
PR0286 polypeptides. In a particular embodiment, the agonist or antagonist .s an anti-PR0285 or a„ti-PR0286 

In a further embodimem. the invention concern screening assays to identify agonist or antagonists of the 
native PR0285 and PR0286 polypeptides. 

In a still further embodiment. ^ invention concerns a composition comprising a PR0285 or PR0286 
l^^, or an agonis, or antagonist as hereinabove defined, in combinauon with a phar™a«udcaHy acceptable 

pon,.. ^ ' ""^""^ ^""^'y »"«'ing a PR0285 or 

PR0286 polypcpode. in combination with a pharmaccutically acceptable carrier. 

The invention also concerns a method of treating septic shock comprising administering to a pa.iem an 
effective amount of an antagonist of a PR0285 or PR0286 polypeptide. In a specific embodimem. the antagonist 
IS a blockmg antibody specifically binding a native PR0285 or PR0286 polypeptide. 

PR021M.PROn30«nH PPr>fl4^o 

The present invention concerns compositions and methods for the diagnosis and treatment of neoplastic cell 
growth andp^fiierationinmammals. including humans. THe presem invention is based on the identification of genes 
*at are amplified In the genome of cumor cells. Such gene amplification is expected to be associated with the 
ovet^xpression of d« gene product and comribute to mmorigenesis. Accordingly, the proteins e™:oded by the 
anvhlied ge.« m believed to be usefiil targets fbr the diagnosis and/or treatmem rincluding prevemion) of certain 
cancers, and may m as predictors of the prognosis of tumor treamiem. 

one embodiment, the pr«em invention provides an isolated nucleic acid molecule comprising DNA encoding a 
PR0213-1. PRO1330 and/or PROI449 polypeptide. In one aspect, the isolated nucleic acid comprises DNA 
encoding the PRO213-I.PRO1330 and/or PROI449 polypeptide having amino acid residues 1 .0 295 of Figure 213 
(SEQ ID NO:506). 20 to 273 of Figure 215 (SEQ ID NO:508) and 20 to 273 of Figure 217 (SEQ ,D NO-510) 
«spect.vely. or is complementary to such encoding nucleic acid sequence, and remains stably bou«l to U under ai 
I«« modenue. and optionally, under high stringency conditions. The isolated nucleic acid sequence may comprise 
the cDNA msert of the vector designated as DNA30943.1163 (ATCC 209791) deposited on April 21 1998- 
30 DNA64907.1163-1 (ATCC 203242) deposited on September 9. .998 and/or DNA64908-n<i3-l (ATCC M3243; 
deposited on September 9. 1998. 

In another embodimem. the presem invention comprises an isolated nucleic acid molecule having at leas, 
about 80% sequence idemity. preferably a. least about BS% sequence idemity. more preferably at leas, about 90% 
sequence id«d^. mos. prefaaWy a. leas. about 95% sequence idemity to (a) a DNA molecule encoding a PR02I3.1 
35 PRO1330 and/or PR01449 polypeptide having amino acid residues 1 to 295 of Figure 213 (SEQ ID NO.506, 20 
.0 273 of Figure 215 (SEQ ID NO:508) and 20 to 273 of Figure 2.7 (SEQ ID NO:5.0). respective^, or (b)'ti,e 
compkmem of the DNA nwlecule of (a). 

In anotiier embodiment, the invention provides an isolated PR0213-1. PRO1330 and/or PR01449 
pobWtide. to particular. Ok invention provides isolated native sequence PR0213-1. PRO1330 ami/or PR01449 
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polypeptide which in oSbodiinem. includes an anuno acid sequence comprising residues 1 lo 295 of Figure 2 13 
(SEQ ID NO:506). 20 to 273 of Figure 215 (SEQ ID NO:508, or 20 to 273 of Figure 217 (SEQ ID NO 510). 
respectively Optionally, the PR0213-1. PRO1330 and/or PR01449 polypeptide is obtained or obumablc by 
expressing dte polypepdde encoded by dte cDNA insert of the DNA30943-1 163 (ATCC 209791). DNA64907-, 163-1 
(ATCC 203242) or DNA64908-n63-l (ATCC 203243) 
5 m another aspect, the invention provides an Uolated PR0213-1 . PRO1330. and/or PR01449 pol>pep.ide. 

comprising an amino acid sequence having at least about 80% sequence ident.ty. preferably at leas, abou, 85% 
sequence idendty. more preferably a, least abou. 95% sequence identity to amino acid residues I to 295 of Ftgure 
213 (SEQ ID NO:506). 20 to 273 of Figure 215 (SEQ ID NO:508) or 20 to 273 of Figure 217 (SEQ ID NO:510). 
inclusive. 

,0 m ye. anoU,er embodiment, me invemion provides an isola.ed PR0213-I. PRO1330. and/or PR01449 

polypeptide compnsing ti,c amino acid residues 1 .o 295 of Figure 213 (SEQ ID NO:506). 20 to 273 of Figure 215 
(SEQ ID NO-508) or 20 u, 273 of Figure 217 (SEQ ID NO:510). or a fragment dtereof sufficient to provide a bindmg 
site for an anti-PR0213-l. anti-PRO1330 and/or anti.PR01449 antibody. Preferably, the PR0213-1. PRO1330. 
and/or PR01449 ftagmcm retains a qualitative biological activity of a native PR0213.1 . PRO1330. and/or PR01449 

15 polypeptide. 

in a fimher aspect, the i„v««ion concerns an isolated PR0213-1. PRO1330. and/or PR01449 polypeptide. 
comprUing an ammo acid sequence scoring at least about 80% positives, preferably at least abou. 85% posi.ives. 
more preferably at least about 90% positives, most preferably at leas. abou. 95% positives when compared w..h the 
amino acid sequence of residues 1 to 295 of Figure 213 (SEQ ID NO:506). 20 to 273 of Figure 215 (SEQ ID 
20 NO-508) and 20 to 273 of Figure 217 (SEQ ID NO:510), respectively. 

in still a further aspect, the invemion provides a polypeptide produced by (i) hybridizing a test DNA 
n^lecule under stnngen. conditions with: (a) a DNA molecule encoding a PR0213.1. PRO1330. and/or PR01449 
polypeptide having a« amino acid residues from 1 to 295 of Figure 213 (SEQ ID NO:506). 20 to 273 of Ftgure 215 
(SEQ ID NO-508) and 20 to 273 of Figure 217 (SEQ ID NO:510). respectively: or the complemem of the DNA 
25 molecute of (a), and if said test DNA molecute has at least about an 80% sequence idemity to (a) or (b). (i.) culmrmg 
a host ceU comprising said test DNA molecule under condiuons suitable for tf« expression of said polypept.de. and 
(iu) recovering said polypeptide Uom the cell culmre. 

Inone embodiment, the present invention concerns an isolated antibody which binds a PR0213-1. PRO1330 
and/or PR01449 polypeptide. In one aspect. Ute antibody .nduces dead, of a cell overexpressing a PR0213-1, 
30 PRO1330 and/or PR01449 polypeptide. In another aspect. U,e antibody is a monoclonal antibody, which preferably 
has nonhuman complemenurity determining region (CDR) residues and human framework region (FR) residues. 
The antibody may be labeled and may be immobilized on a solid support. In a farther aspect, the anttbody ,s an 
antibody fragment, a single-chain antibody, or an anti-idiotypic antibody. 

In another embodimem. the invention concerns a composition comprising an amibody which bmds a 
35 PR0213-1 PRO1330 and/or PR01449 polypeptide in admixmre with a pharmaceutically acceptable carrier. In one 
aspect, the composition comprises a therapeutically effective amount of die antibody. In anodter aspect. d,e 
cot^^sition comprises a farther active ingxediem. which may. for example, be a further antibody or a cytotoxic or 
chemoiherapeutic agent. Preferably, the composition is sterile. 
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In a further embodiment, the invention concerns nucleic acid encoding 3ii-PR02 13- 1 anti-PROl330 
and/or antl-PR01449 antibody, and vectors and recombinant host cells comprising such nucleic acid. 

•me mvemion funher concerns antagonists and agonists of a PR02I3-1. PRO1330 and/or PR01449 
polypeptide that inhibit one or more of the (uncuons or activities of a.e PR0213-.. PRO1330 and/or PR01449 
polypeptide. r^-v^i-wy 

5 In a further embodiment, the invention concerns isolated nucleic acid molecules that hybridize to the 

complemem of the nucleic acid molecules encoding Oie PR0213-1. PROI330 and/or PR01449 polypeptides TT« 
nucleic acid preferably is DNA. and hybridization preferably occurs under stringent conditions. Such nucleic acid 
motecules can aa as andsense molecules of the amplified genes identified herein, v^hich. in turn, can fmd use in U« 
modulation of the respective amplified genes, or as an.iser.e primers in amplification reactions. Furthennorc such 
sequences can be used as pa« of ribozyme and/or triple helix sequence which, in turn, may t« used in regulation 
of the amplified genes. 

In another embodiment, the invemion concerns a method for determining the preset^e of a PR0213 1 
PRO1330 and/or PR01449 polypeptide comprising exposing a cell suspected of conuuning the PR0213-1 PRO1330 
and/or PR01449 polypeptide to an anti-PR0213-l. PRO1330 and/or PR01449 antibody ^ determimng binding of 
15 the antibody to the cell. oi 

In yet another embodiment, the presem invention co«:ems a method of diagnosing mmor in a mammal 
compr^ing detecti.^ the level of expression of a gene encoding a PR0213-1. PRO1330 and/or PR01449 polypeptide 
(a)inatestsamp,eof tissue ecus Obtained from the mammal, and (b)m^ 

Of the same cell type, wherein a higher expression level in dte test sample indicates the presence of mmor i„ .he 
20 mammal from which the test tissue cells were obtained. 

In another embodiment, the present invention concerns a method of diagnosing tumor in a mammal 
composite (a, contacting ana„.i.PRQ213-l.anu-PRO1330 and/or anti^^^^ 

ceUs obtained from ti« mammal, ami (b) detectii^ the formation of a complex between the and-PR0213-l anti 
PRO1330 and/or and.PR01449 antibody a^l the PR0213-1. PRO1330 and/or PR01449 polypeptide in the test 
sample. The detection may be qualitative or quamitative. and may be performed in comparison with monitoring the 
complex formation in a comrol sample of known normal tissue cells of the same cell type. A larger quamity of 
complexes formed in the test sample indicates the prese«. of mmor in the mammal from which the test tissue cel. 
were Obtained, "nte antibody preferably carries a detectable label. Complex formation can be mommred for 
example, by hgh. microscopy, flow cytometry, fluorimetry. or other techniques known in the an. n« .est sample 
30 . usually obtained from an individual suspected to have neoplastic cell growth or proliferation (e.g. cancerous ceils) 
In another embodimem. the p«sem invention concerns a cancer diagnostic kit. comprising an anti-PR0213- 
1. a«ti-PRO1330 a«l/or anti-PR01449 antibody and a carrier (e.g. a buffer) in suiuble packaging. TIk «. preferably 
contams mstnictions for using the antibody to detect d« PR0213-1. PRO1330 and/or PR01449 polypeptide 

In ye, anoUKr embodiment, the invention concerns a method for inhibiting the growd, of mmor cells 
comp^ingexposingaceU which ovet^xpresses a PR0213-1. PRO1330 a«l/or PR01449 polypeptide to an effective 
amoum of an agent inhibiting the expression and/or activity of the PR0213-1. PRO1330 and/or PR01449 
polypepude. The agem preferably is an ami-PR0213-l. anti-PRO1330 and/or a„ti-PR01449 amibody a small 
organic and inorganic motecule. peptide, phosphopeptide. amUense or rib^yme molecule, or a triple helix molecule 
fa a specfic aspea. d« agem. e.g. ami-PR0213-l. anti-PRO1330 and/or ami-PR01449 amibody induces ceU dead." 
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In a funhcr aspect. *e auiio?cells are further exposed to radiation ireaimem and/or a cytotoxic or chcmotherapcuuc 
agent. 

m a funhcr cmbodimeni, ihe invention concerns an anicle of manufacture, comprising: 

a) a container; 

b) a label on the container; and 

5 c) a composition comprising an active agent contained within the container; wherein the composition is 
effective for inhibiting the growth of tumor cells, the label on die container indicates that the composition can be used 
for treating conditions characterized by overexpression of a PR0213-1 . PRO1330 and/or PR01449 polypeptide, and 
the acdvc agent in dte composition is an agent inhibiting the expression and/or activity of the PR0213-1. PRO1330 
and/or PR01449 polypeptKie. b a preferred aspect, the active agent is an anti-PR0213-l. an,i-PRO1330 and/or anti- 

10 PR01449 antibody. 

In yet a further embodiment, the invention provides a method for identifying a compotmd capable of 
inhibiting dte expression and/or activity of a PR0213-1. PRO1330 and/or PR01449 polypepdde. comprising 
contacting a candidate compound with a PR021M. PRO1330 and/or PR01449 polypeptide under conditions and 
for a time sufficient to allow these two componems to interact. In a specific aspect, either the candidate compound 
15 or the PR0213-1. PRO1330 and/or PR01449 polypeptide is immobilired on a solid support. In another aspect, the 
non-immobilized coraponem carries a detectable label. 
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85. mm ^. . 

AppUcants have identified a cDNA clone that encodes a novel polypeptide. TTie DNA ts designated m the 
present application as -DNA39975-1210". encoding a novel muUi-.ransmembrane protein, referred to as -PR0298". 

in one embodiment, .he invention provides an isolated nucleic acid molecule comprising DNA having at least 
about 80%. preferably at leas, about 85%. more preferably at leas, about 90%. most prefenbly at leas. abou. 95% 
sequence idemity to (a) a DNA molecule encoding PR0298, comprismg the sequence of amino acids I » 364 of 
Figure 219 (SEQ ID NO:515). or (b) the complement of *c DNA molecule of (a). In one aspect, the isolated m«:letc 
acid comprises DNA encoding a PR0298 polypeptide having amino acid residues 1 .o 364 of Figure 219 (SEQ ID 
NO:515). or is complememao^ to such encoding mtdeic acid sequence, and remains stably bound to it u«ler at least 
moderatt. and optionally, under high stringency conditions. 

In a fiinher embodiment, the invention concerns an isolated nucleic acid molecule comprising DNA havmg 
a. least an 80% sequence identity to (a) a DNA molecule encoding the same mamre polypeptide encoded by the 
human protein cDNA in ATCC Deposit No. 209783 PNA39975-1210). or (b) the complement of the DNA molecule 

**'^*^* In a still further embodimem. the invention concerns mideic acid which comprises a DNA molecule 
encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 209783 
(DNA39975-1210). 

I„a«Hher embodiment, the invention provides isolated PR0298 polypeptide. In particular, tite mventton 
provides isolated native sequence PR0298 polypeptide, which in one embodiment, includes an amino actd sequence 
comprising residues 1 to 364 of Figure 219 (SEQ ID NO:515). 

in another embodiment, me invention provides an expressed sequence tag (EST^ designated DNA26832 
comprising the nucleotide sequence of Figure 220 (SEQ ID NO:516). 
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86. £BQ222 

Appacan. have identified a cDNA Cone (DNA43316-1237) ^ encodes a ™,ve. poiypepdde, desig«..ed 
in ine presem application as "PR0337". 

m one embodiment. *e invenuon provides an .soiated nucleic acid molecule havmg a. leas, abou, 80% 
sequence identity to (a) a DNA molecule encoding a PR0337 polypeptide comprising the sequence of amino acids 
I.o344ofFigure 222 (SEQ1DN0:523, or (b).he complement Of tl^DNAmolecule 

preferably ,s about 85%. more preferably 90%. mc«t preferably about 95%. ,„ one aspect, the isolated nuc J 
acid bas «.e.. about 80% . preferably at lea.t abou. 85 %. more pre^^^^^^^ 
«^.abo«95(.ncluding 9a. 97.98 and 99%)seque„cei^^^^^ 

to 344 Of Ftgure 222 (SEQ ID NO:523). Preferably, the highest degree of sequence identity occurs within the 
tmmunoglobulin and major histocompatibility domatns (am.no acids 113 to 130 of Figure 222 SEQ ID NO-323) 

In a further embodimem. the .soiated nucleic acid molecule comprises DNA encoding a neurotrimin 
po.ypep«de having ami™, acid residues . ,o 344 of F.gure 222 (SEQ ID NO:523). or is comp.emenutry to such 
encoding nucleic acid sequence, and remains subiy bound to it u^ler at least moderate, and optionally, under high 

DNA43316-1237. depostted w.th the ATCC under accession number ATCC 209487. alternatively the coding 
sequence of clone DNA43316-1237. deposited under accession number ATCC 209487. 

In yet anod«r embodimem. the invendon p^vides .olated PR0337 polypepdde. In panicular. the invention 
provdes isolated native sequence PR0337 polypeptide, which in one embodimem. i«:ludes an ami«> acid sequence 
comprising residues , to 344 of Figure 222 (SEQ ID NO:523). Native PR0337 polypeptides with or without the 
oanve s.g„al sequence (amino acids , to about 28 in Figure 222 (SEQ ,D NO:523). and with or without the initiating 
mctluonine are specifically included. AltemaUvely. the invention provides a PR0337 polypepude encoded by the 
nucleic acid deposited under accession number ATCC 209487. 

In yet another embodiment, the invemion provides an expressed sequence tag (EST) comprising .he 
nucleotide sequences identified in Figure 223 as DNA42301 (SEQ ID NO:524). 

87. EBQm 

AppUcants have identified a cDNA clone (DNA55800-1263) that encodes a novel polypeptide. desig«..ed 
in the present application as "PRO403". 

In one embodiment, the invention provides an isolated nucleic acid molecule having a. least about 80% 
seque^e identity to (a) a DNA molecule encoding a PRO403 po.n,eptide comprising tite sequence of ammo acids 
no 73^ of Figu« 225 (SEQ D) NO:526). or (b) the complement of the DNA molecule of (a). Tl« sequence identity 
Inferably is abou. 85%. more p^ferably about 90%. most preferably about 95%. In one aspect, the isolated nucleic 
and has a. least abou, 80%. prefcntbly a. least about 85% . more preferably at least about 90%. and most preferably 
alcast abou. 95%sequence identity withapolypeptide having amino acid residues 1 to 736 of Figure 2^5 (SEQ ID 
NO:526). Preferably. U« highes. degree of sequence identity occurs within: (1) ti,e putative N-glycosylauttion sites 
(an^mo acid residues 132. ,36. 177. 237. 282. 349. 505, 598 ^ m; (2) Cys residues conserved wid> the Kell blood 
«^^infiunily(aminc^dresidues6S.70.88»^ 
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to a funher cmboSem. -solatcd nucleic acid molecule compnscs DNA encoding a PRO403 polypepude 
having anuno acid residues 1 to 736 of F.gure 225 (SEQ ID NO:526). or is complemenury «, such encoding nuclctc 
acid sequence, and renins subly bound .o i. under a. leas, moderate, and opuonally. under high s.nngency 
condidons. in anoO^er aspecc. U.c .nvemion provides a nucleic add of ^ full IcngO. protein of clone DNA55800- 
,263. deposi^d with d,e ATCC under accession number ATCC 209680. alternatively the coding sequence of clone 
DNA55800-1263. deposited under accession number ATCC 209680. 

tayet another embodiment, the mvention provides isolated PRO403 polypeptide. In particular, the mvemion 
™ovides isolated nadve sequence PRO403 polypeptide, which in one embodunent. .ndudes an amino acd sequence 
comprising residues 1 to 736 of Figure 225 (SEQ ID NO:526). Native PRO403 po.ypepudes with or the uuuatmg 
„„Uuo.une are speciftcally included. Alternatively, the invenuon provides a PRO403 polypepude encoded by the 
nucleic acid deposited under accession number ATCC 209680. 

In ye. another embodiment, the invention provides an expressed sequence tag (EST) and other sequence 
fragments compHsU. the nucleotide sequences identified herei„asDNA344.5 (Figures 226A-B;SEQ« 

DNA49830 (Figure 227 ; SEQ ID NO:528) and DNA49831 (Figure 228; SEQ ID NO:529). 



o*er embodimen« of d,e present invenuon. the invention provides vectors compnstng DNA encodmg 
a^yoftheaboveorhetowdescribedpolypepddes. A host cell comprising any such vector is also provided. Byway 
7e«m,le. thehostcells may be CHO cel.. E. ccli, or yeast. A process for producing any of the above or below 
described polypeptides is further provided and comprises culturing host cells under conditions suitable for expr«ston 
20 ofthedeslredpolypeptideandrecoveringthedesiredpolypepudefromthecellculture. 

,„ other embodiments, the mvention provides chimeric molecules comprising any of the above or below 
described polypeptides (used to a heterologous polypeptide or amino acid sequence. An ex«nple of such a dumertc 
molecule comprises any of the above or below described polypeptides h.sed to an epiu.pe utg seque«. or a Fc regKm 
of an immunoglobulin. 

25 taanotherembodimen.. the invention provides an antibody whidt speciftcally binds «, a^y of *e above or 

below described polypeptides. Optionally, the antibody is a monoclonal antibody. 

in ye. otiter embodiments, the invention provides oligonucleotide probes useful for isolating genomtc and 
cDNA nucleotide sequences, wherein those probes may be derived from any of the above or below described 
nucleotide sequences. 
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FFIFF PF^'-''"PT^^N ^PT^P PRAWINOS 
Rgu. 1 shows a nucleotide sequence (SEQ ID NO: I) of a native sequence PR02I3 cDNA. wherein SEQ 
ID N0:1 is a done designated herein as "UNQ187" and/or ■'DNA30943-1 163". 

Figure 2 shows the amiiK. acid sequence (SEQ ID N0:2) derived fiom tite codmg sequence of SEQ ID NO: 1 

35 ^'»-'";^;^^^^,^^^^,,^3,Q,ONO:6)ofanativ^ 

,D N0:6 is a done designa«d herein as ••UNQ241 " and/or "DN A39987.1 1 84" . , 
Rgure 4 shows .he amino acid sequence (SEQ ID N0:7) derived from the coding sequence of SEQ ID N0.6 

shown in Figoie 3. 
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F.gure 5 shows an EST nucJeoUdc sequence designated herein as DnH 3 (SEQ ID NO-8) 
F.gure 6 shows an EST nucleoUde sequence desig™„ed hereinas DNA36157 (SEQ ID NO-9,* 
F.gure 7 shows an EST nuc.eo.ide sequence designaied he«in as DNA28929 (SEQ ID NO-10, 

ID NO. 18 .s a clone designated herein as ••UNQ263- and/or ••DNA40625-1 189-. ^ 

5 amino acid sequence (SEO ID NO- l9^rf^ri„,Hf u 

N0:18 shown in Figure 8. ^ '"""^ ""^''^ °f ^EQ ID 

Figure .Oshowsanucleodde sequence (SEQ ,^ 
ID N0.27 « a clone des.gna.ed herein as "UNQ247" and/or ••DNA23318-121 1 ^ 

Rgure 11 shows the amino acid sequence fSEOmNH^fi^^- 
10 N0:27 shown in Figure 10. ^ ' '"^ ""^ ""^^ ^EQ ID 

Figure 12 shows an EST oucleodde sequence designa.ed herein as DNA.2982 (SEQ ,D NO-29) 
F-gure 13 shows an EST ™„:,eoude sequence designa.ed herein as DNA15886 (SEQ ,D N0:30 ) 
Figure 14 shows a nucleotide sequence (SEQ ID NO:35) of a native sequence PR0296 .OM a \ 

ID NO:35 is a Clone desig..^ herein as "UN0260. a^/or «DNA39979.r2r "^^^ 

Figure 15 shows ihc amino acid scouenr^ (Kvn m Krr^ -s^v ^ . . 
NO:35 Shown in Figure 14. ^ ''^ ^^"^ SEQ ID 

Fi^« 16showsanESTnuc.eo.de sequence designa^d hereinas DNA23020(SE^ 

Figure 17 shows an EST nucleodde sequence designated herein as DNA2I97, (SEQ ID NoJ 

ID N0:44 .s a clone designated herein as -UNQ291 " and/or »DNA40594-1233". ^ 

Figure 20 shows the amino acid sequence (SEQ ID NO 45) derived fmn, ,k ^• 
N0:44 shown in Figure 19. ^ denved from the codmg sequence of SEQ ID 

~5 ID N0.51 ,s a clone designa,«I herein as -UNQ317" and/or -DNA45416-I251 " 

No:sisr:L:r;r""""'"^"''"""^-^^^^ 

ID NO 5^""' f ' '''^ •''^«363 cDNA. wherein SEO 

ID NO:58 .s a clone designated herein as "UNQ318- and/or ••DNA45419.I252- 

" No:58Z:zr"""""^^'^"'"^""^""^-^-^ 

Rgure|«sh<«.a„ucteotidesequence(SEQDNO:63)ora.tiv^ 
ID NO:63 « a clone designated herein as ••UNQ437" and/or "DNA52594-1270" 

35 No..«s:r.ZT'""""'^'^"'"^^"^^^'*^^^--^^^ 

Figure 27 shows a nucleodde sequence (SEO ID Nn-fiR^ nf , 
roisin.«- . ^^^^"''^"•^*^°^*™«'^««<I"encePR0382cDNA. wherein SEO 

ID N0:68 « a clone desig«.«d herein as '•UNQ323" and/or -DNA45234.1277-. 

Figure 28 shows the amino acid sequence (SEQ ID NO-69) derived <r«» ^ 
NO:68 shown in Figure 27. * ^ u N0.69) denved from the cod»g sequence of SEQ ID 
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Figure 29 showsScleoude sequence (SEQ ID NO:73) of a native sequence PR0545 cDNA. wherein SEQ 
ID NO-73 is a clone designated hereui as -UNQ346" and/or ••DNA49624.1279" 

Figure 30 shows the amino acid sequence (SEQ ID NO:74) derived from the coding sequence of SEQ ID 

NO:73 shown in Figure 29. 

Figure 31 shows an EST nucleotide sequence designated herein as DNA13217 (SEQ ID NO:75). 
Figure 32 shows a nucleodde sequence (SEQ ID NO:84) of a nadve sequence PR06n cDNA. wherein SEQ 
ID NO-84 is a clone designated herein as "UNQSSS" and/or ••DNA48309-1280". 

Figure 33 shows the amino acid sequence (SEQ ID NO:85) derived from the coding sequence of SEQ ID 

NO:84 shown in Figure 32. 

Figure 34 shows a mK:leotide sequence (SEQ ID NO:89) of a native sequence FRO700 cDNA. whe«m SEQ 
ID NO-89 is a clone designated herein as "UNQ364- and/or ••DNA46776-1284-. 

Figure 35 shows the amino acid sequence (SEQ ID NO:90) derived from the coding sequence of SEQ ID 

NO:89 shown in Figure 34. 

Figure 36 shows a nucleotide sequence (SEQ ID N0:%) of a native sequence PRO702 cDNA. whercm SEQ 
ID N0.96 is a clone designated herein as ••UNQ366- and/or •DNA50980-1286". 

Figure 37 shows the amino acid sequence (SEQ ID NO:97) derived from the coding sequence of SEQ ID 

NO:96 shown in Figure 36. . 

Figure 38 shows a nucleodde sequence (SEQ ID NO:101) of a native sequence PRO703 cDNA. wherem 
SEQ ID NO:101 is a clone designated herein as ••UNQ367- and/or "DNA50913-1287-. 

Figure 39 shows the amino acid sequence (SEQ ID NO:102) derived from the coding sequence of SEQ ID 

20 NOilOl shown in Figure 38. . 

Figure 40 shows a nucleotide sequence (SEQ ID NO: 108) of a native sequence PRO705 cDNA. wherem 
SEQ ID NO:108 is a clone designated hereinas •UNQ369" and/or -DNA50914-1289". 

Figur. 41 shows the amino acid sequence (SEQ ID NO-.109) derived fiom the coding sequence of SEQ ID 

NO:i08 shown in Figure 40. ^ ^v,* v 

25 Figu«s 42A-B show a nucleotide sequence (SEQ ID N0:113) of a nadve sequence PRO708 cDNA. wherem 

SEQ ID NO:l 13 is a clone designated herein as -UNQ372" and/or •DNA48296-1292- . 

Figure 43 shows the amino acid sequence (SEQ ID NO: 1 14) derived from the coding sequence of SEQ ID 
N0:113 shown in Figures 42A-B. , 
Figure 44 shows a nucleodde sequence (SEQ ID N0:1 18) of a nadve sequence PRO320 cDNA. wheieu. 
30 SEQ ID N0:118 is a clone designated herein as "UNQ281" and/or -DNA32284-1307-. 

Figure 45 shows the amino acid sequence (SEQ ID NO: 1 1 9) derived from the coding sequence of SEQ ID 

NOillS shown in Figure 44. 

Figure 46 shows a nucleodde sequence (SEQ ID NO:123) of a nadve sequence PR0324 cDNA. wherem 
SEQ ID NO:123 is a clone designated herein as -UNQ285" and/or -DNA36343-1310". 
35 Figure 47 shows the amino acid sequence (SEQ ID NO: 124) derived from the coding sequence of SEQ ID 

NO: 123 shown in Figure 46. 

Figure 48 shows a nucleodde sequence (SEQ ID N0:131) of a nadve sequence PR0351 cDNA. wherem 
SEQ ID N0:131 is a clone designated herein as "UNQ308- and/or -DNA4057M315- 
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Figure 50 shows a nuclcwide sequence fSEO ID Nn- n« , 
SEO in NO . 1/; . ^ ^ "^"^O^SZ cDNA. wherein 

SEQ ID NO:136 ,s a clone designated henrin as -UNQ309" and/or •'DNA4I386-I316-. 

Figure 51 shows the amino acid sequence (SEQ ID NO- 137) derived frnm ,h- 
5 NO:136 shown in Figure 50. "7) dcnved from tf,e codmg sequence of SEQ ID 

SEQ ID NO: 144 .s a clone designated herein as -UNQ322" and/or "DNA44 194-1 317" 

NO: .44 s'i: iZ: " ^^^^ ^^^^^^ 

' SEO ID No'^r 7'"'^ '''' ' ''''' '^^'^^ »^NA. Wherein 

SEQ ID NO:149 ts a clone designated herein as -UN0326" and/or •'DNA45415-1318" 

Figure 56 shows an EST nucleotide sequence des.gna.ed herem as DNA23350 (SEQ ID NO 15 ,) 
Figure 57 shows an EST nucleotide sequence designated herein as DNA23536 (SEQ ID NO: .52) ' 
Figure 58 shows a nucleotide sequence fSEO ID wn- 1 <«\ «p - 
SEO ID NO K/; . . ''"^*^^^^'°N0.156) of a native sequence PRO540CDNA, wherein 

SEQ ID NO: .56 ,s a clone designated herein as -UNQ341 " and/or "DNA44.89..322'-. 

Figure 59 shows the amino acid sequence (SEQ ID NO- 157) derived fmm .k. . a- 
NO:.56 shown in Figure 58. "^""^ 

Figure 60 shows a nucleotide sequence fSEO ID Nn i/;n «»f . - 
SEOmwo iAi . . ^"^""^^^^'''N° l<*»ofa'"«'ve sequence PR0615 CDNA. wherein 

SEQ ID N0:161 .s a Cone designated herein as ■•UNQ352- and/or "DNA48304-1323-. 

Figure 61 shows the amino acid sequence (SEO ID NO i«\ t 

NO:.6. Shown in Figure 60. """^"^ "^^^^ '° 

Figure 62 shows a nucleotide sequence (SEO ID no- 1M^ 
SEOIDNO-168,,»c. . - ""'^^^^'^ NO '68) °f'»'»"ve sequence PR06I8CDNA. wherein 

SEQ ID N0.168 is a clone designated herein as "UNQ354" and/or "DNA49152.1324- . 

Figure 63 shows the amino acid sequence (SEO ID NO- 1 m\ m^^^a «. u 

NO:.68 Shown in Figure 62. "'"^ '° 

Figure 64 shows an EST nuCeotide sequence designated herein as DNA35597 (SEQ ID NO170) 
SEQ ID NO:177 ,s a clone designated herein as ••UN0387" a^l/or -DNA49646.1327" 

No:i77:Zirpt::"."""''""""'''''"''*^""'-'^'™*-"^^^ 

Figure 67 shows a nucleotide sequence fSEO ID NO- 1 a-iv - .• 
SEO ID NO. •« ... "^^^^'°'^° '82)ofa native sequence PR0724cDNA.wherem 

SEQ ID NO.I82 is a clone designated herein as "UNQ389" and/or ■•DNA4963I-1328-. 

Figure 68 shows the amino acid sequence (SEO ID NO. i a,.^«.a «. u 
NO:182 shown in Figure 67. ""^ '"""-^ '''^ 

Figure 69 shows a nucleotide sequence (SEO ID No- 1 bo^ «f . 
SEOIDNO iRQi. . . . ^ ™'^'**'*^'°'^0 '*')°f« native sequence PR0772CDNA. wherein 

SEQ ID N0.189 is a clone designated herein as "UNQ410" and/or "DNA49645-1347-. 
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Figure 70 shows the anuno ac.d sequence (SEQ ID NO:190) derived from ihe coding sequci.ce of SEQ ID 

NO: 189 shown in Figure 69. 

Figure 71 shows ai, EST nucleotide sequence designa«d herein as DNA43509 (SEQ ID N0:191). 
Figure 72 shows a nucleotide sequence (SEQ ID NO:195> of a nat.ve sequence PR0852 cDNA. wherem 
SEQ ID NO:195 is a clone designawd herein as •UNQ418" and/or •DNA45493-1349" 
5 Figure 73 shows the amino acid sequence (SEQ ID NO:196) derived from the coding sequence of SEQ ID 

NO: 195 shown in Figure 72. 

Figure 74 shows a nucleotide sequence (SEQ ID NO:205) of a native sequence PR0853 cDNA. wherem 
SEQ ID NO:205 is a clone designated herein as •■UNQ419" and/or -DNA48227-1350-. 

Figure 75 shows the amino acid sequence (SEQ ID NO:206) derived from the coding sequence of SEQ ID 

10 NO:205 shown in Figure 74. 

Figures 76A.B show a nucleotide sequence (SEQ ID NO:210) of a native sequence PRO860 cDNA. wherein 
SEQ ID NO:210 is a clone designated herein as "UNQ42r and/or "DNA41404-1352-. 

Figure 77 shows the amino ac.d sequence (SEQ ID N0:21 1) derived from the coding sequence of SEQ ID 

NO:210 shown in Figures 76A-B. 
,5 Figure 78 shows a nucleotide sequence (SEQ ID NO:215) of a native sequence PR0846 cDNA. wherein 

SEQ ID NO:2l5 is a done designated herein as ••UNQ422- and/or "DNA44196-1353'. 

Figure 79 shows the amino acid sequence (SEQ ID NO:216) derived from the coding sequence of SEQ ID 

NO:215 shown in Figure 78. 

Figure 80 shows a nucleotide sequence (SEQ ID NO:220) of a native sequence PR0862 cDNA. wherem 
-50 SEQ ID NO:220 is a clone designated herein as "UNQ424- and/or •DNA52187-1354-. 

Figure 81 shows the amino acid sequence (SEQ ID NO:221) derived from the coding sequence of SEQ ID 

NO:220 shown in Figure 80. 

Figure 82 shows a micleotide sequence (SEQ ID NO:225) of a native sequence PR0864 cDNA. wherem 
SEQ ID NO:225 is a clone designated herein as -UNQ426" and/or "DNA48328-1355-. 
25 Figure 83 shows the amino acid sequence (SEQ ID NO:226) derived from the coding sequence of SEQ ID 

NO:22S shown in Figure 82. 

Figure 84 shows a micleotide sequence (SEQ ID NO:230) of a native sequence PR0792 cDNA. wherein 
SEQ ID NO:230 is a clone designated hcrem as -•UNQ43r and/or -DNA56352-1358". 

Figure 85 shows d« amino acid sequence (SEQ ID NO:231) derived from the coding sequence of SEQ ID 

30 NO:230 shown in Figure 84. 

Figure 86 shows a nucleotide sequence (SEQ ID NO:235) of a native sequence PR0866 cDNA. wherein 
SEQ ID NO:235 is a clone designated herein as "UNQ435" and/or •DNA53971-1359". 

Figure 87 shows the amino acid sequence (SEQ ID NO:236) derived from the coding sequence of SEQ ID 
NO:235 shown in Figure 86. 

35 Figure 88 shows a micleonde sequence (SEQ ID NO:244) of a native sequence PR0871 cDNA. wherem 

SEQ ID NO:244 is a clone designated herein as -UNQ438- and/or -DNA509 19-1 361". 

Figure 89 shows the amino acid sequence (SEQ ID NO:245) derived from the coding sequence of SEQ ID 
NO:244 shown in Figure 88. 
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se, ,D ^'"^3'° r T'""* ^"O'" 

SEQ ID N0.253 ,s . cto„ dea|p»KJ hercm » -UNQ«0- a*., •ONA44I79.1362- 

NO «, s*"" "^'^ ,SEQ m NO-254, *H«^ fto™ «„ CKtas SEQ ,D 

NO:253 shown m Figure 90. 

SEQ ID N0.258 .s a clone dcs.giu„cd herein as -UNQ477- awl/or "DNA54002-1367- 

MO T '''^ "^^"'^ '''''' ^'""^ «'^«- Of SEQ ID 

NO:258 shown in Figure 92. 

SEO ,D NOoT *r 7""* »" "*"^»« "<0«l =ONA. 

SEQ ID N0.263 „ a clone des,g,»,ad herein as ■UNQ478- ,M„, •DNA53906.1368- 

NO:263 shown in Figure 94. 

Figure 96 shows an EST nucleoUde sequence designa.ed herein as DNA6415 (SEQ ID N0 265) 

SEQ ID N0.269 .s a clone designated herein as -UNQ48r and/or "DNA52 185- 1370" 

Figure 99 shows an EST nucleotide sequence designated herein as DNA14007 (SEQ ID NO-27n 
Figure 100 shows an EST nucleotide sequence designated herein as DNA12773 (SEQ ID NO-272) 
Figure 101 shows an EST nucleotide sequence designated herein as DNA12746 (SEQ ID NO-273)' 
Figure 102 shows an EST nucleotide sequence designated herein as DNA12834 (SEQ ID NO-274) 
Figure 103 shows an EST nucleotide sequence designated herein as DNA12846 (SEQ ID N0 275) 
Figure 104 shows an EST nucleotide sequence designa.ed herein as DNA13104 (SEQ ID NO-276)' 
Figure 105 shows an EST nucleoUde sequence designated herein as DNAI3259 (SEQ ID NO-277) 
Figure 106 shows an EST nucleotide sequence des.gna.ed herein as DNA13959 (SEQ ID NO-278)' 
Figure 107 shows an EST nucteotide sequence designated herein as DNAI3961 (SEQ ID NO:279)' 
Figure 108 shows a nucleotide sequence fSEO in Nn os-jy . . 
SEOIDIMO mi. , . . '''"''*'*^^'°N0.283) of a native sequence PR0983CDNA. wherein 

SEQ ID N0.283 « a clone designated herein as -UNQ484" and/or "DNA53977-1371 " 

N0.283 '"I'"" ""'^^^ Of SEQ ID 

NO:283 shown in Figure 108. 

30 Figure 1 10 shows an EST nucleotide sequence designated herein as DNA17130 (SEQ ID NO 285) 

Figure 1 1 1 shows an EST nucleotide sequence designated herein as DNA23466 (SEQ ID NO-286) 
Figure 1 ,2 shows an EST nucleotide sequence designated herein as DNA26818 (SEQ ID NO-287)' 
Figure 113 shows an EST nucleotide sequence designated herein as DNA37618 (SEQ ID NO-288)' 
Figure 1 .4 shows an EST nucleotide sequence designated herein as DNA41732 (SEQ ID NO-289)' 
Figure 1 15 shows an EST nucleotide sequence designated herein as DNA45980 (SEQ ID NO-290)' 
Figure 1 16 shows an EST nucleotide sequence designated herein as DNA46372 (SEQ ID NO-291)' 

SEQ ID NO:295 ts a clone designated herein as -UNQ522- and/or -DNA57253-1382". 
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Figure 1 18 shows amino acid sequence (SEQ ID NO:296) derived from a,e coding sequence of SEQ ID 

NO:295 shown in Figure 117. 

Figure 119 shows a nucleotide s«n«nce (SEQ ID NO:300) of a native sequence PRO107I cDNA. wherem 
SEQ ID NO-300 is a clone designated herein as ••UNQ528- and/or -DNA58847-1383" 

Figure 120 shows (he amino acid sequence (SEQ ID NO:301) derived from the codmg sequence of SEQ ID 

NO:300 shown in Figure 119. 

Figure 121 shows a nudeodde sequence (SEQ ID NO:302) of a native sequence PRO1072 cDNA. wherem 
SEQ ID NO:302 is a clone designated herem as "UNQ529" and/or "DNA58747.I384- 

Figure 122 shows die amino acid sequence (SEQ ID NO:303) denved from the coding sequence of SEQ ID 

NO:302 shown in Figure 121 

Figure P3 shows an EST nucleoude sequence designated herein as DNA40210 (SEQ ID NO:304). 
Figure 124 shows a nucleotide sequence (SEQ ID NO:308) of a native sequence PRO1075 cDNA. wherein 
SEQ ID NO:308 is a clone designated herein as ■'UNQ532" and/or "DNA57689-1385- . 

Figure 125 shows the amino acid sequence (SEQ ID NO:309) derived from the coding sequence of SEQ ID 

NO:308 shown in Figure 124. 

Figure 126 shows an EST nucleotide sequence des.gnated herein as DNA13059 (SEQ ID NO:310). 
Figure 127 shows an EST mKleoude sequence designated herein as DNA19463 (SEQ ID NO:3l I). 
Rgure 128 shows a nucleotide sequence (SEQ ID NO:321) of a native sequence PR0181 cDNA. wherem 
SEQ ID NO:32I is a clone designated herein as -UNQI55" and/or "DUA22230-im' 

Rgurt 129 shows the amino acid sequence (SEQ ID NO:322) derived from the coding sequence of SEQ ID 

''O NO:321 shown in Figure 128. 

Figure 130 shows an EST nucleotide sequence designated herein as DNA13242 (SEQ ID NO:323). 
Figure 131 shows a nucleotide sequence (SEQ ID NO-.329) of a nattve sequence PR0195 cDNA. wherem 
SEQ ID NO:329 is a clone designated herem as -UNQ169" and/or •DNA26847-1395". 

Figure 132 shows the amino acid sequence (SEQ ID NO:330) derived from the coding sequence of SEQ ID 

25 NO:329 shown in Figure 131. ^.^v,««,. 
Figure 133 shows an EST nucleotide sequence designated herein as DNA15062 (SEQ ID NO:331). 
Figure 134 shows an EST nucleotide sequence designated herein as DNA13199 (SEQ ID NO:332). 
Figure 135 shows a nucleotide sequence (SEQ ID NO:336) of a native sequence PR0865 cDNA. wherem 
SEQ ID NO:336 is a clone designated herein as -UNQ434" and/or -DNA53974-140r. 

30 Figure 136 shows d>e amino acid sequence (SEQ ID NO:337) derived from the codmg sequence of SEQ ID 

NO:336 shown in Figure 135. 

Figure 137 shows an EST nucleotide sequence designated herein as DNA37642 (SEQ ID NO:338). 
Figure 138 shows a nucleotide sequence (SEQ ID NO:345) of a native sequence PR0827 cDNA. wherem 
SEQ ID NO:345 is a clone designated herein as "UNQ468- and/or ••DNA57039.14O2- . 
35 Rgure 139 shows the amino acid sequence (SEQ ID NO:346) derived from the coding sequence of SEQ ID 

NO:34S shown in Figure 138. 

Figure 140 shows an EST nucleotide sequence designated herem as DNA47751 (SEQ ID NO:347>. 

Rgure 141 shows a nucleotide sequence (SEQ ID NO:351) of a native sequence PROl 1 14 cDNA. wherem 
SEQ ID NO:351 is a clone designated herem as "UNQSS?" and/or -DNA57033-1403- 
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Figmt 142 shows aramo add sequence (SEQ ID NO:352) derived fronSoding sequence of SEQ ID 
NO:351 shown in Figure 141 

Figure 143 shows an EST nucleotide sequence designated herein as DNA48466 (SEQ ID NO-353) 
Figure 144 shows a nucleotide sequence (SEQ ID NO:357) of a native sequence PR0237 cDNA. whereu, 
SEQ ID NO:357 is a clone designated herein as "UNQZll" and/or -DNA34353-1428-. 

Figure 145 shows .he amino acid seque«:e (SEQ ID NO:358) derived from the coding sequence of SEQ ID 
NO:357 shown in Figure 144. 

.Pn ,n mITJ''''"" ' "'""^ ^''^ '^^'''^ ""'^^ '^''"'"« "'^O^' ^^NA. wherem 

SEQ ID NO:362 is a clone designated herein as "UNQ342" and/or "DNA45417-1432". 

Figure 147 shows the amino acid sequence (SEQ ID N0.363) derived from the coding sequence of SEQ ID 
NO:362 shown in Figure 146. 

Figure 148shows a nucleotide sequence (SEQ ID NO:369) of a native sequence PR0273 cDNA wherem 
SEQ ID NO:369 is a clone designated herein as •UNQ240" and/or "DNA39523-1 192". 

Figure '^9showstheaminoacidsequence(SEQIDN0.370)derivedfromthecodingsequenceof SEQID 
NO:369 shown In Figure 148. 

SPO ,n ' '''^ °' ' "^O^O' ^"^NA. Wherein 

SEQ ID NO:374 is a clone designated herein as 'WQ36S" and/or •'DNA44205-I285". 

Figure 151 shows the amino acid sequence (SEQ ID NO:37S) derived from the coding sequence of SEQ ID 
NO:374 shown in Figure 150. 

Figure 152 shows a nucleotide sequence (SEQ ID NO:379) of a native sequence PRO704 cDNA wherein 
SEQ ID NO:379 is a clone designated herein as "UNQ368" and/or "DNA5091 1-1288" . 

Mn ''^ ""^"^ ^'^^ "'"-■'''^ """''^ ^""^ ^ «1"«nce of SEQ ID 

NO:379 shown in Figure 152. 

Figure 154 Shows a nucleotide sequence (SEQ ID NO:384) of a native sequence PRO706 cDNA wherem 
SEQ ID NO:384 is a clone designated herein as "UNQ370" and/or •DNA48329-I290". 

Figure •55shows.heaminoacidsequence(SEQIDNO:385)derivedfrom.hecodingsequenceofSEQID 
NO:384 shown in Figure 154. 

.Pn m .inir ' ^'^^ '° ^^^'"'^ "^"^ ""^^'^ '^'^NA. wherein 

SEQ ID NO:389 is a clone designated herein as "UNQSTI " and/or •DNA48306-I291 

,0 MO ^'^^ "^^'^^ from the coding sequence of SEQ ID 

JO NO:389 shown in Figure 156. 

.Pn m mITJ'* ' """"" '''^ ''•^0322 cDNA. wherein 

SEQ ID NO:394 is a clone designated herein as "UNQ283- and/or "DNA48336-1309". 

"'^^ Shows the amino acid sequence (SEQ ID NO:395) derived from .he coding sequence of SEQ ID 
NO:394 shown in Figure 158. 

5:po ,n ^J""""" ' ^''^ "^^'''^ "^'"""^^ '•«0526 cDNA. wherein 

SEQ ID NO:399 is a clone designated herein as "UNO330" and/or "DNA44184-1319-. 

Mn ''"^ 1« ^"'^ «he amino acid seque«:e (SEQ ID NO:400) derived from *e coding seque«e of SEQ ID 
NO:399 shown in Figure 160. 
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F.gure 162 showSuclcotidc sequence (SEQ ID NO:404) of a nanve sequence PR0531 cDNA. whcrcm 
SEQ ID NO:404 is a Cone designated herem as "UNQSaa" and/or »DNA48314-1320" . 

Figure 163 shows U,e amino acki sequence (SEQ ID NO:405) demed from U,e coding sequence of SEQ ID 

NO-404 shown in Figure 162. 

Rgure 164 shows a nucleoode sequence (SEQ ID NO:409) of a native sequence PR0534 cDNA. wherem 
5 SEQ ID NO:409 is a clone designated herein as -UNQ335" and/or -DNA48333-1321 - 

Figuie 165 shows the amino acid sequence (SEQ ID N0:41W derived from the coding sequence of SEQ ID 

NO:409 shown in Figure 164. 

Figu« 166 Shows a nucleotide sequence (SEQ ID NO:414) of a native sequence PR0697 cDNA. whereu. 
SEQ ID NO:414 is a clone designated herein as "UNQ361 " and/or "DNA50920-1325". 
,0 Figure 167 shows the amino acid sequence (SEQ ID NO:415) derived from the coding sequence of SEQ ID 

N0'414 shown in Figure 166. . 

Figure 168 shows a nucleotide sequence (SEQ ID N0:419) of a native sequence PR0717 cDNA, wherem 
SEQ ID NO:419 is a clone designated herein as "UNQSSS" and/or •DNA50988-1326" . 

Figure 169 shows the amino acid sequence (SEQ ID NO:420) derived from the coding sequence of SEQ ID 

15 NO:419 shown in Figure 168. ^^^^^ , r.vi a 

Figures I70A-B show a nucleotide sequence (SEQ ID NO:424) of a native sequence PR0731 cDNA. 
Wherein SEQ ID NO:424 is a clone designated herein as -UNQ395" and/or "DNA4833 M329". 

Figure 171 shows the amino acid sequence (SEQ ID NO:425) derived from ti,e codmg sequence of SEQ ID 

NO-424 shown in Figures 170A-B. 
20 ■ Figure 172 shows a nucleotide sequence (SEQ ID N0.429) of a native sequence PR0218 cDNA, whcrcm 
SEQ ID NO:429 is a clone designated herein as ••UNQ192" and/or -DNA30867-I335-. 

Figure 173 shows the amino acid sequence (SEQ ID NO.430) derived from the codi.« sequence of SEQ ID 

NO:429 shown in Figure 172. 

Figure 174 shows an EST nucleotide sequent designated herein as DNA14472 (SEQ ID NO:431). 
Figurel75showsanESTnucleotideseque«edesigna.edhereinasDNA15846(SEQIDN^^^^^ 

Figures 176A-B show a nucleotide sequence (SEQ ID NO:436) of a native sequence PR0768 cDNA. 
.herein SEQ ID NO:436 U a clone designa«d herein as ••UNQ406- and/or "DNA55737-.345". 

ng„« 177 shows the amino acid sequence (SEQ ID NO:437) derived from the codmg sequence of SEQ ID 

NO:436 shown in Figures 176A-B. ^^^^ ^^^^ ^^^^ 

30 Figure 178 shows a nucleotide sequence (SEQ lU ^u.^i) oi a iwu m 

SEQ ID NO:441 is a clone designated herein as ••UNQ409" and/or -DNA49829-1346" . 

Figure 179 shows the amino acid sequence (SEQ ID NO:442) derived from .he coding sequence of SEQ ID 

NO:441 shown in Figure 178. DDnw rnNA 

Figures 180A-B show a mKleotidc sequence (SEQ ID NO:446) of a nauve sequence PR0733 cDNA. 
35 Wherein SEQ ID NO:446 is a done designa.«l herein as "UNQ*!!" and/or .DNA52196-1348- 

Figure 181 Show, the amino acid sequence (SEQ ID NO:447) derived from the codmg sequence of SEQ ID 

"="C:::i:";Ldese^e(SEQIDK0..4Sl>ofanatives 
SEQ ID NO:451 is a clone designated herein as "UNQ429" andyor •DNA46965.1356" . 
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NO 45 . T ^''^ rroycoding sequence of SEQ ,D 

NO;451 shown in Figure 182. 

SEQ JD NO:4S3 „ . clon. d„ w h.„i„ „ -UNQIX- ^„ -DNASWS-lSSr 
5 NO:453 shown m Figure 184. ^ * -^cv lu 

SEQ ID N0.455 u a clone des.gna.ed herein as "UNQ492" and/or "DNASTSSO-ISTS" 

NO h"^ ."I"""' ''''^ '° ^'""^ '^^•^^ Of SEQ ID 

NO:455 shown m Figure 186. ^ 

«0 Figure 188 shows an EST nucleoUde seq««Ke designated herein as DNA16508 (SEQ ID NO 457) 

F.gurcs ,89A-B show a nucleotide seque«:e (SEQ ID NO:458) of a native sequence PROIOP cDNA 
whe«u. SEQ ID NO:458 is a clone destgnated herein as -UNQ495- and/or •DNA56439-1376" 

NO 458 T ^^^'''^ ^^"^ «0"-e Of SEQ ID 

NO:458 shown in Figures 1 89A-B. 

SEQ ID N0.463 .s a clone designated herein as "UNQ497" and/or ■•DNA56409-1377-. 

Rgure 192 shows the amino acid sequence (SEQ ID NO-464> derived frnm ,h. 
MA At^y u ■ r-. y «iv nu.w»> oerived from the coding sequence of SEQ ID 

NO:463 shown in Figure 191. i • lu 

SEQ ID N0.465 » a clone designated herein as "UNQSOO" and/or "DNASei 12-1379" 

NO 4., (SEQ ID NO:466) derived from the coding sequence of SEQ ID 

NO:465 shown in Figure 193. 

SEO ID No"^, T " ^'''^ '° ""^^'^^^ 

J.EQ ID N0.467 » a clone designated herein as "UNQSiK" and/or "DNA56045-1380-. 

Figure 196 shows the amino acid sequence (SEQ ID NO-468) deriv<.rf frnm ,1.. 
N0:467 shown in Figure 195. « ° '^0 *68) <"^-ed from the codmg sequence of SEQ ID 

SEQ ID N^r T ' ""^^ ^ "'^^ ""^^'^^^ •^'^NA. Wherein 

SEQ ID N0:469 is a clone designated herein as "UNQ516' and/or "DNA59294-1381" 

30 N0:4«rrFirri".^""^''''"""""""^ 

SEO ID N^ll^^T ' """" ^''^ "^"^"^ <=^NA, whc^u, 

SEQ ID N0:471 „ a clone designated herein as "UNQ475- and/or •'DNA56433.1406". 

NO:471 shown in Figure 199. 

SEQIDN^r'T'''''^'^''^''^'''''^^'''^^ 
ftQ ID N0:476 is a clone designated herein as ■■UNQ539" and/or -DNA5391M457". 

N0.4,. 

NO:476 shown ui Figure 201 . ^ lu 
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Figure 203 shows a nucieodde sequence (SEQ ID NO:482) of a „a«ve sequence PRO1083 cDNA. whcrcn 
SEQ ID NO:482 is a clone designated herein as "UNQ540- and/or •DNA50921-1458" 

Figure 204 shows d. amino acid sequence (SEQ ID NO:483) derived from d« coding sequence of SEQ ID 

NO'482 shown in Figure 203. 

Figure 205 shows an EST nucleotide sequence designated hereta as DNA24256 (SEQ ID NO:484). 
Figure 206 shows'a nucleotide sequence (SEQ ID NO:487) of a native sequence PRO200 cDNA. wherein 
SEQ ID NO:487 Is a clone designated herein as -UNQ174- and/or •DNA29101-1122-. 

Hgu« 207 Shows the amino acid sequence (SEQ ID NO:488) derived from .he coding seque«:e of SEQ ID 

NO*487 shown in Figure 206. 

■ Fip«e 208 shows a nucleotide sequence (SEQ ID NO:495) of a native sequence PR0285 cDNA. wherem 

SEQ ID NO:495 U a clone designated herein as "DNA4002M154" 

Figure 209 shows the amino acid sequence (SEQ ID NO:496) derived from the codmg sequence of SEQ ID 

NO:495 shown in Figure 208. odaojii^ /.hna 

Figures 210A-B show a nucleotide sequence (SEQ ID NO:497) of a nauve sequence PR0286 cDNA. 
whereinSEQIDNO:497isaclonedesigna.edhereinas-DNA42663-1154'. ,^,^,,ce0ID 
Figure 21 1 shows the amino acid sequence (SEQ ID NO:498) derived from the codmg sequence of SEQ ID 

NO:497 shown in Figures 210A-B. 

212 Shows a nudeodde sequence (SEQ ID NO:505) of a native seque«e PR0213-1 cDNA. wherem 

SEQ ID NO:505 is a clone designated herein as •DNA30943.1-il63-l-. 

Rgure 213 Shows the amino add sequence (SEQ ID NO:506) derived from the codu^ 

""^::::irws?n.leo.esequence(^ 

SEQ ID NO:507 is a clone designated herein as ••DNA64907.1 163-1". 

Figure215 shows d. ^ acid sequetKe (SEQ ID NO:508) derived from the codmg sequence of SEQ ID 

, ^°=":::::r:i^deseque«e(SEQiDNo:5o^^ 

SEQ ID NO:509 is a clone designated herein as -DNA64908.1 163-1-. 

Fig«re217shows.heamino«:.dseq»e«x(SEQIDNO:510)derivedfrom.hecodmgsequ^^ 

NO:509 shown in Figure 216. ^^.3, 3,^,^e PR0298 cDNA. wherein 

Figure 218 shows a nuclcondc sequence (SEQ ID NU,3 oi a lun h 
30 SEQ ID N0:514 is a done designated herein as -UN026r and/or -DNA399^^^ 

. 'A ..^.Pnri- (^RO ID NO-5 15) derived from the coding sequence of SEQ ID 

Figure 219 shows the ammo acid sequence (SEQ W lo) wu 

N0:514 shown in Figure 218. 

Figure 220 shows an EST nucleotide sequence designated herein as DNA26832 (SEQ ID NO:316). 
Fill 221 shows a nucleotide sequence (SEQ ID NO:522, of a nauve sequence PR0337 cDNA. wherem 

35 SEQ ID NO:522 is a clone designated herein as -DNA43316-1237-. .«,«nin 

/CCA m Mn W3^ derived from the coding sequence of SEQ ID 
Figure 222 shows the anano acid sequence (SEQ ID N0.523) oenveo irom 
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5EQ ID NO:525 is a clone designated herein as "DNASSgOO-iaW". 

MO k"" ^''^ '^'^ ^^i^nce of SEQ ,D 

NO:525 shown in Figure 224. 

Figures 226A-B show an EST nucleotide sequence designated herem as DNA34415 (SEQ ID NO-527) 
Figure 227 shows an EST nucleotide sequence designated herein as DNA49830 (SEQ ID NO S28) 
Figure 228 shows an EST nucleotide sequence designated herein as DNA49831 (SEQ ID NO:529).' 

PETA1I.FP OF Sr RiFnQN OPTHF PffFFPPPFn PMp ^ p„ .. , p,. |T c , 
I- Definjripn <j 

Tl« terms 'PRO polypeptide" and -PRO- as used herein and when immediately followed by a numerical 
designadon refer to various polypeptides, wherein the complete designation (i.e.. PRO/number) refers to specific " 
polypeptidesequencesasdescril^lhereia Tlie terms TRO/numberpol^^^^^^^^^^ 

encompass native sequence polypeptides and polypeptide variants (which are further defined herein). TT,e PRO 

polypepudes described here^ may be isolated fromavanety of sources, such as fromh^ 
another source, or prepared by recombinant or synthetic methods. 

A "native sequence PRO polypeptide" comprises a polypeptide having the same ami«. acid seque«:e as the 
cor^spo^lii^PROpolypeptide derived fromnamre. Such native sequence PRO polypeptides can be isolated f.om 
nam« or can be produced by recombinant or syndetic means. ^ term "native sequence PRO polypeptide" 
specifically encompasses naturally-occurring .™„cated or secreted fonns of the specific PRO po.ypep.i<te (. , an 
extracellula. domain sequence), namrally-occurring variant forms (e.,.. alternatively spliced forms) and nati^ly. 
occur™, aMc vanams of .he polypeptide. In various embodiments of d.e invention, the native sequence PR0213 
polypeptide isamature or fuH-lengd, native sequence PR0213pol^^^^^^ , to 295 of Figure 

(SEQ ID N0:2). die native sequence PR0274 polypeptide is a mature or Ml-lengti, native sequence PR0274 
polypepnde comprising amino acids 1 to 492 of Figute 4 (SEQ ID N0:7). the native sequence PRO300 polypeptide 
™orfi,U.leng.h native sequence PR 

ID NO: 19). the nanve sequence PR0284 polypeptide is a mamre or fiUl-lengU. native sequence PR0284 polypeptide 
comprising amino acids 1 to285of Fieuie 11 ^«?Pftin Nn-iBv .u. • 

^ " ^^^^ ^ ^°-28>- """ve sequence PR0296 polypeptide is a raamre 

or fiiU-length nanve sequence PR0296 polypeptide comprising amino acids I to 204 of Figure 15 (SEQ ID NO-Sfi) 
*e native sequence PR0329 polypeptide is a mature or fiill-lengti, native sequence PR0329 polypeptide comprising 
a^ac.dslto359 Of Figure 20 (SEQ ID NO:45)..e native seque.ePR0362po.ypeptid^^^ 
icngm native sequence PR0362 polypeptide comprising amino acids 1 to 321 of Figure 22 (SEQ ID NO-52) ti« 
native ,e,ue«» PR0363 po„T>eptide is a mamre or fiiU-length native sequence PR0363 polypeptide comprising 
anuno acids 1 to 373 Of r.g«e 24 (SEQ ID N0..59). die native sequence PR0868 polypeptide 
length nanve sequence PR0868 polypeptide comprising amino acids 1 to 655 of Figure 26 (SEQ ID NO-64) the 
native sequence PR0382 polypeptide is a mam„ or fiiU-Iength native sequence PR0382 polypeptide comprising 
^nno acid. I to 453 of Pi^ 28 (SEQ ID NO:69). .tive sequence PR0545 polypeptide 
length nanve sequence PR0545 polypeptide comprising ami«, acids 1 «> 735 of Figure 30 (SEQ ID NO-74) the 
native sequence PR06,7 polypeptide is a manire or fiUMer^Ui native sequence PR06I7 polypeptide comprising 
aminoac.dslto67ofFigure33(SEQmNO:85).d««uive sequence PRO700^^^^^ 
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native sequence PRO70O polypepude comprismg amino acids I to 432 of Figure 35 (SEQ ID NO:90). the native 
swpttnce PRO702 polypeptide is a mature or fuU-length native sequence PRO702 polypeptide comprising amino acids 
1 ,0 277 of Figure 37 (SEQ ID NO:97). the native sequence PRO703 polypeptide is a mature or full-length native 
sequence PRO703 polypeptide comprising amino acids 1 to 730 of Figure 39 (SEQ ID NO:102). the native sequence 
PRO705 polypeptide is a mature or fiiH-Iength native sequence PRO705 polypeptide comprising am.no acids 1 to 555 
5 of Figure 41 (SEQ ID NO:109). the native sequence PRO708 polypeptide is a mawre or full-length native sequence 
PRO708 polypeptide comprising amino acids 1 to 515 of Figure 43 (SEQ ID N0:1 14). the nadve sequence PRO320 
polypeptide is a mature or fuU-length native sequence PRO320 polypeptide comprising amino acids I to 338 of Figure 
45 (SEQ ID N0:119). the native sequence PR0324 polypeptide is a mamre or full-length native sequence PR0324 
polypeptide comprising amino acids 1 to 289 of Figure 47 (SEQ ID NO:124). the native sequence PR0351 
10 polypeptkle is a mature or M-length native sequence PR0351 polypeptide comprising amino ac.^ I to571 ofFtgure 
49 (SEQ ID N0:132). the native sequence FR0352 polypeptide is a mature or full-length native sequence PR0352 
polypeptide comprising amino acids 1 to 316 of Figure 51 (SEQ ID N0:137). the native sequence PR0381 
polypeptide is a mamre or lim-length native sequence PR0381 polypeptide comprising amino ac^ 1 to 211 of Figure 
53 (SEQ ID NO:I45). the native sequence PR0386 polypeptide is a mature or full-length native sequence PR0386 
15 polypepude comprising amino acids I to 215 of Figure 55 (SEQ ID NO:150). d« native sequence PRO540 
polypeptide is a mature or fidl-lengd. native sequence PRO540 polypeptide comprising amino adds 1 to 412 of Figure 
59 (SEQ ID NO:157). the native sequence FR0615 polypeptide is a mamre or full-length native sequence PR0615 
polypeptide comprising amino acids I to 224 of Figure 61 (SEQ ID NO:162). the native sequence PR0618 
polypeptide is a mamre or luU-length nadve sequence PR0618 polypeptide comprising amino acids 1 to 802 of Ftgure 
20 63 (SEQ ID NO:169). the native sequence PR0719 polypeptide is a mamre or full-length native sequence PR0719 
polypeptide comprising amino acids 1 to 354 of Figure 66 (SEQ ID NO:I78). the «ttive sequence PR0724 
polypeptide is a mamre or full-length native sequence PR0724 polypeptide comprising amino acids I to 713 of Ftguie 
68 (SEQ ID NO. 183). the native sequence PR0772 polypeptide is a mawre or fuU-lengti. native sequence PR0772 
polypeptide comprising amino acids 1 to 152 of Figure 70 (SEQ ID NO:190). the native sequence PR0852 
polypeptide isamature or fiUl-lenglh native sequence PR0852 polypeptide comprising 1 to 518 of Figure 

73 (SEQ ID NO:196). the native sequence PR0853 polypeptide is a mamre or fuH-length native sequence PR0853 
polypeptide comprising amino acids 1 to 377 of Figure 75 (SEQ ID NO:206). the native sequence PRO860 
polypeptide is a matuie or full-lengd. native sequetKC PRO860 polypeptide comprising amino acids I to 985 of Ftgure 
77 (SH)© NO:211). ti« native sequence PR0846 polypeptide is a mature or full-length native sequence PR0846 
polypeptide comprising amino acids 1 to 332 of Figure 79 (SEQ ID NO:216). the native sequence PR0862 
polypeptide U a mamre or fuU-length native sequence PR0862 polypepude comprising am^ 1 to 146ofF.gUfe 
81 (SEQ ID NO-221). the native sequence PR0864 polypeptide is a mamre or full-length native sequence PR0864 
polypeptide comprising amino acids 1 to 351 of Figure 83 (SEQ ID NO:226). the native sequence PR0792 
polypeptide is a mamie or fuU-length native seque«e PR0792 polypeptide comprising amino acids 1 u. 293 of Figure 
35 85 (SEQ ID NO:231). die native sequence PR0866 polypeptide is a mamre or full-length native sequence PR0866 
polypeptide comprising amino acids I to 331 of Figure 87 (SEQ ID NO:236). Ok .native sequence PR0871 
polypeptide is a mature or M-lengA native sequence PR087 1 polypeptide comprising amino acids 1 to 472 of Figure 
89 (SEQ ID NO:245). Uk native sequence PR0873 polypeptide is a mamre or full-lengti. native sequence PR0873 
polypeptide comprising amino acids 1 to 545 of Figure 91 (SEQ ID NO:254). the native sequence PRO940 
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polypepDdeisan«morfi.U.Ie„gU,«nvescque«ePRO940poIyw^^^^ , .o 544 of Figure 

93 (SEQ ID NO:259). flic naiive sequence PR0941 polypeptide is a mature or full-length native sequence PR0941 
polypeptide comprising amino acids 1 to 772 of Figure 95 (SEQ ID NO:264). the mttive sequence PR0944 
polypeptide is a mature or fuU-length nadve sequence PR0944 polypeptide comprising amino acids 1 to 21 lof Figure 
98 (SEQ ID NO:270). the native sequence PR0983 polypepUde is a mature or full-length nanve sequence PR0983 
polypeptide comprising amino acids 1 to 243 of Figure 109 (SEQ ID NO:284), the native sequence PRO1057 
polypeptide is a mamre or full-length native sequence PRO,057 polypeptide comprising amino acids 1 to 413 of 
Figure 1 18 (SEQ ID NO:296). the native sequence PROI071 polypeptide is a mature or full-lengU, native sequence 
PRO1071 polypeptide comprising amino acids 1 to 525 of Figure 120 (SEQ ID NO:30n. the native sequence 
PRO1072 polypeptide is a mamre or full-length native sequence PRO1072 polypeptide comprising amino acids 1 to 
336 of Figure 122 (SEQ ID NO:303), the native sequence PRO1075 polypeptide ts a mature or full-length native 
sequence PROI075 polypeptide comprising amino acids 1 to 406 of Figure 125 (SEQ ID NO:309) the native 
sequence PR0181 polypeptide is a mature or luU-kngth native sequence PR018 1 polypeptide comprising «mno acids 
I «, 144 of Figure 129 (SEQ ID NO:322). .he native sequence PR0195 polypeptide is a mature or full-length native 
sequence PR0195 polypeptide comprising ammo acids 1 to 323 of Figure 132 (SEQ ID NO:330). the native sequence 
PR0865 polypeptide is a mamie or lull-length nanve sequence PR0865 polypeptide comprising amino acids 1 to 468 
of Figure 136 (SEQ ID NO:337). the native sequence PR0827 polypeptide .s a mamre or full-length native sequence 
PR0827 polypeptide comprising amino acids 1 to 124 of Figure 39 (SEQ ID NO:346). ti« native sequence PROl 1 14 
polypeptide is a mamre or full-lengd, native sequence PR01114 polypeptide comprising amino acids 1 to 311 of 
Figure 142 (SEQ ID NO:352). ti.e native sequence PR0237 polypeptide is a mamre or Ml-le,^ native sequence 
PR0237 polypeptide comprising amino acids 1 to 328 of Figure 145 (SEQ ID NO:358). ti,e native sequence PR0541 
polypeptide is a mamre or fiill-length native sequence PR0541 polypeptide comprising amino acids 1 to 500 of Figure 
147 (SEQ ID NO:363). ti« native sequence PR0273 polypeptide is a mamre or full-length nanve sequence PR0273 
polypeptide comprising amino acids 1 titrough 1 1 1 of Figure 149 (SEQ ID NO:370). tite native sequence PRO70I 
polypeptide is a mature or full-le.«,h native sequence PRO701 polypeptide comprising amino a 1 to 816 of Figure 
151 (SEQ ID NO:375). the native sequence PRO704 polypeptide is a ftill-length or mature native sequence PRO704 
polypeptide comprisi«g amino acids 1 or 40 through 348 of Figure 153 (SEQ ID NO:380). native sequence 
PRO706 polypeptide is a mamie or full-lengih native sequence PRO706 polypeptide comprising amino acids 1 .o 480 
of Figure 155 (SEQ ID NO:385). ti« native seque«:e PRO707 polypeptide is a fUlI-length or mamre native sequence 
PRO707 polypeptide comprising amino acids 1 or 31 through 916 of Figure 157 (SEQ ID NO:390) tiie native 
sequence PR0322 polypeptide is a mature or full-length nati^ sequence PR0322 polypeptide comprising «mno acids 
24 or 1 to 2(0 of Figure 159 (SEQ ID NO:395). ti,e «.tive sequence PR0526 polypeptide is a fulI-length or mamre 
nanve sequence PR(J526 polypeptide comprising amino acids I or 27 Utrough 473 of Figure 161 (SEQ ID NO-400) 
tiK native sequence PR0531 polypeptide is a mautre PR0531 polypeptide comprising amino acids 1 to 789 of Figure 
163 (SEQ ID mm, ti« native sequence PR0534 polypeptide is a mamre or full-lengti, native sequence PR0534 
polypeptide comprising amino acids 1 to 360 of Figure 165 (SEQ ID NO:410). die native sequence PR0697 
polypeptide is a full-lengti. or mamre native sequence PR0697 polypeptide comprising amino acids 1 or 21 tiirougK 
295 of Figure 167 (SEQ ID NO:415). ti,e native sequence PR0717 polypeptide is a mamre or fuU-lengU, native 
sequence PR0717 polypeptide comprising amino acids 1 tiuough 560 of Figure 169 (SEQ ID NO:420) tiK native 
sequence PR0731 polypeptide is a full-lengti, or mamre native sequence PR0731 polypeptide comprising a«i«, acids 
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I or 14 through 1 184 of Figure 171 (SEQ ID NO:425). the native sequence PR0218 polypeptide is a full-length or 
namre native sequence PR0218 polypeptide comprising amino acids 1 or 24 through 455 of Figure 173 (SEQ ID 
NO-430). the native sequence PR0768 polypeptide is a fiilHengih or mature naove sequence PR0768 polypepUdc 
comprising amino acids 1 or 34 through 1141 of Figure 177 (SEQ ID NO:437). the native sequence PR0771 
polypeptide is a full-lengih or mature native sequence PR0771 polypeptide comprising amino acids 1 or 17 through 
5 436 of Figure 179 (SEQ ID NO:442). the native sequence PR0733 polypeptide is a mature or full-length native 
sequence PR0733 polypeptide comprising ami«, acids 24 or 1 through 229 of Figure 181 (SEQ ID NO:447). the 
native sequence PR0162 polypeptide is a lull-length or manire native sequence PR0162 polypeptide comprising 
anrino acids I or 27 trough 175 of Figure 183 (SEQ ID NO:452). the native sequence PR0788 polypeptide is a lull- 
length or mawre native sequence PR0788 polypeptide comprising amino acids I or 18 through 125 of Figure 185 
10 (SEQ ID NO-454), the native sequence PRO1008 polypeptide is a full-length or manire native sequence PRO1008 
polypeptide comprising amino acids 1 or 24 through 266 of Figure 187 (SEQ ID NO:456). the native sequence 
PRO1012 polypeptide is a mature or full-length native sequence PRO1012 polypeptide comprising amino acids 1 
toough 747 of Figure 190 (SEQ ID NO:459). the native sequence PRO10I4 polypeptide is a fuU-length or manire 
native sequence PRO1014 polypeptide comprising amino acids 1 or 20 through 300 of Figure 192 (SEQ ID NO:464). 
15 the native sequence PROI017 polypeptide is a fiJUengih or mamie native sequence PRO1017 polypeptide comprising 
amino acids 1 or 32 through 414 of Figure 194 (SEQ ID NO:466), the native sequence PR0474 polypeptide is a 
or M-length native sequence PR0474 polypeptide comprising amino acids 1 through 270 of Figure 196 (SEQ 
ID NO-468). the native sequence PRO1031 polypeptide is a full-length or mamre native sequence PRO1031 
polypeptide comprising amino acids I or 21 through 180 of Figure 198 (SEQ ID NO:470). the native sequence 
20 PROWS polypeptide is a mature or fiiU-length native sequence PR0938 polypeptide comprising amino acids 1 to 349 
of Figure 200 (SEQ ID N0.472). the native sequence PRO1082 polypeptide is a full-length or maure native sequence 
PRO1082 polypeptide comprising amino acids 1 dirough 201 of Figure 202 (SEQ ID NO:477). the native sequence 
PRO1083 polypeptide is a full-length or mawre native sequence PRO1083 polypeptide comprising amino acids I or 
26 through 693 of Figure 204 (SEQ ID NO:483). die native sequence VEGF-E polypeptide is a man»e or full-length 
25 native sequence VEGF-E polypeptide comprising amino acids 1 duough 345 as depicted in Figure 207 (SEQ ID 
NO-488). the native sequence PR0285 is a mamre or full-length native sequence PR0285 polypeptide comprismg 
amino adds 1 to 1049 of Figure 209 (SEQ ID NO:496). the native sequence PR0286 is a manire or full-length native 
sequence PR0286 polypeptide comprising amino acids 1 to 1041 of Figure 211 (SEQ ID NO:298). the native 
sequent* PR0298 is a mature or fuU-length native sequence PR0298 comprising amino acids 1 to 364 of Figure 219 
30 (SEQ ID NO:515). the native sequence PR0337 is a mamre or hill-length native sequence human neurotrimin 
comprising amino acids 1 to 344 of Figure 222 (SEQ ID NO:523). with or without the N-terminal signal sequence 
(residues I to about 28). and wiU, or without the initiating methionine at position I and the native sequence PRO403 
is a mature or full-lengti, native sequence comprising amino acids 1 to 736 of Figure 225 (SEQ ID NO:526). with 
or without dK initiating methionine at position 1. 
35 The PRO polypeptide -'extracellular domain" or "ECD" refers to a form of the PRO polypeptide which is 

essentially free of the transmembrane and cytoplasmic domains. Ordinarily, a PRO polypeptide ECD will have less 
U«n 1% of such transmembrane and/or cytoplasmic domams and preferably. wUl have less than 0.5% of such 
domains. It wUI be understood that any transmembrane domains identified for the PRO polypeptides of Uie presem 
invention are identified pursuant to criteria routinely employed in the art for identifying that type of hydrophobic 
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domain. cxa« bouadaries of a transmembrane domain may vary bu. mos. likely by no more than about 5 amino 
acds at cther end of the domain as initially .dennfied. Optionally, therefore, an extracellular domain of a PRO 
polypepnde may contain from about 5 or fewer ammo acids on either or the transmembrane domam as initially 
identified. 
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•PRO polypeptide variant" means an active PRO polypeptide as defined above or below having at least about 
80% ammo acid sequence identity with the luU-length native sequence PRO polypeptide sequence as dtsdosed herein 
Such PRO polypeptide variant include, for imtance. PRO polypeptides wherein one or more amino acid residues 
are added, or deleted, a. the N- or C-termimis of the fulHeng* native amino acid sequence. Ordinarily a PRO 
polypeptide variant will have a, leas, about 80% amino acid sequence identity, more preferably at least about 85% 
ammo acid sequence identity, and even more preferably at leas, about 90% amino acid sequence identity even more 
preferably a, least about 91 % amino acid sequence identity, even more preferably at least about 92% amino ac.d 
sequence identity, even more preferably at least about 93% amino acid sequence identity, even more preferably a. 
least about 94% ammo acid sequence identity, even more preferably at leas, about 95 % amino acid sequence identity 
yet more preferably at leas, about 96% amino acid sequence identity, ye. more preferably a, leas, about 97% amino 
actd sequence identity, ye. more preferably at least about 98% amino acid sequence idemit>. and most preferably a, 
least about 99% amino acid sequence identity w.th ti,e amino acid sequence of the MHength native amino acid 
sequence as disclosed herein. 

•Percent (%) amino acid sequence identity" wiA respect to die PRO polypeptide sequences identified herein 
,s defined as me percenuge of amino acid residues in a candidate sequence tiuu are identical wiA ti,e amino acid 
residues in dte specific PRO polypeptide sequence, after aligning the sequences and introducing gaps, if necessary 
to achieve *e maximum percent sequence identity, and not considering any conservative substinuions as pan of Ae 
seque«e tdemity . Alignmem for purposes of determining percem amino acid sequence identity can be achieved in 
varH,us ways dutt are widun the skill in dte art. for instance, using publicly available computer software such as 
BLAST. BLAST-2. AUGN or Megalign (DNASTAR) software. The preferred software alignmem program is 
BLAST. n,ose skilled in me art can dcermine appropriate parameters for measuring alignment, including any 
algorithms needed to achieve maximal aligmnent over the fttli length of me sequences bemg compared The % 
.denaty values used hetein have been generated usit« me WU-BLAST.2 computer program (Altschul et al. . Msitm 
HLfigmte 2fifi:46(M80 (1996); ht.p://blas.. wusti/eduAlast/README.hmJ). Mos, of me WU-BLAST.2 search 
parameters were set m me defeuh vah.es. TTie adjustable panmeters were se. wid, me following values: overlap span 
= 1. overlap fraction = 0.125. word mreshold CD - 11. and scoring matrix - BLOSUM62. nKHSPSandHSP 
S2 parameters, which are dynamic values used by BLAST.2. are established by me program itself depending upon 
the composition of me sequence of interest and composition of me database against which me sequence is bemg 
scanned. However, me values may be adjusted to increase sensitivity. A % sequence Identity value is determined 
by me fraction of matching identical residues divided by me total number of residues in me aligned region. 

•Percent (%) nucleic acid sequence identity" wim respect to PRO-encoding nucleic acid sequences ideraified 
herein is defmed as me percentage of nucleotides in a candidate sequence mat are identical wim me nucleotides in 
the PRO nucleic acid sequence of i«eres,. after aligning me sequences and introducing gaps, if necessary, to achieve 
•he maximum percent sequence identity. Alignmem for purposes of determining percem nucleic acid sequence 
.denuo. can be achieved in vanous ways ma. are wimin me skill in me an, for insonce. using publicly available 
computer software such as BLAST. BLAST-2. AUGN or Megalign (DNASTAR) software. TTH»e sWUed in me an 
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can determine appropriate parameters tor measuring alignment, includmg any algorithms needed to acluevc maximal 
aUgnmeW over the M length of the sequences being compared. The idemity values used herein were get«n«cd by 
the BLASTN module of WU-BLAST-2 set to the default parameters, with overlap span and overlap fraction set to 
land 0.125. respectively. 

The terra "positives-, in the context of sequence comparison performed as described above, includes 
5 residues in the setjuences compared that are not identical but have similar properties (e.g. as a resuh of conservative 
substitutions). TT»c % value of positives is determined by the fraciion of residues scoring a positive value in the 
BLOSUM 62 matrix divided by the total raimber of residues in the aligned region, as defined above. 

The term "epitope tagged" where used herein refers to a chimeric polypeptide comprising a PRO 
polypeptide, or domain sequence thereof, fused to a "tag polypeptide". The tag polypeptide has enough residues to 
10 provide an epitope against which an antibody may be made, or which can be identified by some other agent, yet is 
short enough such that it does not interfere with the activity of the PRO polypeptide of interest. TTk tag polypeptide 
preferably is also fairly unique so that the antibody does not substantially cross-react with other epitopes. Suitable 
ug polypeptides generally have at least six amino acid residues and usually between about 8 to about 50 amino acid 
residues (preferably, between about 10 to about 20 residues). 
15 "Isolated." when used lo describe the various polypeptides disclosed herein, means polypeptide that has been 

identified and separated and/or recovered from a component of its natural enviromnem. Contaminant components 
of its natural environment are materials that would typically interfere with diagnostic or therapeutic uses for the 
polypeptide, and may inchide enzymes, hormones, and other proteinaceous or non-proteinaceous solutes. In preferred 
erriHKliments. the polypeptide will be purified (1) to a degree sufficient to obuin a, least 15 residues of N-tenninal 
20 or internal amino acid sequence by use of a spinning cup sequenator. or (2) to homogeneity by SDS-PAGE under non- 
reducing or reducing conditions using Coomassie blue or. preferably, silver suin. Isolated polypeptide includes 
polypeptide in situ within recombinant cells, since at least one component of the PRO polypeptide namral emrironmem 
wUl not be present. Ordinarily, however, isolated polypeptide will be prepared by at least one purification step. 

An "isolated" PRO polypeptide-encoding nucleic acid is a nucleic acid molecule that is idemified and 
25 separated from at least one comaminant nucleic acid molecule with which it is ordinarily associated in the nawral 
source of the PRO polypeptide nucleic acid. An isolated PRO polypeptide nucleic acid molecule U other than in the 
form or setting in which it is found m namre. Uolated PRO polypeptide nucleic acid molecules therefore are 
distinguished fit,™ the specific PRO polypeptide nucleic acid molecule as it exists in namral cells. However, an 
isolated PRO polypeptide nucleic acid molecule includes PRO polypeptide nucleic acid molecules contained m cells 
30 that ordinarily express the PRO polypeptide where, for example, the nucleic acid molecule is in a chromosomal 

location differem from that of namral cells. 

nie trrm "control sequences " refers to DNA sequences necessary for the expression of an operably linked 
coding sequence in a particular host organism. THe comrol sequences that are suitable for prokaryoics. for example. 
iiKlude a promoter. optionaDy an operator se,p«nce. and a ribosome binding site. Eukao^odc cells are known to 
35 utilize promoters, polyadenylation signals, and enhancers. 

Nucleic acid is "operably linked" when it is placed imo a functional relationship with another nucleic acid 
«quence For example. DNA for a presequence or secretory leader is operably linked to DNA for a polypeptide 
if it is expressed as a preprotein that participates in the secretion of .he polypeptide: a promoter or enhancer is 
operably linked to a coding sequence if it affects the transcription of dve sequence; or a ribosome binding site ts 
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opcrably linked ,o a coding sequence if it is posirioned so as lo faciliutt iranslatio^enerally. operably linked" 
means U,at the DNA sequences being linked a,e comiguous. and. in the case of a secretory leader. com.guous and 
m reading phase. However, enhancers do no. have to be contiguous. Unking is accomplished by ligation at 
convenient restriction sites. If such sites do no. exist, du: synthetic oligonucleotide adaptors or linkers are used in 
accordance with conventional practice. 

The term "antibody" is used in the bn>adest sense and specifically covers single anti-PRO polypeptide 
monoclona] antibodies (including agonist, antagonist, and neutralizing anUbodies) and ami-PRO polypeptide anUbody 
compositions with polyepi.opic specificity. The term "monoclonal antibody" as used herein refers to an antibody 
obtamed from a population of substantially homogeneous antibodies. ,.e.. the individual antibodies comprising tht 
population are identical except for possible namrally-occurring mutations that may be presem in mmor amounu. 

•Active- or "activity" for purposes herein refers to form(s) of PRO polypeptide which reuin the biologic 
and/or immunologic activities of Uje specific nadve or naturally-occurring PRO polypeptide. 

"Treatment" or "treating" refers to both dterapeudc treamtem and prophylactic or preventative measures 
TT«se m need of tream«« include those aheady with the disotder as well as dx,se prone to have the disorder of those 
in which the disorder is to be prevented. 

"Mammal" for purposes of treamient refers to any animal classified as a mammal, including humans 
domestic and farm animals, and zoo. sporu. or pe. animals, such as sheep, dogs, horses, cats. cows, and the like. 
Preferably, the mammal herein is a human. 

■Carrier" as used herein inch.de phaimaceurically accepuble carriers, excipients. or stabilizers which are 
nomoxic to the cell or mammal being exposed thereto at the dosages and concentrations employed. Ofien die 
physiologically acceptable carrier is an aqueous pH buffered solution. Examples of physiologically accepuble 
cairien include buffeB such as phosphate, citrate, and other organic acids; antioxidants including ascorbic acid- low 
molecular weight (less than about 10 residues) polypeptide; proteins, such as serum albumm, gelatin, or 
immunoglobulins: hydrophilic polymers such as polyvinylpyrrolidone; amino acids such as glycine, glutamine 
aspaiagine. arginine or lysine; monosaccharides, disaccharides. and other carbohydrates including glucose, mannose' 
or dextrins: chelating agents such as EDTA; sugar alcohols such as mamutol or sorbitol; salt-forming counterion^ 
such as sodium: and/or nonionic surfeoants such as TWEEN™. polyethylene glycol (PEG), and PLURONICS™. 

The term "agonist" is used to refer to peptide and non-peptide analogs of the native PRO polypeptides 
(where native PRO polypeptide refers to pro-PRO polypeptide. pre-PRO polypeptide. prepro-PRO polypeptide or 
mature PRO polypeptide) of the presem invemion and to antibodies specificaUy binding such native PRO 
polypeptides, provided diat tiiey retain at leas, one biological activity of a native PRO polypeptide. Preferably. U« 
agonists of the presem invention retain the qualitative binding recognition properties and receptor activation properties 
of die native PRO polypeptide. 

The term "anugonist" is used to refer to a molecule inhibiting a biological activity of a native PRO 
polypeptide of die presem invention wherein native PRO polypeptide refers to pro-PRO polypeptide. pre-PRO 
polypeptide. p«prt>PRO polypeptide, or mamre PRO polypeptide. Preferably, the antagonists herein inhibit Ae 
binding of a native PRO polypeptide of d>e presem invention .o a binding partner. A PRO polypeptide "anugonist" 
U a molecule which pteve«s. or interferes wiA. a PRO anutgonist effector function (e.g. a molecule which prevents 
or mterferes wid. bimiing and/or activation of a PRO polypeptide receptor by PRO polypeptide). Such molecules 
can be screened for their ability to competitively inhibit PRO polypeptide receptor activation by monitoring binding 
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of naave PRO polypepiid^the presence and absence of ihe Kst anuigomst molecule, for example. An antagomsi 
of Ae invention also encompasses an antisense polynucleotide against the PRO polypeptide gene, which suuiscnse 
polynucleotide blocks transcription or translation of the PRO polypepude gene, thereby inhibiti,« its expression and 
bioloeical activity. 

•Stringency" of hybridization reactions is readUy determinable by one of ordinary skill in the art. and 
generally is an empirical calculation dependent upon probe length, washing temperamrc. and salt concentration. In 
general, longer probes require higher temperanires for proper annealing, while shorter probes need lower 
temperatures. Hybridization generally depends on the ability of denantred DNA to reamieal when complementary 
strands are present in an enviromncm below their melting temperamre. n.e higher the degree of desired homology 
between the probe and hybridizable sequence, the higher the relative temperamre which can be used. As a result, 
i, follows that higher relative temperatures would tend to make the reaction conditions more stringem. while lower 
temperamrcs less so. For additional details and explanation of stringency of hybridization reactions, see Ausubcl et 
at r-..rT.p i Pf"'"'^"'^ in M"!^'-"'" B'"'""^- ^''^^ Imerscience Publishers. (199SV 

"Stringem conditions" means (1) employing low ionic strength and high temperamre for washing, for 
example 0.015 sodium chloride/0.0015 M sodium citrate/0.1 % sodium dodecyl sulfate at 50»C. or (2) cmploymg 
dum« hybridizadon a denamring agem. such as fom^mide. for example. 50% (vol/voH formamidc with 0.1 % bovine 
serum albuminA).l% Rcoll/0.1% polyv.nylpyrrolidone/50 nM sodium phosphate buffer at pH 6.5 with 750 mM 
sodium chloride. 75 mM sodium citrate a, A2'C. Another example is use of 50% formamide. 5 x SSC (0.75 M 
NaCl 0 075 M sodium citrate). 50 mM sodium phosphate (pH 6/8). 0.1 % sodium pyrophosphate. 5 x Denhardt's 
solution, sonicated salmon sperm DNA (50 .g/ml). 0.1% SDS. and 10% dextran sulfate at 42-C. with washes at 
42»C inO 2 X SSC and 0.1% SDS. Yet another example is hybridization using a buffer of 10% dextran sulfate. 2 
X SSC (sodium chloride/sodium citrate) and 50% formamide at 55-C. followed by a high-stringenqr wash consisting 
of O.l X SSC containing EDTA at 55 *C. 

-Moderately stringent conditions" are described in Sambrook « al.. supra, and include the use of a washing 
solution and hybridization conditions (e.g.. temperamre. ionic strength, and %SDS) less stringem than described 
above An example of moderately stringem conditions is a condition such as overnight incubation at 37-C m a 
solmion comprising: 20% formamide. 5 x SSC (150 mM NaCl. 15 mM trisodhm. citrate). 50 mM sodium phosphate 
(pH 7 6). 5 X Dcnhardfs solution. 10% dextran sulfate, and 20 mg/mL denamred sheared salmon sperm DNA, 
followed by washing the filters in 1 x SSC at about 37-50-C. TTie skilled artisan wiU recognize how to adjust d« 
temperature, ionic strength, etc.. as necessary to accommodate factors such as probe length ami the like. 

"Southern analysis" or "Southern bloning" is a method by which the presence of DNA sequences in a 
restriction endonudease digest of DNA or a DNA-comaining composition is confirmed by hybridization to a known, 
labeled oligonucleotide or DNA fragmem. Soudiem analysU typically involves electrophoretic separation of DNA 
digests on agarose gels, denamrauon of Ihe DNA after electrophoretic separation, and transfer of the DNA to 
nitroceUulose. nylon, or another suitable membrane support for analysis with a radiolabeled, biotinylated. or cnzyme- 
35 labeled probe as described in sections 9.37-9.52 of Sambrook « al. , MolmHi i r Clnninf ; A I fl^oraW>TY M a mwt (New 
York: Cold Spring Harbor Laboratory Press. 1989). 

•Nonhem analysis" or -Northern bloning" is a method used to identify RNA sequences that hybridize to 
a km,wn probe such as an oligonucleotide. DNA fragment. cDNA or fragmem thereof, or RNA fragmem. Tie probe 
is labeled with a radiobotope such as -P. or by biomvlation. or with an enzyme. TTie RNA to be analyzed is usually 
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eieccrophcrcicaHy separaced on an agarose or po.yacry.anudc gC. .ransferreZutrocenulosc. ny.on or oO^cr 
su..b. n«.bra„e. and hybridized wiU, ^ probe. usu,g sundard .echn.<,ues we., .cnown in a,e arc such as U,osc 
descnbcd in secuons 7.39-7.52 of Sambrook a al.. supra. 

AS used herein, a,e .em. "in^unoadhesin" designates annbody-hke molecules which combine d,e binding 
;^^^'y^«-P-b,(an.adhesin">wi.h^eefrec.„^ 

w^ch .s od^er .ban .e antigen recog^.o„ and binding sice of an ancibody (i.e.. is "heceroloeou!") and 1 
unnnnK,g.obuiincons.n.don.insecp.Ke. Tbeadhesb,panof» 

annno ac.d sequence comprising ac leas, che binding sice of a recep.or or a ligand. TT,e in^unoglobulin co^can. 
doma» sequence m che immunoadhesin may be obcained from any .mmunoglobuiin. such as IgC. IgG- IgCS 
> or IgG-l subtypes, IgA (including IgA-l and IgA-2),IgE.IgD or IgM. « - 8 ^. 

^ -Chrome- admimscrauon refen » admm,s.adon of *e agent(s) m a cominuous mode as'opposed .o an acute 
mode, so as to maintain the initial a«„peutic effect (acvicy) for an extended period of time. -,n.ermittenf 
admrntstradon is treatment U«, is no. consecutively done withou, in.ern,ption. but rather ts cyclic in namre 

Adminis.ra.ion "in combina.ion wiUt" one or more lurcher therapeutic agents includes simultaneous 
(concurrent) and consecutive administration in any order. 

As used herein, "vascular endothelial ceU growdi factor-E " or " VFfiF b •• . 
fo.,,.., A u.u . '^'^ VEGF-E. refers .0 a mammalian growth 

wh^ ^ *.™ „ ^ ,„ ^ „ ^^^^ ^ ^ 

^« »™n. W*a, . .p*,= Of p^nodn, ^„ 

Cdb p,.,„.,„„„ 0, p«p„„„ „^,„,^„ ^ 

ecu Un.. », e«,s« h,p.„„,phri» c«dl» or .« po^ » i™™. opi„p. ^ „ 

"-^^^ -* «. -Wy ^ ^ =. o™ op„op. 0, o„„,pon*,« „„„ VEOF. 

1 »oTr. — con»„.Uoo .c^s. =pocios. M,«^,, 

«« VEOF-E he,.™ . « p™* pl« „,«» to ,ho™ » ombni» f»u ™yo<:,„, 

« dote. . U.S. 5.»2.6,.. 1*. «.,^o. „ ^„ VEOF . ^ ^ „ ,„^, 

^to.apM*o.fp«»i^^^^,^^,„„^^__^^^^ 

,0^ com. ,.,o«l 0.™, oolU. BHK-21 «b™««s, ., „ ^ „ 

^»P=^.u»™.k*..^o»™«^..«,„.^ 

The tenm "VEGF-E polypeptide" and "VEGF-F- wh^n ..c^/i 

F /PF c djiu vt:u^t When used herein encompass native sequence VEGF-E 
polypoprid. ^ VEOF.E polypopa* „o ^ted VEOF-E p,^^ „„, 

^ ftcn, . vo«.^ „ souroo,. „ ta. ,i.»o fr,„„^, ^. „ J 

recombinant or synthetic methods. ^ 

InhibLors of VEGF-E include those which reduce or inhibn Ute activity or expression of VEGF-E and 
includes anusense molecules. 
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The abbreviation "KDR" refers to the kinase domain region of ihe VEGF molecule VEGF-E has no 

homoloay with VEGF in this domain 

" m abbreviation "FLT-I" refers to the FMS-like tyrosine kinase bindmg domain which is known to bind 
,0 the corresponding FLT-l receptor. VEGF-E has no homology with VEGF in this domain. 

"Toll receptor!-. ■TLR2" and "huTUU" are used interchangeably, and refer to a human Toll receptor 
designated as "HuTLRZ" by Rock e, aL Prpr m Awd. Sti. USA 2S. 588-593 (1998). 

•me teim "expression vector" is used to define a vector, in which a nucleic acid encoding a PRO polypeptide 
herein is operably linked to comrol sequences capable of affecting its expression is a suiubie host cells. Vectors 
ordinarily carry a replication site (although this .s no. necessary where chromosomal mtegrauon will occur). 
Expression vectors also include marker sequences which are capable of providing phenotypic selection in transformed 
cells For example. E. coli is typically transformed using pBR322, a plasmid derived from an E. coli speeds 
(Bolivar et al.. Gene 2: 95 11977]). pBR322 contains genes for ampicillin and tetracycline resistance and thus 
provides easy means for identifying transformed cells, whether for purposes of cloning or expression. Expression 
vectors also optimally wiU contain sequences which are useful for the control of transcription and translation, e.g.. 
promourrs and Shine-Dalgamo sequences (for prokaryotes) or promoters and enhancers (for mammalian celU). TTk 
promoters may be. but need not be. inducible: even powerful consumtive promoters such as the CMV promoter for 
n^nunalian hosu have been found to produce the LHR without host cell toxicity. While it is conceivable that 
expression vectors need not contain any expression control, replicative sequences or selection genes, their absence 
may hamper the identification of hybrid transformants and the achievement of high level hybrid immunoglobulin 
expression. 

■n« term "Upopolysaccharide" or "LPS" is used herein as a synonym of "endotoxin." Upopolysacchandcs 
(LPS) are characteristic componems of the outer membrane of Gram-negative bacteria, e.g.. Escherichia coU. Tbcy 
consist of a polysaccharide part and a fa. called Upid A. The polysaccharide, which varies from one bacterial specKS 
to another, is made up of the O-specific chain (built from repeating units of three ,o eight sugars) and the two-par, 
core LipKi A virtually always includes two glucosamine sugars modified by phosphate and a variable number of fatty 
25 acids For further infonnation see. for example. Rietschel and Brade. V i miifK American August 1992, 54-61 

The tenn "sepdc shock" is used herein in the broadest sense, including all definitions disclosed in Bone. AfflL 
InamMiLm 332-333 (1991). Specifically, septic shock starts with a systemic response to infection, a syndrome 
called sepsis. When this syndrome results in hypotension and organ dysfunction, it is called sepuc shock. Sepuc 
shock may be initiated by gram-positive organisms and fimgi. as well as endotoxin-containing Gram-ncgauve 
30 organisms. Accordingly, the presem definition is not limited to "endotoxin shock." 

The phrases "gene amplification" and "gene duplication" arc used interchangeably and refer xo a process 
by which muldple copies of a gene or gene fragment are formed in a particular cell or cell line. The duplicated region 
(a stretch of ampUfied DNA) is often referred to as "amplicon". Usually, the amount of the messenger RNA 
(mRNA) produced. i.e.. the level of gene expression, also increases in the prt)portion of the number of copies made 

35 of the particular gene expressed. 

"Ttoor". as used herein, refers to ail neoplasnc cell growth and proliferation, whether malignant or benign, 
and aU pce-cancenxB and cancennis cells and tissues. TTie tenns "cancer" and "cancerous" refer to or describe the 
physiotogical condidon in mammals .hat is typically characterized by unregulated cell growdi. Examples of cancer 
include bu. are not limited to. carcinoma, lymphoma, blastoma. sarcoma, and leukemia. More particular examples 
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of such cancers include breasi cancer, prosute cancer, colon cancer, squamous cdl^er. small-cell lung cancer 
non-snall ceD lung cancer, gasooouesanal cancer, pancreatic cancer, glioblastoma, cervical cancer, ovarian cancer 
Uver cancer, bladder cancer, hepatoma, colorectal cancer, endometrial carcinonu.. salivary gland carcinoma, kidnev 
cancer, vulval cancer, thyroid cancer, hepadc carcinoma and various types of head and neck cancer. 

nie term "cytotoxic agent" as used herein refers to a substance that inhibits or prevents the function of cells 
and/or causes desmicuon of cells. TTie term is imended to include radioactive isotopes (e.g. 1131. 1125. Y90 and 
Rel86). chemoiherapeutic agents, and toxins such as enzymatically active toxins of bacterial, fungal, plant or animal 
origin, or fragments thereof. 

A "chemotherapeutic agent" is a chemical compound useful in the treatment of cancer. Examples of 
chemotherapeutic agents include adriamycin. doxombicin. epirubicin. S-fluorouracil. cy.osine arabinoside (- Ara-C") 
cyclophosphamide, thiotepa. busulfan. cytoxin. taxoids. e.g. paclitaxei (Taxol. Bristol-Myers Squibb Oncology 
Princeton, NJ). a«l doxeutxel fTaxo^re-. Rhonc-Poulenc Rorer. Antony. France), .oxotere. methotrexate, cisplatin' 
melphalan. vinblastine, bleomycin, etoposide. ifosfamide. mitomycin C. mitoxantrone. vincristine, vinoreibine 
carboplatin. teniposide. daunomycin. camunomycin. aminopterin. dactinomycin. mitomycins, esperamicins (see U S 
Pa. No. 4.675.187). melphalan and other related nitrogen mustards. Also included in diis definition are hormonal 
agents that act to regulate or inhibit hormone action on tumors such as tamoxifen and onapristone. 

A -growth inhibitory agem" when used herein refers to a compound or composidon which inhibits growth 
of a cell, especially cancer cell overexpressing any of the genes identified herein, either in vitro or in vivo TTius 
die growm inhibi.00. agent is one which significantly reduces die percentage of cells overexpressing such genes in 
S phase. Examples of growth inhibitory agents include agents that block cell cycle progression (at a place odier than 
S phase), such as agents diat induce GI arrest and M-phase arrest. Classical M-phase blockers Include the vl«as 
(vmcristine and vinblastine), taxol, and .opo II inhibitors such as doxorubicin, epirubicin. daunorubicin. etoposide 
and bleomycin, nwse agents that arrest Gl also spill over into S-phase arrest, for example. DNA alkylating agents 
such as tamoxifen, prednisone, dacarbazine. mecWorethamine. cisplatin. methotrexate. 5-fluorouracil and ara-C 
Further information can be found inTT* Molecular Basis of Cancer. Mendelsohn and Israel, eds.. Chapter 1. entitled 
"Cell cycle rcgularion. oncogens, and antineoplastic drugs "by Murakami et al. (WB Saunders: Philadelphia. 1995) 
especially p. 13. 

"Doxorubicin" is an athracydine antibiotic. 

The term "cytokine" is a generic term for proteins released by one cell population which act on another cell 
as mtercellular mediators. Examples of such cytokines are lymphokines. monokines, and traditional polypeptide 
hormones. Inchuled among Ae cytokines are growth honnone such as human growth hormone. N-methionyl human 
growth hormone, and bovine growth hormone; parathyroid hormone; thyroxine; insulin; proinsulln- relaxin- 
prorelaxin; and ^ like. As used herein. d,e term cytokine includes proteins from natural sources or from 
recombinant cell culmrc and biologically active equivalents of die native sequence cytokines. 

•Immunological cross-reacdvity" as used herein means that the candidate polypeptide is capable of 
competitively inhibiting the qualitative biological activity of a PR0213-I. PRO1330. or PR01449 polypeptide having 
this activity wiU,polyctenal antisera raised against the known active PR02I3-1. PROI330. or PROI449 polypeptide 
Such amisera may be prepared in comrentional fi^on by injecting goats or rabbits, for example, subcutaneously witi, 
Oie known active a«dog„e in complete Freund's adjuvam. followed by booster intraperitoneal or subcuumeous 
miecnonmincompleteFreunds. The immunological cross-reactivity preferably is "specific", which means that ti,e 

120 



BNSOOCID- <WO_994S?eiA2.L> 




PCT/US99/05028 

WO 99/46281 

binding affuuty of the3unological crosweacnve molecule (e.g.. antibody) idcnufied. to the corresponding 
PR0213-1 PRO1330. or PR01449 polypeptide is s.gnif.candy higher (preferably ai least about 2-nine8. more 
preferably at least about ^tiracs. even more preferably at least about 6-times. most preferably at least about 8-times 
higher) than the binding aftoty of that molecule to any other known native polypeptide. 

-Native antibodies" and "native immunoglobulins" are usually heterotetrameric glycoproteins of about 
5 150 000 daltons. composed of two idcnUcal light (L) chams and two identical heavy (H) chains. Each light chain 
is linked to a heavy chain by one covalent disulfide bond, while the number of disulfide Imkages varies among the 
heavy chains of different immunoglobulin isotypes. Each heavy and light chain also has regularly spaced intracham 
disulfide bridges. Each heavy chain has at one end a variable domain (VH) followed by a number of consum 
Each Ught chain has a variable domain at one end (VL) and a constant domain at its other end; the consun. 
,0 domain of a>e l.ght chain is aligned with the first constant domain of the heavy chain, and the light chain varutble 
domain is aligned with the variable domain of the heavy chain. Particular amino acid residues are believed to form 
an interface between the light- and heavy-chain variable domains. 

The term "variable" refers to the fact that certain poruons of the variable domains differ extensively m 
sequence among antibodies and are used in the binding and specificity of each particular antibody for its particular 
15 antigen However, the variability is not evenly disuibuted throughout the varmble domains of ant.bod.es. It is 
concentrated in three segments called complementarity-determining regions (CDRs) or hypervariable regions both 
i„ the light-Chain and the heavy-chain variable domains. The more highly conserved portions of variable domaK. 
are called the framework (FR). TTie variable domains of native heavy and light chains each compnse four FR 
regions, largely adopting a P-sheet configuration. con«cted by three CDRs. which form loops connecting, and m 
20 some cases forming pan of. me P-sheet stnicmre. m CDRs in each chain are held together m close pro«mi.y by 
the FR regions and. with the CDRs from the other chain, contribute to the formation of the andgen-bindmg s.tc of 
antibodies (see Rabat et al.. NIH Publ. No.91-3242. Vol. I. pages 641^ (1991)). TTie constant domains are no. 
involved directly in bindmg an antibody to an antigen, but exhibit various effector functions, such as panicipaoon 
of the antibody in antibody-dependent cellular toxicity. 
25 "Antibody fragments" comprise a portion of an inuct antibody, preferably the antigen binding or variable 

region of the intact antibody. Examples of antibody fragments include Fab. Fab". F(ab-)2. and Fv fragments; 
diabodies; linear antibodies (Zapata et al. . Protein Eng. 8(10). 1057-1062 119951); single-chain antibody molecules; 
and muluspccific antibodies formed from antibody fragments. 

Papain digestion of antibodies produces two identical antigen-binding fragments, called "Fab" fragments. 
30 each with a single antigen-binding site, and a residual "Fc" fragment, a designation reflecting the ability to crystallize 
readily. Pepsin treatment yields an F(ab')2 fragment that has two antigen<ombining sites and is sull capable of cross- 
linking antigen. 

•Fv" is the minimum antibody fragment which contains a complete antigen-recognition and -bindmg site. 
Tte region consis«> of a dimer of one heavy- and one light-chain variable domain in tight, non-covalem association. 
35 It is in this configuration that the three CDRs of each variable domain interact to define an antigen-binding sue on 
the surface of the VH-VL dimer. Collectively, tiie six CDRs confer antigen-binding specificity to the anubody. 
However, even a single variable domain (or half of an Fv comprising only three CDRs specific for an antigen) has 
the ability to recogniie and bind amigen. although at a lower affinity than the entire binding site. 
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The Fab fragnmm also coma«s A. constant domain of the light chain andRrst constant domain (CH I ) 
of ,he heavy chain. Fab fragments differ ftom Fab fragmems by the addition of a few residues a. the carboxy 
tcn^nusofcheheavychainCHI domain including one or more cysteines from the antibody hinge region Fab'-SH 
« the designaUon herein for Fab' in wh.ch the cysteine residue(s) of the constant domains bear a free thio. group 
F(ab )2 annbody fragmcnu originally were produced as pairs of Fab' ftagments which have hinge cysteines between 
them. Other chemical couplings of antibody fragmems are also known. 

•me "light Chains- of antibodies (immunoglobulins) from any venebrate species can be assigned to one of 
two Clearly distirK. types, called kappa and lambda, based on the amino acid se<,uences of U,eir cons^u,. domains 
DependuVJ on the amino acid sequence of the constant domain of thcir heavy chains, immunoglobulins can 
be assigned to different classes. There are five major classes of immunoglobul.ns: IgA. IgD. IgE IgG and IgM 
and seve.^ of these may be fimher dWided into subclasses (isotypes). e.g.. IgGl. IgG2, lgG3. IgG4. IgA 'and IgA2' 
"S.ngle<haifl FV or "sFV antibody fragments comprise the VH and VL domain of antibody wherein 
these domains are present in a single polypeptide chain. Preferably, the Fv polypeptide further comprises a 
Pdypepdde linker between the VH and VL domains which enables the sFv to form the desired strucmre for antigen 
buxhng. For a review of sFv see Pluckthun in Pharmacology of Monoclonal Antibodies, vol. 113. Rosenburg 
and Moore eds.. Springer-Veriag, New York. pp. 269-315 (1994). 

TDe term "diabodies" refers to small antibody fragments with two antigen-bimling sites, which fragments 
compnse a heavy-chain variable domain (VH) connected to a light-chain variable domain (VL) in the same 
polypeptide chain (VH - VL). By using a linker that is too shon to allow pairing between the two domains on dte 
«^ Chain, the domains art forced ,0 pair with flte complementary domains of anorher chain and create two antigen- 
butdu« ti.es. Kabodie, are described mo« fitUy in. for example. EP 404.097: WO 93/1 1 161; and Holli,«er et al 
Proc. NaU. Acad. Sci. USA. 90:6444-6448 (1993). 

An -isolated- amibody is one which has been idemified and separated a«l/or recovered from a component 
of .ts namral environment. Contami»mt componems of its namral environmem are materials which would imerfere 
wtth d^gnosuc or therapeutic uses for ^ antibody, and may include enzymes, hormones, and otiier p,o„inaceous 
or nonproceinaceous solutes. In prefened embodiments, the antibody will be purified (1) to greater than 95* by 
we.ght of antibody as determined by the Uwry method, and most preferably more than 99% by weigh. (2) «, a 
degree sufficient to obtain a. leas. 15 residues of N-tenninal or internal ammo acid sequence by use of a spinning cup 
sequenator. or (3) .o homogeneity by SDS-PAGE under reducing or nonreducing conditions using Coomassie blue 
or. p^ferably. silver stain. Isolated antibody inchides the antibody in situ withm recombinant cells since a. leas, one 
component of Ae antibody's namral enviromncn. wUl no. be presem. Ordinarily, however, isolated antibody wtll 
be prepared by at least one purification step. 

nc word -label- when used herein refers u, a detectable compou«l or composition which is conjugated 
dtrecUy or i«lirectly to the antibody so as to generate a -labelled- antibody. The label may be detectable by itself 
(e g. n^oisoun. labels or fluo«scent labels) or. in tite case of an enzymatic label, may catalyze chemical alteration 
■J5 Of a subsu-att compound or composition which is delectable. 

By -solid phase" is meant a non-aqueous matrix to which die amibody of fte present imrention can adhere 
Eumples of solid phases encompassed herein include diose formed partially or entirely of glass (e.g.. comroUed pore 
8lass). polysaccharides (e.g.. agarose), polyacrylamides. polystyrene, polyvinyl alcohol and siUcones In cenain 
embodiment, depending on the context, the solid phase can comprise the well of an assay plate: in odiers 
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purificanon column (e g . an affinit>' chromaiography column). This term also mcludes a discominuous sohd phase 
of discrete particles, such as those described in U.S. Patent No 4.275.149. 

A -liposome" is a smaU vesicle composed of various types of lipids, phosphoUpids and/or surfactant which 
is useful for delivery of a drug (such as the anti-ErbB2 antibodies disclosed herein and, optionally, a chemothcrapeuttc 
agent) to a mammal. TTic components of the liposome are commonly arranged in a bilayer formation, similar to the 
lipid arrangement of biological membranes. 

II r^mposiiions m -^ M*'"'"'*^ °f Invemion 

The present mveniion provides newly identified and isolated nucleotide sequences cncodmg polypeptides 
referred to in the present application as PR0213. In particular. Applicants have tdemified and isolated cDNA 
encoding a PR0213 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and FasiA 
sequence alignment computer programs. Applicants found that a portion of the PR0213 polypeptide has significant 
homology with the human growth arrest-specific 6 (gas6) protein. Accordingly, it is presently believed that PR0213 
polypeptide disclosed in the present application may have the same or simular activity as does the gas6 protem 
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2. F,ri|.l.nyth PR077^ Pnlvtwottdes 

•nte present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0274. In particular. Applicants have identified and isolated cDNA 
encoding a PR0274 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and FastA 
20 sequence alignment computer programs. Applicants found that various portions of the PR0274 polypepdde have 
significant homology with the 7 transmembrane segmem receptor proteins and Fn54 protein. Accordmgly. « « 
presently believed that PR0274 polypeptide disclosed in the presem application ,s a newly identified member of the 
7 transmembrane segmem receptor protein and/or Fn54 protein family. 

25 3. Firll-ltnCt'' Pnlvngptides 

The presem invemion provides newly identified and isolated nucleoude sequences encoding polypeptides 
referred to in the presem application as PRO300. In particular. Applicants have identified and isolated cDNA 
encoding a PRO300 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and FastA 
sequence alignmem computer programs. Applicants found that various portions of the PRO300 polypepude have 

30 significamhomologywithdtehumanDifr33protein. Accordingly, it is presemly believed that PRO300 polypeptide 
disclosed in the presem application is a newly idemified member of the Diff 33 family. 

4. "B*'' PK f>?y^ Polvnenttdcs 

TTK presem invention provides newly idemified and isolated nucleotide sequences encoding polypeptides 
35 refened to in the presem application as PR0284. In particular. Applicants have idenUfied and isolated cDNA 
encoding a PR0284 polypepdde. as disclosed in further detail in the Examples below. To Applicants presem 
lotowledge. the UNQ247 (DNA23318-1211) micleotide sequence encodes a novel factor; usmg BLAST and FastA 
seqt^nce alignmem computer programs, no sequence identities to any known proteins were revealed 

123 



BNSOOCID- <W0 W462fl1A2 J^> 



wo 99/46281 



PCTa!S99/05028 

cfc.ed . .„ .e p„se„. app,icaUo„ . PR0296. . pa^cu... AppHcan. Have .de„.incd an. LZ 
enccn, a PR0296 po,ypep..e. as disCos. i„ de^. . .e Examples ^.ow Using BUST JZ 

5 ^ sarcon. a^p^., SAS prccin. According,, i. is p„se„.ly believed ^. PR0296 polypep.de disclosed il 
present apphcation is a newly identified SAS protein hon»log. 

^- Fuli-lgnyth Pnlyp ypntf^ 

iZI^'™' '"^ - 

15 

n.T" """"^ """^ «^ p»i»«p*^ 
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>• FulMenpth PP0363 Pnlypf 

apphcanon is a newly HCAR homolog. 

Full-bnpth PUfysan Pniypyp,!,,^ 
35 referred ^ T" « -'x'"^ PO'XPeptidcs 

encod.ng a PR0868 polypepUde. as disclosed in l.r*er de..l U ^ E«n^,es be.w. Using BLAST and Z 
2^ ahgnn^m con^„. p,ogran«. Applicants found that the PR0868 polypepdde has sig^ficam simUarity to 
d^^^ornecrosts factor receptor.Accordi^ly.U.p^^^^^^ 

present apphcatton ts a newly idenUf^d mentber of the «»or necrosis factor receptor fanuly of proteins. 
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10. Fnll-lfnpt'- P'^»387 PoWwrti<>« 
-n,e prescm mvenuon provides newly idenuficd and isolated nuc.eo.ide sequences encodmg polypepndes 
referred to m d,e present appUcanon as PR0382. In particular. Applicants have tdentifted and isolated cDNA 
encoding a PR0382 po.>pept.de. as disclosed in further detail m the Examples below. Us.ng BLAST and Fas.A 
sequence altgnment con^puter programs. Applicants found that the nauve PR0382 polypeptide shares s.gtufic^t 
, hon^logywithvartousserineproteaseproteins Applicants have also found that the DNA encodmg Ote PR03 2 
" polypeptide shares signiftca^ homology with nucleic acid encodtng var.ous serine protease prote.ns. Accordmg.y. 
i. is presently believed that PR0382 polypeptide disclosed in the presem application is a newly identified sert^: 
protease homolog. 



,Q 11. p..l|.l»nflh PRQMg p,flvneDtides 

The present mvemion provides newly identified and isolated nucleotide sequences encoding po.ypepudes 
referred to in Ote present application as PR0545. In particular. Applicants have identified a^ Uo^ted cDNA 
encoding a PR0545 po.ypept.de. as disclosed m farther deu.il .n the Examples below Usmg BLAST and F as^ 
J alig^nent computer programs. Apphcan. found that various portions of .e ^^^^^ 
15 sigmncamhomology w.,h.hesequences.dentifieddesignatedas:humanmeta..opro.e.naseCP WO^^^ ). mouse 

1. .Pha ,S^Sr, metalloprotease-disintegrin ™itri.alpha rOHW.36."). ADAM .3 - X«« 
,XLU6«X)3JTmouseme..rmbetarS«)258-).rabbitme.a,.oprotease«^^^ 
humanme.ninSrAF023477JThumanme.trinprecursorrAF023476J^humanADAM2 (AR^^^ 

^ human ADAM 20 rAF029899.1"). thereby indicatmg that PR0545 may be a «.vel me.trtn protcm. 
20 Accordingly, it is presently bC.eved that the PR0545 po.ypeptide disclosed in the present appUca.K,n ts a new^ 
idend«ednI^ofaten..^fa^.yandpossesses.l«ce..ularadhesivenesstypica.ofU^ 

comprise bodi metalloprotease and disintegrin domains. 

■n« present .nvention provides newly identified and isolated nucleotide sequences encoding po.ypepudes 
,. . „„.. PHOfil7 In oarticuiar. App.icants have identified and isolated cDNA 
referred to in the presem appl.cat.on as PR06I7. m parttcu . pp «, .cx FastA 

encoding a PR0617 po.ypept.de. as disclosed in further detail in the Examples below. Usmg BLAST ««» F stA 
Pn>gr^. AppUcan. found .at .e PR06.7 po.ypeptide shares sigm«ca„. homology 
irCD24 pro... Apphcan. have also found *at the DNA encodmg .e PR06n polypepude ^ s.gn.fica^. 
30 Tmology wi. DNA encoding .e CD24 prote.. Accordingly, it is presently believed PR06.7 polypept.de 
disclosed in the presem application is a newly identified CD24 homolog 

m presem invention provides newly identifed and isolated nucleotide sequences encoding polypepudes 
35 referred to m d« present appl.ca.ion as PR07^. In particular. Apphcants have identified and isolated cDNA 
encodi„gaPRO700polypeptide.asdisc.osedinft.r.herdetailin*eExamp.esbelow. An^ys.s of the a^no ac.d 

sequence of the fUlMength PRO700 polypepude using BLAST and FastA sequence aligmnem computer programs. 

^ various portions of ti« PRO700 polypeptide possess s.gt.r.cant sequence similarity to v..ous prctetn 
Z^isomerases. More specifiadly, an analysis of the Dayhoffdau.base (version 35.45 SwissPro. 35> .^^^^ 
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signincan. sequence sin^iariq, between che PRO700 am.„o acid sequence and Slowin. Dayhoff sequences- 
polypcpnde with protein disulfide isooKrase activity, designated as ("P P80664-,. human PDI designated as' 
("P_R31696 ■). hunun PDI. designated as (P.R25297-,. probable protein disulfide ,son«rase er-60 precursor 
d^ignated as ("ER60.SCHMA"). protein disulfide tson^rase precursor - Drosopm .elanosasur, designated a^ 
CPDLDROME"). protein disulfide-isomerase precunor - Niconana ua>accum, designated as CNTPDIGENE 1 ") 
protem disulfide isomerase - Onchocerca .ol^U^. designated as ("OVU12440.1-). human probable protein disulfide 
.somerase p5 precursor . designated as ("ERPS.HUMAN"). human protein disulfide isomerase-related protein 5 
( HSU79278J and protein disulfide isomerase precursor / prolyl 4- hydroxy. (-PDI HUMAN") thereby 
hidicating that PRO700 may be a novel protein disulfide isomerase. Accordingly, it is p^sently be.iived Uut 
PRO700 polypeptide disclosed in .he present application .s a newly identified member of the protein disulfide 
^omerase family and possesses the ability to cauly« the formation of disulfide bonds typical of the protein disulfide 
isomerase family. 

FMM-lenrth PRn702 P«fYr Pti'^n 
TTie presem invendon provides newly identified and isolated m«:leotide sequences encoding polypeptides 
referred to in the presem application as PRO702. m particular. Applicants have identified and isolated cDNA 
encodmg a PRO702 polypepddc. as disclosed in lurther deuil in the Examples below. Using BLAST and FastA 
sequence ahgnmen. computer programs. Applicants found that the PRO702 polypepdde has significant similarity to 
the congluunin protein. Accordingly, it is presently beheved that PRO702 polypeptide disclosed in the present 
application is a newly identified conglutinin homolog. 



1^- FulMpnrth Pp O70:t PnfY PTPti^'Tff 
TTe presem invention ptovides newly idemified and isolated nucleoude sequences encoding polypeptides 
referred to in the presem application as PRO703. In particular. Applicants have identified and isolated cDNA 
cncodtng a PRO703 polypeptide, as disclosed in further detail in d« Examples below. Analysis of the amino acid 
sequence of the full-length PRO703 polypeptide using BLAST and FastA sequetKe aligmncnt computer programs 
suggests that various portions of the PRO703 polypeptide possess significant sequence sunilarity to the VLCAS 
protein, thereby indicating that PRO703 may be a «>vel VLCAS protein. More specifically, an amUysis of the 
Dayhoir database (version 35.45 SwissProt 35) evidenced significam sequence similarity between the PRO703 amino 
acid sequence and the following Dayhoff sequences, human mRNA for ve.y-long<hain acyl-CoA. (-D88308-) rat 
30 mRNA for very-long-chain acyl-CoA synthetase. ("DSSIOO"). Mas musculus fatty acid tramport protein 
(-MMU15976"). human very-long-chain acyl-CoA syntiietase. ("DSSBOBJ"). Mas muscubis very.long<hai„ 
acyl<:oAsymhetase.rATO3(»lJ"). vety-long^hainacyl-CoA synthetase-/?^. ("DW P) rat long-chain 
fatty acid trampon protein. rFATP.RAT"), mouse long<hain fatty acid transport protein. ("FATP MOUSE") 
probable long^hain fetty acid transpon protein. ("FATl.YEASr). and fatty acid transporier protein' 
rCHY15839_2-, . thereby Indicating that PRO703 may be a novel VLCAS. Accordingly, it is presently believed 
•hat PRO703 polypepdde disclosed in the preaem application is a newly idemified member of the VLCAS family and 
possesses the ability to feciliu«e the cellular transport of long and vety long chain fatty acids typical of the VLCAS 
family. 
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16. f»iup«fth PRO705 Polvpeptltfes 
The present invention provides nev^ly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO705. In particular. Applicants have identified and isolated cDNA 
encoding a PRO705 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and FastA 
sequence alignment computer programs. Applicants found that the PRO705 polypeptide has significam similarity to 
5 the K-glypican protein. Accordingly, it is presently believed that PRO705 polypeptide disclosed in the present 
application is a newly identified member of the glypican family of proteoglycan proteins. 

17. Fill!-'-"?**' P""^^" Polvi>eptid« 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
10 referred to in the present application as PRO708. In particular. Applicants have idenufied and isolated cDNA 
encoding a PRO708 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and FastA 
sequence alignment computer programs. AppUcants found that the PRO708 polypeptide has significam homology with 
the aryl sulfatase proteins. Applicants have also found that the DNA encoding the PRO708 polypeptide has 
significam homology with DNA encoding the aryl sulfatase protems. Accordingly, it is presenUy believed that 
15 PRO708 polypeptide disclosed in the present application is a newly identified aryl sulfauue homolog. 

18. FvH-lpnyth PRCWIO PolYPCrtite 

m present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
refened to in d« presem application as PRO320. In particular. AppUcants have identified and Uolated cDNA 
20 encoding a PRO320 polypeptide, as disclosed ir> further detail in the Examples below. Analysis of the anuno acid 
sequence of the full-length PRO320 polypeptide using BLAST and FastA sequence aligmnent computer programs, 
suggests that various ponions of the PRO320 polypeptide have significam homology to the fibulin protein. More 
specifically an analysis of the Dayhoff dattbase (version 35.45 SwissProt 35> evidenced significam homology 
between the PRO320 amino acid sequence and the following Dayhoff sequences, human fibulin.2 precursor. 
25 designated -FBL2 HUMAN", human fibulin-l isoform a precursor, designated "FBLA.HUMAN". ZK783.1 - 
Caenorhabditis el^gans, designated -CELZK783.1". human-notch2, designated -HSU77493.r. Nel protein 
precursor - ranus norvegicus. designated "NEL.RAr. Mus musculus cell surface protein, designated ••D32210_l". 
mouse (fragmem) Notch B protein, designated -A49175-. C50H2.3a - Caenorhabditis elegarn. designated 
•CEC50H2 3". MEC-9L - Caenorhabditis elegans. designated "CEU33933J". and Mus musculus notch 4. 
30 designated "10 MMMHC29N7 2". thereby indicating that PRO320 may be a novel fibulin or fibulin-like protein. 
Accordingly, it .s presently beUeved diat PRO320 polypepdde disclosed in the presem applicarion U a newly idemified 
member of the fibulin family and possesses biological activity typical of the fibulin family. 



35 



19. FiiM-lfnfli Pan?i7.4PolviieetMw 
The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0324. In particular. Applicants have identified and isolated cDNA 
encoding a PR0324 polypeptide, as disclosed in fimher detail in the Examples below. Using BLAST and FastA 
sequence alignmen. computer programs. AppUcants found dutt *e PR0324 polypeptide has significam smnlanty to 
o«doreduc.ases. Accordingly, it is presently believed that PR0324 polypeptide disclosed in the presem apphcauon 
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is a newly identified oxidoreductase horaotog. 

20' FMll-lenirth PR0151 Pnlvpppti^r^ 
TTic present invention provides newly .dentiHed and isolated nucleotide sequences encoding polypeptides 
referred to in Ute present application as PR0351. In particular. Applicants have identified and isolated cDNA 
encoding a PR0351 polypeptide, as disclosed in funher detail in the Exan^les below. A^Uysis of the amino acid 
sequence of the full-length PR0351 polypeptide using BLAST and FastA sequence alignment computer programs 
suggests that various portions of the PR0351 polypeptide possess significant sequence simlUrity to the prostasin 
protem. thereby indicating U«t PR035. may be a novel prostasin protein. More specifically, an analysis of the 
Dayhoff database (version 35.45 SwissProt 35, evidenced significant sequence similarity between the PR0351 amino 
acid sequence and tf,e following Dayhoff sequences. •'AC003965 I". "CELC07GI 7" •'GEN12917" 
"HEPS.HUMAN". -GEN14584". "MCT6_M0USE". ••HSU75329_r. -PLMN.ERIEU". "TRYB HUMAN" and 
••P_W22987-. Accordingly, i, is presemly believed d,at PR0351 polypeptide disclosed in die prese'nt application is 
a newly .denufied member of dw prostasin family and possesses propenies and activities typical of dte prosasin 
family. 

21- Ftill-lenirth PROaa Pniyppptif^f^ 

TTie present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in tite presem appUcation as PR0352. In particular. Applicants have identified and isolated cDNA 
encoding a PR0352 polypeptide, as disclosed in lurdier detail in die Examples below. Using BLAST and FastA 
sequence aUgnment computer programs. Applicants found diat Ae PR0352 polypeptide has significant similarity to 
the butyrophiiin protein. Accordingly, it is presemly believed d«t PR0352 polypeptide disclosed in the present 
apphcation is a newly identified butyrophiiin homolog. 

22- Full-lenrth PR0381 Pnivp..ntiri>5 

TTie presem invemion provides newly idemified and isolated micleotide sequences encoding polypeptides 
referred to in tite present application as PR0381. In particular. Applicams have identified and isolated cDNA 
encoding a PR0381 polypeptide, as disclosed in farfter detail in die Examples below. Using BLAST and FastA 
sequence alignmem computer prt^. Applicants fomul dut dte PR0381 polypeptide has significant similarity to 
.mmunophilin proteins. Accordingly, it is presemly beUeved ti«t PR0381 polypeptide disclosed in ti« present 
appUcadon is a newly idemified FKBP immunophilin homolog. 



23. Full-lenrth PB 0386 Pnlvfii.ntiriP« 
TTie presem invemion provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the presem application as PR0386. In particular. AppUcants have idemified and isolated cDNA 
35 encoding a PR0386 polypeptide, as disclosed in furtiier detail in die Examples below. Using BLAST and FastA 
sequence alignmem computer programs. Applicants found diat die PR0386 polypeptide has significant similarity to 
the beta-2 subunit of a sodium chamiel protein. Accordingly, it is presemly believed Uiat PR0386 polypeptide 
disclosed in d« presem application is homolog of a beta-2 subunit of a sodium cham«l expressed in mammalian cells 
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24. F..n.|>rffh PRO'i40 PolvKPtidCS 
The present invemion provides ne^ly .demif.ed and isolated nucleotide sequences encoding polypeptides 
referred to m the present application as PRO540. In particular. Applicants have tdemifted and isolated cDNA 
encoding a PRO540 polypeptide, as disclosed m funher detail in the Examples below. Analysis of the amino acid 
sequence of the full-length PRO540 polypeptide using BLAST and FastA sequence alignmei>t computer programs. 
5 suggests that various ponions of the PRO540 polypeptide possess significant sequence similarity to the LC AT protem. 
.hereby indicating that PRO540 may be a novel LCAT protein. More specifically, an analysis of the Dayhoff 
database (version 35.45 SwbsPro. 35) evidenced signiftcan, sequence similarity between the PRO540 amino ac.d 
sequence and the foUowing Dayhoff sequences, phosphatidylcholine-sterol acyUransferase, des.gnated 
"LCAT HUMAN-, hypothetical 75.4 kd protein, designated "YN84.YEASr. Bacillus licHenifomis esterase. 
,0 designated -BLU35855 l". macrotetrolide resistance protem - S.rep,omyces. designated -iH0655". T-cell receptor 
delta chainprecursor. designated -C30583". Rhesus kringle 2. designated "P.WOTSSr. RAGE-l 0RF5. desig«..ed 
.HSU46191 3- human Ig kappa chain VKin.JK3. designated "HSU07466_r . and Alstroemeria inodora reverse 
.ranscriptase:designa.ed -ALI223606_1 " . Accordingly, it is presently believed that PRO540 polypeptide disclosed 
in the present appUcation is a newly identifed member of the LCAT protein famtly and possesses lipid transpon 
15 capability typical of die LCAT family. 

25. FvlHfiif*'' PAlvnentides 
•n« present invemion provides newly identified and isolated micleotide sequences encoding polypeptides 
referred to in the present application as PR0615. In particular. Applicant have idenUfed and isolated cDNA 
20 encoding a PR0615 polypeptide, as disclosed in further detail in the Examples below. Analysis of U« amino ac.d 
sequence of the full-length PR0615 polypeptide using BLAST and FastA sequence alignmem computer programs, 
suggests that various ponions of the PR0615 polypeptide possess significant sequence similarity to the hum«. 
synaptogyrin protein, thereby indicating that PR0615 may be a novel synaptogyrin protein. More specifically, an 
a^,s.ofd.Dayho.da.base.ersion35.45SwissProt35)ev.e..edsigni«cam^^^^^^ 
PR0615 amino acid sequence and the following Dayhoff sequences. -AF039085J . RNU39549_l . 
.CELTt)8A9 8", "FSUfiZOlg V, -S73645". ■•Y348_MYCPN". •AC0OO1O3.5-. -RT12_LSITA-. and 
-EBVlAlP2li r. Accordingly.!, is presently beUeved that PR0615 polypeptide disclosed in the presema^^^^^^^^ 
is a newly id««ified member of the synaptogyrin femily a«l possesses activity and properties typical of ti,e 
synaptogyrin family. 

26. fiin-ifpff**' PolVKPtito 
•n« present invention provides newly identified and isolated nucleotide sequ«Kes encoding polypepndes 
refened to in the presem application as PR0618. In particular. AppUcants have identified and isolated cDNA 
encoding a PR0618 polypeptide, as disclosed in farther detail in *e Examples betow. Analysis of dK amino actd 
35 sequence of the full-length PR06i8 polypeptide using BLAST and FastA sequence alignme« computer progran»^ 
suggests d^it various portions of the PR0618 polypeptide possess significant sequence similarity to the 
enteropeptidase protein, tiiereby indicating titat PR0618 may be a novel emeropeptidase. More specifically, an 
analysis of d« Dayhoff database (version 35.45 SwissProt 35) evidenced sigmficant sequence similarity between ti« 
;:m.tid sequence and ti. follow^ Dayhoff sequet^s. ■•P.W22987.. .KAL.HUMAN". 
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-AC003965.r. .GEN12917-. "ENTK.HUMAN". -FA11.HUMAN-. -HsSj29 I" "P W22986" and 
••PLMN.HORSP. Accordii^iy. , Relieved .ha. PR06I8 polypeptide disclosed in a,e pr'esem applica.,o„ 

•s a newly idenufied cnember of a« en«ropepUdasc fanUly and possesses cau..y,ic activity ,T,ica. of 
en.eropep.idase family. 

27- Full-letirth PB9719 P«lyppp«^,tfff 
-n-c prcsem invemion provides newly identified and isolated nucleotide sequences em:oding polypeptides 

referred to m the present application as PR0719. In particular. Applicants have identified and isola«d cDNA 
encodtng a PR07I9 polypeptide, as disclosed in further detail in the E:tan,ples below. Using BUST and FasU 
sequ^ce altgnmem computer programs. Applicants found that the PR07,9 polypeptide has significant sinularin. .0 
the hpoprcein lipase H prcein. Accordingly, it is presently believed ti«. PR0719 polypeptide disclosed in *e 
presen. apphcation is a newly idemified lipoprotein lipase H homolog. 

28- rglHtneth PR0724 Poivn^pt.^ 
The presen. invention provides newly identified and isolated nucleotide sequem:es encoding polypeptides 

referred to m the present appUcation as PR0724. In particular. Applicants have idemified and isolated cDNA 
cncodtng a PR0724 polypeptide, as disclosed in further deuil in ti« Examples below. Using BLAST and Fas.A 
-q^ alignment computer programs. Applicant found that the PR0724 polypeptide has significant simiU.rity to 
the human low density lipoprotein (LDL) receptor protein. Accordingly, i. is presently believed tiuu PR0724 
polypepude disclosed in the present application is a newly identified LDL receptor homolog. 

29- Full-knyth Pl^n772 Pnlyiw'fftMi'^ 
p«se« invention provides newly identified and isolated nucleotide sequences encoding polypeptides 

referred .0 m ti« presem application as PR0772. In particular. Applicants have identified and isolated cDNA 
encoding a PR0772 polypeptide, as disclosed in further deuil in the Examples below. Using BLAST and FastA 
^^quence ahgnment computer programs. Applica«s found O^t the PR0772 polypeptide has significant similarity to 
the human A4 protein. Accordingly, i. is presently believed that PR0772 polypeptide disclosed in ti« presem 
application is a newly idemified A4 protein homolog. 

30- FtilHength PROStt P^iypy pf j^if, 
TT^ presem invention provides newly identified and isolated nucleotide sequences encoding polypeptides 

referred .0 m the presem application as PR0852. In particular. Applicants have identified and isolated cDNA 
encoding a PR0852 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and FastA 
sequence ahgmnem compu.er programs. Applicant found that the PR0852 polypeptide has significant similarity ,0 
va.10^ protease proteins. Accordingly, i. is presendy believed that PR0852 polypeptide disclosed in the presem 
35 apphcaoon is a newly identified prouase enzyme homolog. 



30 



31- FulHcnrth PR08n P«ly~T«i,tn 
Tbc presem invention provides newly idemified and isolated nucleotide sequences encoding polypeptides 
referred to m Ute presem appUcation as PR0853. In particular. Applicant have identified ami isolated cDNA 
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cncodmg a PR0853 polypcpude. as disclosed m funher deu.il in the Examples below. Ai^lysis of *e ammo ac.d 
sequence of d.e full-lengd, PR0853 polypepude usmg BLAST a^l FastA sequence al.gnmen. computer programs, 
suggests .hai various portions of .he PR0853 polypeptide possess significani sequence smiitoty .o *c reducuse 
protein thereby indicating that PR0853 may be a novel reductase. More specifically, an analysis of the Dayhoff 
database (version 35.45 SwissProt 35) evidenced signiftcant sequence similarity between the PR0853 ammo acKl 
5 sequence and U.e foUow^g Dayhoff sequences. "P W03.98-. •CEC15HU.6-. ■•MTV03O_.2-. »P_W15759-, 
..S4265r "ATAC00234314-. -MTV022_13-. •SCU43704_r. 'CEi^TeV. and "ALFAJ". Accordingly. 
is presently believed that PR0853 polypeptide disclosed in the present application is a newly identified member of 
^ reducutse family and possesses the antioxidant enzymatic activity typical of the reductase family. 

Q 32. FvW-li-"?"' Polvpcptiriffi 

m present invention provides newly identified and tsolated nucleotide sequences encoding polypepudcs 
referred to in the present application as PRO860. In particular. Applicants have identified and isoUted cDNA 
encoding a PRO860 polypeptide, as disclosed in funher detail in the Examples below. Analysis of the ammo ac.d 
sequence of the fuU-length PRO860 polypeptide ustng BLAST and FastA sequence altgnmem computer programs. 

,5 suggest .hat various portions of the PRO860 polypeptide possess significant sequence similanty to *e ne»rofa«:.n 
plin d^ereby ^dica..^ that PRO860 may be a novel neurofitscin. More specificaUy. an analysis of the Dayho^ 
database (version 35.45 SwissProt 35) evidenced significant sequence similarity between dte PRO860 ammo ac.d 
database (vers ..aunooo l" -AF041O53 1" "CELZIO?? 2". ••RNU8l035_r. 

sequence and d« following Dayhoff sequences. •AF040990_l . AF041O53_l ,v„ 

r "S26180".-MMB1G2A r. -S46216". and ••RNU68726.1-. Accordingly, it .s presently beheved th^ 
20 PROSw'pol^peptide diseased in the present application is a newly identified member of the neurofascin family and 
possesses the cellular adhesion properties typical of the neurofascin family. 

33 nr„| l-l>nyth PR A"'*'^ Pnlvptotides 
m present invention provides newly identified and isolated nucleotide seque™:es encoding polypeptides 
^ .ferred to . d. present appUcation as PR0846. In particular. Applicams ^ve idetui^d ^^^^^^J^ 
encoding a PR0846 fK,lypeptidc. as disclosed in fi^ther detail in the Examples ^^^^ 
seque.eof.heM-l««*PR0846po.ypept^«sir« BLAST at. Fas^ sequence alignmemco^^^^^ 

suggests that various ponloas of the PR0846 polypcpdde possess significant sequence sim.lar.ty to the CMRf 35 
pin. a«reby^icati.«thatPR0846«^beanove,CMRP35 protein. ^-^^'^^^^^^^''^ 
Dayhoff daubase (version 35.45 SwissP.« 35) evidenced significant sequence similarity between dte PR0846 amm^ 
Tsequence and ^ follow^ Dayhoff sequences."CM35.HUMAN". —J". "PlOR.RABTr. 
.AFO43L I" «RNU89744 r.«A52(»l.l"."S4884,-."EUC06A9.3-.and"AF(W9588.r. Accordmgty, .t ts 
p^semly believe. d.t PR084'6 polypeptide disclosed in d,e present application is a newly idemified member of the 
CMRF35 protein family and possesses propen.es typical of Ae CMRF35 protein family. 

34. FiilUfnglh P""«*^ Polvntntidw 
The present invention provides newly identified and isola«d nucleotide sequences encoding polypepddes 
referred «, in the present appUcation as PR0862. In particular. Applicants have identified and isolated cDNA 
encoding a PR08« polypepdde. as disclosed in fitrther detail in the Examples below . Analys.s of the ammo actd 
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secuencc of U,e Ml-.e„g. PR0862 polypcp^dc using BUST and Fas.A sccp.„l!f„„„« p„,,^ 
suggest .ha. vanous pon.ons of .he PR0862 po.ypep.ide possess sig^f.can. sequence sinulari.y .0 *e lysozyn.' 
prcein, .hereby indica.ing ..at PR0862 may be a novel lysozyn« pro«in. More specifically, an analysis of *e 
Dayhoff daubase (version 35.45 SwissPro, 35) evidenced significan. sequence similarity between the PR0862 amino 
acd sequence and tt^ foUowing Dayhoff sequences. •■P.P90343-. and "LYC.HUMAN. Accordingly, i. is presently 
5 beHeved ^ PR0862 polypeptide disclosed in ^ present applicadon is a newly idendfied men*cr of the lysozyn.e 
fanuly and possesses cauUytic activity typical of d« lysozyme family. 

Full'knifth PR08M P«iyp,p||,j^ 
m presem invention provides newly idemified and isola.ed nuclccidc sequences encoding polypeptides 
10 referred to in the presem applicat.on as PR0864. pardcular. Applicant have idemified and isolated cDNA 
encoding a PR0864 polypeptide, as disclosed in fi.nher deu.il in the Examples below. Analysis of the amino acid 
sequence of the full-length PR0864 polypepUde using BLAST and FastA sequence alignmem computer programs 
suggests tha, various portions of the PR08« polypeptide possess s.gnificant sequence similarity u, ^ Wn.^ pro«in" 
therebyuK.catingthatPR0864maybea™,velWn.^p,otein. More specifically, an analysis of the Dayhoff daubase" 
(version 35.45 SwissPro. 35) evidenced significan. sequence similarity between the PR0864 amino acid seque«. 
and the following Dayhoff sequences. -WNT4.M0USE-. 'WNTS MOUSE". "WNSA HUMAN- 
-WN7B.M0USP. •WN3A MOUSE". ■■XLU«^288_r. .VN13_HUMAN-. "WN5B ORYLA-. W MOUSE" 
and -^NTA.MOUSE-. Accordingly, it is presently believed cha. PR0864 polypip.ide disclosed in"*e presem 

20 ptri"^' """"^ '"^'^ ''"^"^ W„M 

■n.e presem invention provides newly identified and isolated nucleoude sequences encoding polypeptides 
referred .0 in ti,e presem applicadon as PR0792. to particular. Applicams have identified and isolated cDNA 
encod.ng a PR0792 polypeptide, as disclosed in further detail in the Examples below. Usi,g BLAST ami FastA 
sequence ahgnmem computer programs. Applicams found that the PR0792 polypeptide has significant similarity to 
*e CD23 protein. Accordingly, i. is p«sentiy believed that PR0792 polypeptide disclosed in the presem application 
u a newly identified CD23homolog. 

^ 37. FMlHenrth PR08<i<i Pniyp-p^irtf^ 

m presem invention provides newly identified and isola.cd nucleotide sequences encoding polypeptides 
referred to in the presem application as PR0866. In particular. Applicams have identified and isolated cDNA 
encoding a PR0866 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and FastA 
sequence ahgmnent computer programs. Applicams foumi .ha. U,e PR0866 polypeptide has significam similarity to 

varuH^mmdinandspondinproteins. Accordingly, i. is presently believed ti,at PR0866 polypeptide disclosed in the 
presem application is a newly identified mindin/spondin homolog. 
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38. fyiu.r^h PR0«7I Polypeptides 
The presem invention provides newly identified and isolated nucleotide sequences encoding polypepu^ 
.efened to in the present application as PR087,. In particular. Applicants have identified and isoUted cDNA 
a PR0871 polypeptide, as disclosed tn .r^r de.i, .n .e Examples helow. Ustn. BLAST an PascA 
ZeJaUg™..™ computer pro^ran.. Applicant found ^.t^PROSTlpolypepUdehassi^^^^^ 

3 irCyP-6o7o«in. Accord^., it is presenUy believed that PR0871 polypeptide disclosed .„ ^e present 
application is a «w,y idendfied mender of ^ cydop^lin protein family and possesses acnv.ty typtca, of the 
cyclophilin protem family. 

39 F-ll-lf " f*** P^f ^"^^ P^lvneptides 
,0 m pr=«. pn,.i*s »w,y id.mir,.a »d .»«d -.o* «,^»=» »codi« p.l,M«*s 

typical of the carboxylcsicrasc family. 

40 p,.n-f yfiyth PR ^^Q^ff pnlvDCDtides 

^ p»»'hw.«i» p™.ito M'""-"" '"^ f'TC 

Cr^s«-»=o™p-»,p,.»«».AppU=^r»^«-«.P«o«p.i,P^^s.^«^^ 

p»„ni wti"*" « • 

a 

41. piihj— .1. PBOIMI P""*"^ 
p^ pnM*. n..!, ""l"** -^"^ '"'^^ 

^odin, . PR094, pcypepd*. » d,.Co.=d ,n tohc, i„ d. Ex»pl.. b..o.. U.n, Bl^ F«A 

aOTlication is a newly identified cadherin homolog. 

42. FvH-lf Pfth PR0944 FolvMrtWC8 
35 m presem invention pro^^des newly identified and isolated nucteoUde sequences encoding polypepu^s 

referred » in d. present appUcation as PRO^. In particular, Appl^an. have idenofied 
encoding a PR0944 polypeptide, as disclosed in funher detail in the Examples below. Usmg BLAST and FastA 
^^TaHg^ con^uter programs. AppUcants f^d ..t ^ PR09. .^^^^^^ 

TcPE-R cell surface pro^in. Accordingly, it is presenUy believed that PR0944 polypepudc dtsCosed . the 
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present applicaUon is a newly jdentified CPE-R homolog. 

^3- Full-knpfh pB^«>g3 P«ir|wptit|ri 
TT>e present invention provides newly identified and isolated nucleotide sequences encoding polypepddes 
referred to in the present application as PR0983. In particular. Applicants have idendfied and isolated cDNA 
encodtng a PR0983 pol>T«p,ide. as disclosed in further detail in the Examples below. Using BLAST and FastA 
sequence ahgn^ent computer programs. Applicants found that the PR0983 polypeptide has sig^fican. similarity to 
the vesicle-associated protein. VAP-33. Accordingly, it is presently believed that PR0983 polypeptide disclosed in 
the ptesen. applicadon is a newly identified member of the vesicle-associated membrane protein family and possesses 
acttvity typical of vesicle-associated membrane proteins. 

Full-fenrth POCflOSJ Pnlvpyptj^y^ 

Tl« presem invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present applicatton as PRO1057. In particular. Applicants have identified and isolated cDNA 
encoding a PRO1057 polypeptide, as disclosed in fiirther detail in the E«mples below. Ustng BLAST and FastA 
sequence al.gnmem computer programs. Applicants found that the PRO1057 polypeptide has significant similarity 
to vanous protease proteins. Accordingly. ,t ts presently believed that PROI057 polypeptide disclosed in the present 
apphcaaon is a newly identified protease homolog. 
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Full-lenyfh PRQlQTl Pnlvp^p|j^y.^ 

•me presem invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the presem application as PRO,071. In panicular. Applicants have identified and isolated cDNA 
encoding a PRO107, polypeptide, as disclosed in further detail in d« Examples below. Using BLAST and FastA 
sequence altgmnem computer programs. Applicant fouml Aat the PRO1071 polypeptide has significant similarity 
to the thrombospondin protein. Accordingly, it is presenUy believed that PRO1071 polypeptide disclosed in the 
25 presem appUcation is a newly identified thrombospondin homolog. 

^- FMll-lcnoth PRoio7^ P"«vpfrtii1ri 

nc presem invenUon provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the presem application as PRO1072. In panicular. AppHcams have identified ami isolated cDNA 
30 e.«oding . PROI072 polypeptide, as disclosed in further detail in *e Examples below. Using BLAST and FastA 
sequemre alignmem computer programs. Applicants found that the PRO1072 polypeptide has significant similarity 
to vaaous «ducu«e protein,. Acconfingly. it is presemly beUeved that PRO1072 polypeptide disclosed in the presem 
apphcaiion is a newly idemified member of the reductase protein family. 

47. Full.li.nyth PPHIOTS Pftlypfpttlfn 
The present imrenUon pnwides newly idemified ami isolated mtcleoude sequem:es encoding polypeptide, 
referred to m the presem application as PROI075. In panicular, Applicams have idendfied ami isolated cDNA 
encoding a PROI075 polypeptide, as disclosed in further detail in the Examples below. Using BUST ami FastA 
sequence aligmnem computer programs. Applicanu foum. dut the PRO1075 polypeptide has significan. similarity 

134 



BfSOOCIO- •«V»0_»»<e»iA4.|., 



• PCT/US99/05028 

.0 protein disulfide .somerase. Accordingly. U is presently belteved that PRO1075 polypeptide dtsclosed in .he 
present application is a newly idendfied member of the protem disulfide «omerase famtly and possesses ac.iv..y 
typical of that family- 

5 The present invention provides newly identified and isolated nucleotide sequences encodmg polypeptides 

referred to in the present application as PR0181. In particular. Applicants have identified and isolated cDNA 
eroding a PR0181 polypeptide, as dbclosed in further deuil in the Examples below. Using BLAST and FastA 
sequence alignment computer programs. Applicants found that the PR0.81 polypeptide has significant similartty to 
the comichon protein. Accordingly, it is presently believed that PR0181 polypeptide disclosed m the present 

10 application is a newly identified comichon homolog. 

49. pvn-i^^yth PROi«>5 PoWpertidts 
The present mvention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0195. In particular. Applicants have >demified and isolated cDNA 
15 encodingaPR0195polypeptide.asdisclosedinfurtherde.ailintheExa^^^^ TTie PRO. 95-encoding clone 

was isolated (torn a human fetal placenu library using a trapping technique which selects for nucleotide sequences 
encoding secreted pn,.eins. To Applicants presem knowledge, the UNQ169 (DNA26847-1395) nucleotide sequence 
encodes a novel factor; using BLAST and FastA sequence alignment computer programs, no sequence tdenuttes to 
any known proteins were revealed. 

20 

50. f,ffl.|>nyth PROfty P»lvi1CDtideS 
m present invention provides newly identified and isolated mideotide sequences encoding polypeptides 
referred to in the presem application as PR08«. In particular. Applicants have identified and isolated cDNA 
encoding a PR0865 polypeptide, as disclosed in further deuul in d,e Examples bek.w. The PR0865-encoding clone 
25 was isolated from a human feud kidney library using a trapping technique which selects for nucleotide sequences 
encodingsecretedproteins. Tims, the PR08d5-encoding clone may encode a secreted factor. To Applicants present 
knowledge the UNQ434 (DNA33974.1401) nucleotide sequence encodes a novel factor; using BLAST and FastA 
sequence alignment computer programs, no sequence identities to any known proteins were revealed. 

30 51. Fiill-lf"?*'' P'^Q"'^ Polvwptidw 

The presem invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in present appUcation as PR0827. In particular. AppUcants have identified and isolated cDNA 
encoding a PR0827 polypeptide, as disclosed in fimher detail in the Examples below. Using BLAST and FastA 
sequence alignment computer programs. Applicams found that Ute PR0827 polypeptide has significant similanty to 

35 VlA-2 and various other integrin proteins. Accordingly, it is presently believed that PR0827 polypeptide disclosed 
in the presem application is a novel iniegrin protein or splice variant thereof. 
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EHlHtnrth PROiiu Poivtwptjt^f:; 
TTie present invention provides newly idcmified and isolated nucleotide sequences encoding polypeptides 
referred to tn .he present application as PR011I4. In particular. Applicants have identified and isolated cDNA 
encoding a PR01114 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and FastA 
sequence alignment computer programs. Applicants found that the PROl 1 14 polypeptide has stgmficant similarity 
to the cytokine receptor femily of pmteins. Accordingly, it is presently believed that PROl 1 14 polypeptide disclosed 
m the present application is a newly identified member of the cytokine receptor family of proteins and possesses 
activity typical of that family. 

TT.e present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred .o in the present application as PROl 1 14 interferon receptor (UNQ557). In panicular. cDNA encoding a 
PROllH interferon receptor polypeptide has been identified and isolated, as disclosed in furtiwr detail in the 
Examples below. It is noted that proteins produced in separate expression rounds may be given different PRO 
numbers but the UNQ number is unique for any given DNA and the encoded protein, and will not be changed 
However, for sake of simplicity, in the present specification ti,e protem encoded by DNA57033.1403 as well as all 
further native homologues and variants included m the foregoing definition of PROl 1 14 interferon receptor, will be 
referred to as -PR01114 interferon receptor", regardless of their origin or mode of preparation. 

Using the WU-BLAST2 sequence alignment computer program, it has been found tiut a full-length native 
sequence PROl 1 14 interferon receptor polypeptide (shown in Figure 142 and SEQ ID NO:352) has sequence idemity 
witi, die other known interferon receptors. Accordingly, h is presently believed tiut PROl 1 14 interferon receptor 
possesses activity typical of oUier interferon receptors. 

53. Full-ll-nc«h PP 0237 Pnlyp fptjHyff 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in die present application as PR0237. In panicular. Applicants have idemified and isolated cDNA 
encoding a PR0237 polypeptide, as disclosed in furtiier detail in the Examples below. Using BLAST and FastA 
sequence aligranem computer programs. Applicants found flat Uk PR0237 polypeptide has significant similarity to 
carbonic anhydrase. Accordingly, it is presently believed titat PR0237 polypeptide disclosed in die presem 
application is a newly identified carbonic anhydrase htmiolog. 

FMi-i^noth mcf^t P»^vp»|^*^^^ff| 
The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in tfte presem appUcation as PR054I. In particular. Applicants have identified and isolated cDNA 
encoding a PR0541 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and FastA 
sequence alignment computer programs. Applicants found that tite PR0541 polypeptide has significant similarity to 
a tiypsm mhibitor protein. Accordingly, it is presendy believed that PR0541 polypeptide disclosed in the presem 
application is a newly identified member of tiie uypsin inhibitor protein family. 

Full-lenrth PR0273 Pnivp^ p«^,|f^ 
The presem invention piovides newly idemified and isolated nucleotide sequences encoding polypeptides 
referred to in the presem appKcation as PR0273. in panicular. AppUcams have idemified and isolated cDNA 
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encoding a PR0273 poi^Spnde. as disclosed in funher deu.l in Ac Examples below. Using BLAST and FasOV 
sequence alignmem computer progran.s. Apphcanis found Uiat various pontons of *e PR0273 polypeptide have 
significant sequence identity wi* various chemokines. Accordingly, it is presently believed .hat PR0273 polypeptide 
disclosed in the present applicauon is a newly identified member of the chemokine family and possesses activity 
typical of the chemokine family. 

56. F,ip^*«rth PRO701 PolYUCPtite 
TTie presem invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred .o in the presem appUcauon as PRO701. In particular. Applicants have identified and isolated cDNA 
encoding a PRO701 polypeptide, as disclosed in funher detail in the Examples below. Using BLAST and Fa«A 
sequence alignment computer programs. Applicants found that various portions of the PRO701 polypeptide have 
signifcant homology with the neuroligins 1. 2 and 3 and esterases including carboxyesterases and 
acytlchohnesterases. Accordingly, h is presently believed that PR0701 polypeptide disclosed in the presem 
application is a newly identified member of the neoroligin family and is involved in mediating recogmtton processes 
between neurons and/or functions as a cell adhesin molecule as is typical of neuroligins. 



IS 



57. Fll1l-lT"g*'' PRfWM PolvDentWts 
Tbc presem invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in .he present application as PRO704. In particular. Applicants have identified and isolated cDNA 
encoding a PRO704 polypeptide, as disclosed in fur^er detail in the Examples below. Using BLAST and FastA 
20 sequence alignment computer programs. Applicants found titat various portions of the PRO704 polypeptide have 
significant homology with the VIP36 and GP36b. Accordingly, it is presently believed that PRO704 polypeptide 
disclosed in the presem application is a newly identified member of the vesicular mtegral membrane protein family 
and possesses ^ ability to biml .o sugars and cycle between tiK plasma membrane and the Golgi cypical of this 
fanuly. 
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58. FiilHf ffgf^ P""7n< PoivwptidM 
TTie present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in die presem application as PRO706. In particular. Applicant have identified and isolated cDNA 
encoding a PRO706 polypeptide, as disclosed in ftinher detail in .he Examples below. Using BLAST and FastA 
sequence alignment computer programs. Applicants found tiuit various portions of die PRO706 polypeptide have 
sequence identity with the human prostatic acid phosphause precursor and U« bunan lysosomal acid phosphause 
precursor. Accordingly, it is presenUy beUeved tha. PRO706 polypeptide disclosed in the presem application » a 
newly identified member of tf« human prostatic acid phosphause precursor family and possesses phosphatase .ypKal 
of Uie acid phospbattse family. 

59. FiTlHf Pf*** PolvnentUte 
nie presem invemion provides newly identified and isoUted m^leotide sequences encoding polypeptides 
referred to in ti« presem application as PRO707. In particular. Applicants have identified and isolated cDNA 
encoding a PRO707 polypeptide, as disclosed in ftmher detail in the Examples below. Using BLAST and i^asiA 
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sequence aJig^nem compmer progranu. Applicams found various pomoas of a,e PRO707 polypeptide have 
significant homology with cadherins. panicularly cadherin FIB3 found in fibroblasts. Accordingly, i. .s presently 
bebeved that PRO707 polypeptide disclosed in the present application is a newly identified member of the cadhenn 
family and possesses cell interaction signaling typical of the cadherin family. 

«>• FMlMength FROVl P„h«^^;^ 
The present invenuon provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred ,o in the present application as PR0322. In particular. AppUcant, have idendfied and isolated cDNA 
encoding a PR0322 polypeptide, as disclosed in further detail in .he Examples below. Using BLAST and FastA 
sequence alignment computer programs. Applicants found diat various portions of the PR0322 polypeptide have 
significant homolt^sy with human neuropsin, serine protease, neurosin and .rypsmogen. Accordingly, it is presently 
believed that PR0322 polypeptide disclosed in the presen. application is a newly identified member of ^e serine 
protease family and possesses protease activity typical of this family. It is also believed that PR0322 is involved in 
hippocampal plasticity and is associated with extracelhdar matrix modifications and cell migrations. 

Fwll-knirthPROS76Poivp>prt^«, 
TTie presem invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
refened to in the presem application as PR0526. In particular. Applicants have identified and isolated cDNA 
encoding a PR0526 polypeptide, as disclosed in fiirther detail in the Examples below. Using BLAST ami FastA 
sequence alignmem computer programs. AppUcants fouml that various portions of .he PR0526 polypeptide have 
20 significam homology with the acid labile subunit of the insulin-like growth faaor complex (ALS) as well 
carboxypeptidase. SLFT. and platelet glycoprotein V. Accordingly, i. is presently believed that PR0526 polypeptide 
disclosed in the presem application is a newly identified member of the leucne-repeat rich superfamily. and possesses 
protein-protein interaction capabilities typical of this family. 

TTie presem invemion provides newly identified and isolated micleotide sequences encoding polypeptides 
referred to in the presem application as PR0531. In particular. Applicams have identified and isolated cDNA 
encoding a PR0531 polypeptide, as disclosed in fimher detail in the Examples below. Using BLAST and FastA 
sequence alignmem computer programs. Applicams found that various portions of the PR0531 polypeptide have 
sigmficant sequence identity and similarity widi members of Uie cadherin superfamily. parnculariy. protocadherin 
3. Accordingly, it is presently believed diat PR0531 polypeptide disclosed in d« presem application is a nev^ly 
.dennfied member of the cadherin superfamily. and is a protocadherin. PR0531 is a transmembrane protein which 
has extnceUular cadherin modfs. PR0531 is believed to be involved in ceU-ceU activity, in p,„icular. cell signaling. 

«• Fwlhteneth PRO.<34 Pniyppfftirtn 
UK presem invention provides newly idemified and isolated nucleotide sequences encoding polypeptides 
refcr«d .o in the presem application as PR0534. In particular. Applicants have idemified and isolated cDNA 
encoding a PR0534 polypeptide, as disclosed i„ fi«her deuul in the Examples below. Using BLAST and FastA 
sequence aligmnem computer programs. Applicams found that various portions of the PR0534 polypeptide have 
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significanl identity or similanty with puu.ive disulfide isomerasc erp38 precursor and thioredoxin c-3 
Accordingly, it is presently believed Out PR0534 polypeptide disclosed m the present application is a newly .demtf.ed 
«mber of the disulfide isomerase fanuly and possesses .he ability to recogmze and unscramble either mtcrmedtate 
or incorrect folding patterns typical of this family. 

64. r,.||.|yi ^pih PRC ^VJ ?fllYP<'P«'«'« 
Tlte present invention provides nev.ly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0697. In particular. Applicants have identified and isolated cDNA 
encoding a PR0697 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and FastA 
sequence alignment computer programs. Applicants found that vanous portions of the PR0697 polypeptide have 
significant identity or similarity with sFRP-2. sFRP-1 and SARP-1 . -2 and -3 Accordingly, it is presently believed 
that PR0597 polypeptide disclosed in tiie present application is a newly identified member of the sFRP family and 
possesses activity related to the Wnt signal pathway. 

65. f ;.||-|fpgt^ PR07I7 Pnlvpeotides 
The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0717. In particular. Applicants have identified and isolated cDNA 
encoding a PR0717 polypeptide, as disclosed in fimher detail in the Examples below. To Applicants pres«« 
knowledge the UNQ385 (DNA50988-1326) nucleotide sequence encodes a novel factor: using BLAST and FastA 
sequence alignment computer programs, no significant sequence identities to any known human proteins wcrt 
20 revealed. 

66. FlllHf fff*'' PRQ"^"*' PolvntPtMeS 
The presem invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the presem application as PR073.. In particular. Applicants have identified and isolated cDNA 
encoding a PR0731 polypeptide, as disclosed in furti«r detail in Uie Examples below. Using BLAST and FastA 
sequence alignment computer programs. Applicants found that various portions of the PR0731 polypeptide have 
significant homology with die protocadherins 4. 68. 43. 42. 3. and 5. Accordingly, it is presently believed that 
PR0731 polypeptide disclosed in die present application is a newly identified member of the protocadherin fiumly 
and possesses celKell aggregation or signaling activity or signal transduction im^olvement typical of this family. 
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67. f nH>ngth PR^aifl Pnlvpeptides 
The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in die present application as PR0218. In particular. Applicants have identified and isolated cDNA 
encoding a PR0218 polypeptide, as disclosed in further detail in die Examples below. The PROZlS^ncoding clone 
35 was isolated fiom a human fetal Iddney library. To Applicants presem Icnowledge. die UNQ192 (DNA30867..3351 
nucleotide sequence encodes a novel factor; using BLAST and FastA sequence alignment computer programs, no 
significam sequence identities to any known protein, were revealed. Some sequence idemiv was found w«h 
n^mbranc regulator proteins, indicating that PR0218 may fimcuon as a membrane regulator. 
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The present invention provides newly idenufied and isolaujd nucleotide sequences encodtne polypeptides 
referred to m the present application as PR0768. In particular. AppHcams have identified and isolated cDNA 
encodmg a PR0768 polypeptide, as disclosed in further detail in the Examples below. Usmg BLAST and FastA 
sequence ahgnmen, computer programs. Apphcants found tha, various portions of the PR0768 polypeptide have 
5 s-gmfican. homology with integrins. including integrin 7 and 6. Accordingly, it is presently beheved that PR0768 
polypep.,de disclosed in the present application .s a newly identtfted member of the iniegrm family either a 
homologue or a splice variant of integrin 7. and is involved with cell adhesion and communication between muscle 
cells and the extracellular mauix. 

'° Full-lenrth PB 077I Pnlvprp fj^^ 

The present tnvendon provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the presem application as PR0771. ,„ particular. Applicants have identified and isolated cDNA 
encoding a PR0771 polypeptide, as disclosed in further detail in the Examples below. Using BUST and FastA 
sequence alignment computer programs. Applicants found tha. vanous portions of the PR077, pohpeptide have 
stgrnfican, sequence identity and similarity with testican. Accordingly, it is presentlv beheved 'that PR077I 
poljwdde disclosed in the ptesem application is a « wly identified member of the testican family and possesses cell 
signalmg. bindmg. or adhesion properties, typical of this family. 

Full-lfnyth pi^ 0733 Pnlyp pptj,!^^ 
20 TTie present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 

referred to In the present applicauon as PR0733. In panicular. Applicants have identified and isolated cDNA 
encodmg a PR0733 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and FastA 
sequence altgnmem computer programs. Applicants found that various ponions of the PR0733 polypeptide have 
stgrnfican. sequence identity with the Tl/ST receptor billing protein. Accordingly, it is presently believed that 
PR0733 polypeptide disclosed in ti« presem application is a newly identified member of d,e mterleukin-like family 
bmd«« proteins which may be a cytokine and which may be involved in cell signaling. I, is believed that PR0733 
is an ApoAIV homologue. 
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71- Full-knyth Pn^\62 P«lvp>mil1n 
TTie present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to m the presem appUcauon as PR0162. In panicular. Applicants have Identified and isolated cDNA 
encodmg a PR0162 polypeptide, as disclosed in further detail in the Examples below. Using BUST and FastA 
sequence aligmnent computer programs. Apphcants found that various portions of the PR0162 polypeptide have 
s^mficant homology with human pancreatitis-associated protein (PAP). Applicants have also foaniHuu^ DNA 
««xUn8 the PR0162 poly^ide has significa« homology with bovine lithostathine precursor and bovine pancreatic 
*read prouin (PTP). Accordingly, h is presently beheved *at PR0162 polypeptide disclosed in the presem 
application is a newly identified member of the pancreatitis-associated protein family and possesses activity typical 
of the pancreatitis-associated protein family. 
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72. f^.ll- fon yth PR0788 Polvpeptldtt 
The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0788. In particular. Applicants have identified and isolated cDNA 
encoding a PR0788 polypeptide, as disclosed in further detail in the Examples below Using BLAST and FastA 
sctiuence alignment computer programs. Applicants found that various portions of die PR0788 polypeptide have 
significant homology with the anti-neoplastic urinary protein. Applicants have also found diat the DNA encoding 
the PR0788 polypeptide has significant homology with human E48 antigen, human component B protein, and human 
prostate stem cell antigen. Accordingly, it is presently believed that PR0788 polypeptide disclosed in the presem 
application is a newly identified member of the anti-neoplastic urinary protein family and possesses anti-neoplasiic 
activity typical of the anti-neoplasuc urinary protein family. 

73, y,. H.Ifnglh PR f>l W Pftlvpeptides 
The present invention provides newly idenuficd and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO1008. In particular. Applicants have identified and isolated cDNA 
encoding a PRO1008 polypeptide, as disclosed in fiirther detail in the Examples below. Using BLAST and FastA 
sequence alignment computer programs. Applicants found dtat various portions of the PRO1008 polypeptide have 
significant sequence identity and similanty with mouse dkk-1 (mdkk-1). Accordingly, it is presendy believed that 
PRO1008 polypeptide disclosed in the presem application is a newly identified member of ti» dkk-l fiuiuly and 
possesses head inducing activity typical of this family. 

20 74. ft| |i.l*inrth PP 9""^ PolvDCDtides 

n,e present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in die present application as PRO1012. In panicular. Applicants have identified and isolated cDNA 
encoding a PRO1012 polypeptide, as disclosed in furtiier detail in the Examples below. Using BLAST and FastA 
sequence alignment computer programs. Applicants found dtat various portions of tite PRO1012 polypeptide have 

25 sequenceidendtywidtdisulfideisomerase. Accordingly, h is presently believed d«t PRO1012 polypeptide disclosed 
in the present application is a .Kwly identified member of the ER retained protein family and possesses activity related 
to tite processing, production and/or folding of polypeptides typical of die disulfide bomerase family. 
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75. Fllll-lfnr'' PgQift'^ PolYPeptidts 
TUe presem invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred lo in the presem application as PRO1014. In particular. Applicants have idemified and isolated cDNA 
encoding a PR01014 polypeptide, as disclosed in funher detail in *e Examples below. Using BLAST ami FastA 
sequence aligmnem computer programs. Applicants found dtat various ponions of Uk PRO1014 polypeptide have 
sequence idewity with reductase and dehydrogenase. Accordingly, it b presently believed that PR01014 polypeptide 
35 disclosed m ti>e presem application is a newly identified member of d« reductase super family ami possesses 
reduction capabilities typical of tius family. 
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f^' FulUktiPth PRfllOn Pftlvnpptt^f, 
The prcscm invcmion provides newly ideraified and .sola.ed nucleotide sequences encodmg polypeptides 
referred to tn the present applicauon as PRO1017. In panicular. Applicants have idemifed and isolated cDNA 
encoding a PRO1017 polypeptide, as d«closed in fitrthcr detail m the Examples below. Using BLAST and FastA 
sequence alignment computer programs. Applicants found that vanous ponions of the PRO10I7 polypeptide have 
sequence idennty with HNK-1 sulfotra^sferase. Accordingly, it is presently believed .hat PRO1017 polypeptide 
disclosed m the present application is a ,«.wly identified member of the HNK-1 sulfotransferase family and .s mvolved 
with the synthesis of HNK-1 carbohydrate epitopes typical of this famUy. 

^- Full-lfnrth PR 0474 Pnlyp pptiH^ff 

TTte present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0474. In particular. Applicants have ideraified and isolated cDNA 
encoding a PR0474 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and FastA 
sequence alignmem computer programs. Applicants found that various ponions of the PR0474 polypeptide have 
sequence identity with dehydrogenase, glucose dehydrogenase and oxidoreducuse. Accordingly, i. ,s presemlv 
believed that PR0474 polypeptide disclosed in the present application is a newly identified member of the 
dehydrogenase family and is involved in die oxidation of glucose. 



Full-length PRO1031 Pniyp<.p^,My. 
■me present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
20 referred .o in the present application as PR01031. In particular. Applicants have identified and isolated cDNA 
encoding a PRO103I polypeptide, as disclosed in further detail in the Examples below. Using BLAST and FastA 
sequence aligmnem computer programs. Applicants found that various portions of the PR01031 polypeptide have 
sequence idemity wid, IL-17 and CILA-8. Accordingly, it is presently believed that PRO1031 polypepude disclosed 
.n the present application is a newly identified member of the cytokine family and tiius may be involved in 
25 mflammation and/or the immune system. 
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79- FHll-knrth PR0938 PnlvppnttHf^ 
TTie present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in Uie present application as PR0938. In particular. Applicants have idemified and isolated cDNA 
encoding a PR0938 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and FastA 
sequence aligmnem computer programs. Applicants found that the PR0938 polypeptide has significant similarity to 
pioiem disulfide isomerase. Accordingly, i, is presently believed diat PR0938 polypeptide disclosed in the presem 
application is a newly idemified member of the thioredoxin family proteins and possesses activity typical of protein 
disulfide isomerase. 

^- FMlHcngth PR01M2 P«lyp»p|j,^^ 

THe presem invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the presem application as PRO1082. In particular. Applicants have identified and isolated cDNA 
encoding a PROI082 polypeptide, as disclosed in further detail in Umj Example, below. Using BLAST and FastA 
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sequence alignment compmer programs. Applicants found that various ponions of the PRO1082 polypep.ide have 
sequence identity with a lectin-like oxidi«d LDL receptor appearing in the database as •ABOlO7l0.r . Accordingly, 
i, is presently believed that PRO1082 polypeptide disclosed in the present application is a newly identified member 
of the LDL receptor family. 

, 81. Fiill-Hg*'' PROiOH-^ PolvMrtite 

The presem invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the presem application as PRO1083. In pardcular. Applicants have identified and isolated cDNA 
encoding a PRO1083 polypeptide, as disclosed in further detail in the Examples below. Tl,e PRO1083-encoding clone 
was isolated from a human fetal kidney library using a trapping technique which selects for nucleodde sequences 

0 encoding secre«d proteins. To Applicants present knowledge, the UNQ540 (DNA50921-1458) nucleotide sequence 
encodes a novel factor; using BLAST and FastA sequence alignment computer programs, some sequence .demtty with 
arm receptor, latrophilin related protein I and a macrophage restricted cell surface glycoprotein was shown. The 
ki«.se phosphorylation site and G-coupled receptor domain shown in Figure 204 indicate tha. PRO1083 is a novel 
member of the 7TM receptor superfamily 

15 

82. p,.ii.i>nath PRCnn PA'rT"""*''^ 
nie present im^ention provides newly idemified and isolated nucleotide sequences encoding polypeptides 
referred to in the presem application as VEGF-E. In particular. Applicants have identified and isolated cDNA 
encoding a VEGF-E polypeptide, as disclosed in further detail in the Examples below. Using BLAST sequeiKe 
20 aligmnem computer ptograms, Applicants found that the VEGF-E polypeptide has significant homology wtth VEGF 
andbonemotphogenencproteinl. hi particular, the cDNA sequence of VEGF-E exhibits 24% amino acid similarity 
with VEGF and has strucn^ral conservation. In addition. VEGF-E comains a N-.erminal half which is no. presem 
in VEGF and diat has 28% homology to bone morphogenetic protein 1 . 

25 83. FlllHf"?*'' ""^"^ ^""^ ^^^^^ PfflYBtrtite 

•me presem invemion provides newly identified ami isolated micleotide sequences encoding polypeptides 
^ «, in the presem application as PR0285 ami PR0286 In particular. Applicants have identified and isolated 
cDNAs em:odi«g PR0285 am! PR0286 polypeptides, as disclosed in further detail in the Examples below. Usmg 
BLAST and FastA sequem:e aligmnem computer programs. Applicants found that die coding sequences of PR0285 
30 and PR0286 are highly homologous to DNA sequences HSU88540J. HSU88878.1. HSU88879.1. HSU88880_1. 

and HSU88881 1 in the GenBank database. 

Accordingly, it is presently believed that the PR0285 and PR0286 proteins disclosed in the pn»e« 
application are newly idemified human homologues of die DrosopHila protein Toll. am. are likely to play an important 
,„,e in adaptive immunity. More specifically, PR0285 ami PR0286 may be involved in inflammanon. septtc shock. 
35 ami response to padiogens. ami play possible rotes in diverse medical comlitions diat are aggravated by mimune 
response. suchas.fbre«mple. diabetes. ALS. earner, rheumatoid arthritis, ami fleers. Tl« role of PR0285 ami 
PR0286 as pathogen panem recognition receptors , sensing the presem:e of conserved molecular structures presem 
o„microbes.isltHthersuppo,tedby.hedatadisclosedinthepr«^^ showing that a taiown human ToU- 

like receptor. TLR2 is a direct mediator of LPS signaling. 
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EwlHcnrth PR021M. PROn^o and PRm^io Pfri^R n 
n^e presem invcndon provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0213-1. PRO1330 and/or PR01449. In panicular. cDNA encoding a 
PR0213-1, PRO1330 and/or PR01449 polypeptide has been idenufied and isolated, as disclosed in further detail in 
U,e Examples below. ,t ,3 noted that proteins produced in separate expression rounds tnay be given different PRO 
nun,bers but the UNQ number is unique for any given DNA and the encoded protein, and will no. be changed. 
However, for sake of simplicity, in the present specification the protein encoded by DNA30943-1 163-1 DNA64907 
1163-1 and DNA64908-1163-1 as well as all lurther native homologucs and variants included in 'the foregoing 
definition of PR0213-1. PRO1330 and/or PR01449. wai be referred to as "PR0213-1. PRO1330 and/or PR01449". 
regardless of ihcir origin or mode of preparation. 

FulHcnrth PR029g Polvf«>priHff, 

The presem ,m,ention provides newly identified and isolated nucleotide sequences eiKoding polypeptides 
referred to in the presem applicanon as PR0298. m is noted that PR0298 is an arbitrary designation of a protein 
encoded by the nucleic acid shown i„ Figure 218. SEQ ID NO: 514. and havmg the amino acid sequence shown .n 
Ftgure 219. SEQ ID N0:515. Further proteins having the same am.no acid sequence but expressed in different 
rounds of expression, may be given differem "PRO" numbers.) 

In particular. Applicants have identified and isolated cDNA encoding a PR0298 polypeptide as disclosed 
m lurther detail in the Examples below. Using BLASTX 2.0a8MP-WashU computer program, socring parameters- 
T-12. S-68. S2=36. Matrix: BL0SUM62. Applicants found that the PR0298 protein specifically disclosed herein 
shows a imiited (27-38%) sequence identity with the following sequences found in the GenBank database: 55939^ 
(probable membrane protein VLR246w - yeast): S58154 (hypothetical protein SPAC2F7. 10 - yeast): CELF33D1 1 9 
(F33D11.9b-Caenorhabditis elegans): Y04I_CAEEL (hypothetical 68.7 kd protein zk757.1); CEAC3 5 (AC3 4 - 
Caenorhabditis elegans): S52691 (probable transmembrane protem YDR126w - yeast); ATn2H17 14 (protein - 
Arabidopsis thaliana): S55963 (probable membrane ptotein YNL326c - yeast); CELC43H6 2'(C43H6 7 - 
Caenorhabditis elegans); TMO18A10_14 (A_TMO18A10.8 - Arabinosa thaliana). 

f ull-lenrth PR0137 P»ivp^p*tf^,>s 
nic presem invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present appIicaUon as PR0337. In particular. Applicants have identified and isolated cDNA 
encoding a PR0337 polypeptide, as disclosed in further detail in the Examples below. Using BLAST. BLAST-2 and 
FastA sequence alignment computer programs. Applicams found that a ftiU-length native sequence PR0337 has 97% 
anrino acid sequence identity with rat «u««imin. 85% sequence identity with chicken CEPU. 73% sequence idenuty 

with chicken G55. 59% homology with human UMP and 84% homology with human OPCAM. Accordingly it 
rs presentiy believed that PR0337 disclosed in the presem applicanon is a newly identified member of the IgLON 
sub family of the immunoglobulin superfamily and may possess neurite growth and differentiation potemiaung 
properties. 
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87. F..ii.f^ pgth P^ fy^n^ fnlvpentides 
The present invemion provides newly identified and isolated nucleotide sequences encoding polypepudes 
referred to in the present application as PRO403. In particular. Applicants have identified and isolated cDNA 
encoding a PRO403 polypeptide, as disclosed in further deuil in the Examples below Usmg a BLAST. BLAST-2 
and FastA sequence alignment computer programs. Applicants found that a full-length native sequence PRO403 has 
94% identity .o bovine ECE-2 and 64% identity to human ECE-1. Accordingly is presently believed that PRO403 
- member of the ECE protein family and may posses ability to catalyze the production of active endothcUn. 



is a new i 
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88. p pn Pnlvpep tide Variante 
In addition to the M-length native sequence PRO polypepddes described herein, u is contemplated that PRO 
polypeptide variants can be prepared. PRO polypeptide variants can be prepared by introducing appropriate 
nucleotide changes into the PRO polypeptide DNA. or by synthesis of the desired PRO polypeptide. TlK.se skUled 
in .he art will appreciate that amino acid changes may alter post-translational processes of the PRO polypeptides, such 
as changing the number or position of glycosylation sites or altering Qie membrane anchormg charactensucs. 

Variations in the native full-length sequence PRO polypeptides or in various domains of the PRO 
15 polypeptides described herein, can be made, for example, using any of the techniques and guidehnes for conservauvc 
and non^onscrvauve muution, se, forth, for instance, in U.S. Patent No. 5.364.934 Variations may be a 
subsUmtion. deletion or insertion of one or more codons encoding the PRO polypeptide that results in a change in 
U« amino acid sequence of the PRO pobT^pdde as compared with the native sequence PRO polypeptide. Optionally 
the variation is by substimtion of a. least one amino acid with any other amino acid in one or more of the domams 
of the PRO polypeptide. Guidance in detennining which amino acid residue may be inserted. subsUmtcd or deleted 
without adversely affecting the desired activity may be found by comparing the sequence of the PRO polypeptide with 
that of homologous known protein molecules and minimizing the mm*er of amino acid sequence changes made m 
regions of high homology. Amino acid substitutions can be the result of replacing one amino acid with another 
acid having similar strucmral a«i/or chemical properties, such as the repbcemem of a leucine with a serine, i.e.. 
25 conservativeaminoacidreplacements. insertions or deletions may optionally be in the range of I to 5 ammo acids. 
The variation allowed may be determined by systematicaUy making insertions, deletions or substitutions of ammo 
acids in the sequence and testing .be resulting vananu for activity in the in .tro assay described m the Examples 
below. 

in particular embodiments, conservative substimtions of interest are shown in Table 1 under tiie headmg of 
30 p^fcrredsubstitutions. If such substitutions result in a change in biological activity, tiien more substantial changes, 
denominated exemplary substitutions in Table 1 . or as further described below in reference to ammo acid classes, 
are introduced and ihe products screened. 
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Original 




Resjdue 


5 


Ala (A) 




Arg (R) 




Asn (N) 




Asp fD) 




Cys (C) 


10 


Gin (0) 




GIu(E) 




Gly (G) 




His (H) 




lie (I) 


15 






Leu(L) 




LysflC) 




Met (M) 


20 


Phe(F) 




Pro(P) 




Ser (S) 




ThrO) 




Trp(W) 


25 


Tyr(Y) 




Val(V) 



Exemplary 
SubstirufippQ 

val: leu; ile 

lys; gin; asn 

gin; his; lys; arg 

glu 

ser 

asn 

asp 

pro; ala 

asn; gin; lys; arg 

leu; val; met; ala; phe; 

norlcucine 

norleucine; ile; val; 

met; ala; phe 

arg; gin; asn 

leu; phe; ile 

leu; val; ile; ala; tyr 

ala 

thr 

ser 

tyr; phe 

irp; phe; thr; ser 
ile; leu; met; phe; 
ala; norleucine 



Preferred 

SMfrMttmiQQs 

val 

lys 

gin 
glu 
ser 
asn 
asp 
ala 
arg 

leu 

ile 
arg 
leu 
leu 
ala 
thr 
ser 
tyr 
phe 

leu 



Substannal modifications in function or immunological idennty of the PRO polypeptide are accomplished 
30 by selecting substimtions that differ significantly h their effect on maintaining (a) the structure of the polypepUde 

backbone in the area of the substimtion. for example, as a sheet or helical conformation, (b) the charge or 

hydrophobicity of the molecule at the target site, or (c) the bulk of the side chain. Namrally occurring residues are 

divided mto groups based on common side-chain properties: 

(1) hydrophobic: norleucine, met, ala, val, Icu, ile; 
35 (2) neutral hydrophilic: cys, ser, thr; 

(3) acidic: asp, glu; 

(4) basic: asn, gin. his, lys, arg; 

(5) residues that influence chain orientation: gly, pro; and 

(6) aromatic: trp, tyr. phe. 

40 Non^onservative subs.in.dons will en«il exchanging a member of one of Uiese classes for anoCer class 

Such subsftu^d residues also may be in«xl«ced i„«, fte conservative substimtion sm or, more preferably, into tt,e 
remaimng (non-conserved) sites. 

nve variadons can be made using meftods known in U« an such as oligonudeotide-mediated (site-direcied) 
45 Si«^irec.edmu«genes.,Carterec...My.L^^. 
13:4331 (1986): Zoller « al.. NmcI, Afid^ Rr5 . ifi:6487 (1987)]. cassette muugencsis fWells e, al. ficns 34.3,5 
(1985)1. restricdon selecdon mutagenesis fWells et al.. Philn. Tn.n. p mnrlnn^rrA 20:415 (1986)] or other 
known techniques can be performed on the cloned DNA to produce dte desired PRO polypeptide variant DNA 
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Scanning amino acid analysis can also be employed lo idemify one or more anuno acids along a conuguous 
sequence. Among the preferred scanning amino acids are relatively small, neutral amino acids. Such amino acds 
i«:lude alanine, glycine, serine, and cysttinc. Alanine is typically a preferred scanning amino acid among this group 
because it eliminates the side-chain beyond the beta-carbon and is less likely to alter the main-chain conformation of 
the variant. Alanme is also typically preferred because it is the most common amino acid. Further, it is frcquenUy 
5 found in both buried and exposed positions (Crcighton. IhSLEiaffims. (W.H. Freeman & Co., N.Y.). Chothia. L 
MiOiaL. 15Q:1 0976)]. If alanine substimtion does not yield adequate amounts of variant, an isoteric amino acid 
can be used. 

89. |Vl>>rfirication!; i ff ppn Pnlvneptides 
10 Covalem modifications of PRO polypeptides are included within the scope of this invention. One type of 

covalent modif.cauon includes reacting targeted amino acid residues of the PRO polypepude with an organic 
derivatizing agent that is capable of reacting with selected side chains ot the N- or C- terminal residues of the PRO 
polypepude. Derivatization with bifunctional agents is useful, for instance, for crosslinking a PRO polypeptide to 
a water-insoluble suppon matrix or surface for use in the method for purifying anti-PRO polypepude antibodies, and 
15 vice-versa. Commonly used crosslinking agents include, e.g.. l.l-bis(dia2oacevl)-2-phenylethane. glutaraldehyde. 
N-hydroxysuccinimide esters, for example, esters with 4.azidosalicylic acid, homobifimctional imidoesters. including 
disuccinimidyl esters such as 3.3'-dithiobis(succinimidylpropionate). bifunctional raaleimides such as bis-N- 
maleimido-1.8-octane and agents such as methyl-3-l(p-a2idophcnyl)dithiolpropioimidatc. 

Other modifications include dcamidation of glutaminyl and asparaginyl residues to the corresponding 
20 glutamyl and aspanyl residues, respectively, hydroxylation of proline and lysine. phosphorylaUon of hydroxyl groups 
of setyl or threonyl residues, methylation of the a-amino groups of lysine, arginine. and histidine side chains fT-E. 
Creighton. mrinr ^mrnrrntllt^"— W-H. Freeman & Co.. San Francisco, pp. 79-86 (1983)). 
aceiylauon of the N-terminal amine, and amidaiion of any C-terminal carboxyl group. 

Another type of covalem modification of the PRO polypeptides included within the scope of this invention 
25 comprises altering the nauve glycosylauon pattern of the polypeptide. "Altering the native glycosylation pattern" is 
intended for purposes herein to mean deleting one or more carbohydrate moieties found in a native sequence PRO 
polypeptide, and/or adding one or more glycosylation sites that are not present in the native sequence PRO 
polypeptide, and/or alteradon of the ratio and/or composition of the sugar residues atuiched to the glycosylation 
site(s). 

30 Addition of glycosylation sites to the PRO polypeptide may be accomplished by altering the amino acid 

sequence. THe alteration may be made, for example, by the addition of. or substitution by. one or more serme or 
U«omne residues to die native sequence PRO polypeptide (for 0-linked glycosylation sites). IHe PRO polypeptide 
amiiK, acid sequence may optionally be altered through changes at the DNA level, particularly by mutaUng die DNA 
encoding the PRO polypeptide a. preselected bases such that codons are generated thai will translate imo the desired 

35 amino acids. 

Another means of increasing the number of carbohydrate moieues on the PRO polypeptide polypeptide is 
by chemical or enzymatic coupling of glycosides to the polypeptide. Such methods are described in the an. e.g.. in 
WO 87/05330 published 11 September 1987. and in Aplin and Wrision. cpP Cril RffY . Biwhcm. . PP- 259-306 
(1981). 
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Removal of carbohydrate moieues present on the PRO polypeptide may be accomplished chemically or 
enzymadcally or by mutatio«il substtmtion of codons encoding for amino acd res.dues that sem as urgets for 
glycosylation. Chemical deglycosyladon techniques are known in the an and described, for instance, bv Hakimuddin 
e. al.. Areh. m^m Piophys .. 252:52 0987) and by Edge e< al.. Anal R.n.h.n, ui:m (1981). Enzymatic 
cleavage of carbohydrate moieties on polypeptides can be achieved by dte use of a vanety of endo- and exo- 
glycosidases as described by TTiotakura ei al.. Meth. En7vn.ni I2g:350 (1987). 

Another type of covalem modification of PRO polypeptides of the invention comprises linking Ote PRO 
polypeptide to one of a variety of nonproteinaceous polymers, e.g.. polyethylene glycol, polypropylene glycol or 
polyoxyalkylenes. in the manner set forth in U.S. Patent Nos. 4.640.835; 4.496.689: 4.301.144: 4 670 417 
4,791.192 or 4.179,337. ' ' 

TTie PRO polypeptides of .he present invention may also be modified in a way to form a chimeric molecule 
comprising a PRO polypeptide fused ,o another, heterologous polypeptide or am.no acid sequence In one 
embodiment, such a chimeric molecule comprises a fusion of the PRO polypeptide with a tag polypeptide which 
provdes an epitope to which an ami-tag antibody can selectively bind. The epitope ug .s generally placed a. the 
ammo- or carboxyl- terminus of the PRO polypeptide. The presence of such cpitope-tagged forms of the PRO 
polypepode can be detected using an antibody against the tag polypeptide. Also, provtston of the epitope ug edibles 
fte PRO polypeptide to be readily purified by affinity p«ificadon using an anti-tag antibody or another type of affimty 
matrix that binds to the epitope tag. In an altenuttive embodiment, the chimeric molecule may comprise a fusion of 
the PRO polypeptide with an immunoglobulin or a panicular region of an immunoglobulin. For a bivalem form of 
the chimeric molecule, such a fusion could be to die Fc region of an IgG molecule. 

Various ug polypeptides and their respective antibodies are well known in the art. Examples include poly 
Wstidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; the flu HA tag polypeptide and its antibody 12CA5 
(Field et al.. Mol , Ce l l . Piol . 8:2159-2165 (1988)]; the c-myc tag and the 8F9. 3C7. 6E10. G4. B7 and 9E10 
antibodies ,bcn^ [Evan e. al.. MolPCMlar and THhllfir Piolon-. 5:3610-3616 (1985)); and the Herpes Simplex virus 
glycoprotein D (gD) tag and its antibody (Paborsky et al.. Protein Pn<,in..Hnp 2(6) 547-553 (I990)|. Other tag 
polypeptides include the Flag-peptide fHopp et al.. fiialichnalm. 6:1204-1210 (1988)); dte KT3 epitope peptide 
fMartin et al.. SofflCfi. 255:192-194 (1992)): an «-.ubulin epitope peptide fSkim«r et al.. J.Biol r^m 26&: 15163- 
15166 (1991)): and the T7 gene 10 piotdn peptide lag [Latz-Freyeimuth et al.. Proc. N..I a.,h g2:6393- 
6397 (1990)]. 

^ ^- Preparation tff pro Pnlypy ptit)i«j 

Hie description below relates primarily to producuon of PRO polypeptides by culmring cells transformed 
or wnsfected with a vector contaiiung the desired PRO polypeptide nucleic acid, h is. of course, comemplated that 
alternative methods, which are well known in the art. may be employed to prepare the PRO polypeptide. For 
instance, the PRO polypeptide sequence, or portions thereof, may be produced by direct peptide synthesis using solid- 

35 phase techniques [see. e.g.. Stewan et al.. Sglid-P)iaK Pfptidr 5;ymhni,. W.H. Freeman Co.. San Francisco. CA 
(1969): Merrifield. J . Am . rhcm ^OT . 25=2149-2154 (1963)). In v,tro protein syndesis may be performed using 
manual techniques or by automation. Automated symhesis may be accomplished, for instance, using an Applied 
Biosysiems Peptide Synthesizer (Foster City. CA) using manufacturer's mstnictions. Various portions of ihe desired 
PRO polypeptide may be chemically synthesized separately and combined using chemical or enzymatic methods to 
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produce the ftill-length PRO polypepade 



A. ic^intmp «f nNA FnTiiilinr PBf> 
DNA encoding PRO polypeptides nuiy be obtained from a cDN A library prepared from tissue believed to 
possess the desired PRO polypeptide nANA and to express .t at a detectable level. Accordingly, human PRO 
5 polypeptide DNA can be com^enienily obtained from a cDNA library prepared from human tissue, such as described 
in the Examples. The PRO polypepude-encoding gene may abo be obtained from a genomic library or by 
oligonucleotide synthesis. 

Ubraries can be screened with probes (such as antibodies to the desired PRO polypeptide or ohgonucleottdes 
of at least about 20-80 bases) designed to identify the gene of interest or the protein encoded by it. Screemng the 
10 cDNA or genomic library with me selected probe may be conducted using suindard procedures, such as described 
in Sambrook et al., M~..--|,r ^-^w- ^ ' ^Hor.,nrv Manual (New York: Cold Spring Harbor Uboratory Press. 
1989) An alternative means to isolate the gene encodmg the des.red PRO polypeptide is to use PCR methodology 
(Sambrook et al.. smi. Dieffenbach et al.. PCF PrimrfA I ilhoriim Mmual (Cold Spring Harbor Uboratoiy 
Press, 1995)). 

15 The Examples below describe techniques for screemng a cDNA hbrary. Tlie oligonucleotide sequences 

selected as probes should be of sufficient length and sufficiently unambiguous that false positives are minimized. The 
oligonucleotide is preferably labeled such that it can be detected upon hybridization to DNA in the library betng 
screened Methods of labeling arc well known in the art. and include the use of radiolabels like «P-labeled ATP. 
bioD^ylaiion or enzyme labeling. Hybridization conditions, including moderate stringency and high stringency, are 

20 provided in Sambrook et aL . sui2m. 

Sequences identified in such library screening methods can be compared and aligned to other known 
sequences deposited and available in public databases such as GenBank or other private sequence daubases 
Sequence identity (at either the amino acid or nucleotide level) within defined regions of the molecule or across the 
full-length sequence can be determined through sequence aligmnem using computer software programs such as 

25 BLAST ALIGN. DNAstar. and INHERIT which employ various algorithms to measure homology. 

Nucleic acid havmg protein coding sequence may be obtained by screening selected cDNA or genomic 
libraries using die deduced amino acid sequence disclosed herein for *e first time. and. if necessary, using 
comrentional primer extension procedures as described in Sambrook e. al. . aJBia. to detect precursors and processmg 
imermediates of mRNA that may not have been reverse-transcribed into cDNA. 

30 

B. fr>l<«-«inn and T ri""=f"rmation of HaSt CcllS 

Host cells are transfected or transformed with expression or cloning vectors described herein for PRO 
polypeptide production and culmred in conventional nutriem media modified as appropriate for inducing promoters, 
selecting trans formants. or amplifying the genes encoding the destred sequences. Tl« cul«it conditions, such as 
35 media, temperawre. pH and the like, can be selected by the skilled arttsan without u«lue experimentanon. In 
general, principles, protocols, and practical technique, for maximizing the productivity of cell culmres can be found 
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Me*odsof«mfccoona«la»w„,od«orduBrily skilled anisan. for example. CaPO, and elec.ropora.ion 
Depcndmg on d« host cell used, uansfonnauon .s perfonned using sumdard .echmques appropr.aie ,o such cells 
The calcum .ream^m employing calcum chloride, as described in Sambrook e. al.. or elec.ropora.ion ,s 
generally used for prokaiyoBs or odKr cells da, contain subsuncial ccll-wall barriers. Infecon w.th Agrobaaenun, 
tumefaaens is used for transformauon of certain plan, cells, as described by Shaw e. al.. fisnc. 22:315 (1983) and 
WO 89/05859 published 29 June 1989. For manunalian cells widu,ut such cell walls. Ae calcium phosphate 
prec.p.a„on method of Graham and van der Eb. Yinte. 52.45M57 (1978) can be employed. General aspect 
of mammalian ceU host system transfonnanons have been described in U.S. Paten. No. 4,399.216. TransfonnaUons 
m,o yeas, are typically carried out according ,o .he method of Van Solingen et al.. OaCL. I2Q:946 (1977) ^ Hsiao 
etal.. Prop, N^M,Ar.d Sri n 'S ^) . 76:3829 (1979). However. o±er me±ods for imroducmg DNA ,«o cells such 
as by nuclear microinjection, elec.ropora.ion. bacterial protoplast fusion wiUi mtact cells, or polycations eg 
polybrene. polyomithine. may also be used. For various techniques for transforming mammalian cells, see Keown 
ct al.. MgtflftdsinF,nrYm9lnfY . iM:527-537 (1990) and Mansour et al.. MailllS. 226:348-352 (1988). 

Suitable host cells for cloning or expressmg the DNA in die vectors herein include prokaryo.e yeast or 
higher eukaryote cells. Suitable prokaryotes i«:lude but are no. limited to eubacteria. such as Gram-negative' or 
Gram-positive organisms, for example. Emerobacteriaceae such as E. coti. Various £. coU strains are publicly 
avadable. such as £. cdi KI2 strain MM294 (ATCC 31.446); E. coU X1776 (ATCC 31.537); £. coU strain W3l 10 
(ATCC 27.325) and K5 772 (ATCC 53.635). Odier suitable prokatyotic host cells include Enterobactcriaceae such 
as Eschenc,ua, e.g. . £. coU, Enterobaaer, En.in,a. Klebsiella, Prceus. Salmonella, e.g. . Salmonella typhimurim 
Serrana, e.g., Serratia marcescar^. and Shigella, as well as Bacilli such as B. subtilis and B. lichenifomis (e g B 
licheniformis 4IP disclosed in DD 266.710 published 12 April 1989). Pseudomnas such as P. aeruginosa, ««J 
Sireptomyces. Various £. coU strains are publicly available, such as £. coli K12 strain MM294 (ATCC 31 446)- £ 
coll XI776 (ATCC 31.537); £. coli strain W31 10 (ATCC 27.325); a^l K5 772 (ATCC 53.635). TTiese examples 
are illustraUve rather Uian limiting. Strain W31 10 is one particularly preferred host or parent host because it .s a 
common host strain for rccombinam DNA produa fermentauons. Preferably, dic host cell secretes minimal amount 
of proteolytic enzymes. For example, strain W3I10 may be modified to effect a genetic mutation in die genes 
encodu^B proteins endogenous to the host, wid, examples of such hosts including E. coti W31 10 strain 1A2 which 
has die complete ge«,type ,onA ; E. coti W31 10 strain 9E4, which has d,e complete genotype ,onA ptrS- E coli 
W31 10 strain 27C7 (ATCC 55.244). which has the complete genotype lonA p,r3 phoA E15 (argF-lacm degP 
o^rW;£.«ffW3110strain37D6.whichhas the complete genotype l^«^prrJ;,A.^£/5 (argF^lacHeP degP 
ompT rbs7 ilvC W; £. coU W31 10 saain 40B4, which is strain 37D6 with a non-kanamycin resistant degP deletion 
mutation: and an£. coli strain having muuuii periplasmic protease disclosed b U.S. Patent No. 4.946.783 issued 7 
August 1990. Altemadvely. ,„ W/ro mediods of cloning, e.g.. PCR or other nucleic acid polymerase reacdons. are 
suitable 



In addition to prokaryotes. eukaryo.ic microbes such as filamentous fUngi or yeast are suiuble cloning or 
expression hosts for PRO polypeptide-encoding vectors. Saccharomyces cerevisiae is a commonly used lower 
eukaryotic host microorganism. Others include Schizosaccharomyces pombe (Beach and Nurse. Jeanne. 22Q: 140 
[19811; EP 139.383 published 2 May 1985): Kluyveromyces hosts (U.S. Patent No. 4.943.529; Fleer et al 
BiaajKhnalfigx. S-. 968-975 (1991)) such as. e.g.. K. lactis (MW98-8C. CBS683, CBS4574: Louvcncoun et al L 
fiaCKlifiL. 737 ri983D. K. fragilis (ATCC 12.424). K. bulgancus (ATCC 16.045). K. mckeramii (ATCC 24 178) 
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K. waini (ATCC 56.500). K. drosoplulamm (ATCC 36.906; Van den Berg el al., Pio/TcclmPK?EY . 8: 135 (1990)). 
K. thermowlerans. and K. marxianus: yarrowia (EP 402.226): Pictuapaswrn (EP 183.070; Srtekrishna « aA. L 
R;.^ jr. Micmbioi. . 28: 265-278 11988)): Candida: Trichoderma reesia (EP 244.234); Neurospora crassa (Case et al.. 
pp. Arnri Sci.USA . 7fi: 5259-5263 [19791): Schwamiomyces such as 5c/iwfl/m/om'c« occ/rfenw/« (EP 
394.538 published 31 October 1990); and filamemous fungi such as. e.g.. Neurospora. PeniciUium. Tolypocladium 
5 (WO 91/00357 published 10 January 1991). and Aspergillus hosts such as A nidulans (Ballance « al. BifichSllL 
p i ^K„c r^nrnm. . il2: 284-289 [19831: Tilbum « fl/. . SsnS. 2fi: 205-221 (19831. Yelton « .A . Pzoc^ 
^ rpH Sri USA . SI: 1470-1474 (19841) and A. niger (Kelly and Hynes. F.MPO J. . 4: 475-479 [1985|). 
Methylotropic yeasts are suitable herein and include, but are not limited to. yeast capable of growth on methanol 
selected from the genera consisting ciHansemla. Candida, Kloeckera. Pichia. Saccharomyces. Tondopsis. and 
10 Rhodotomla. A list of specific spec.es that are excn^lary of diis class of yeasts may be found in C. Anthony, m 
Piff^ -hgmi-itrv of MethvlotrophS. 269 (1982). 

Suiuble host cells for the expresston of glycosylated PRO polypeptides arc derived from muliiccUular 
organisms. Examples of invenebrate cells include insect cells stich as Drosophila S2 and Spodoptcra Sf9. as well 
as plant cells. Examples of useful mammalian host cell lines include Chinese hamster ovaiy (CHO) and COS cells 
,5 More specific examples include monkey kidney CVl line transformed by SV40 (COS-7. ATCC CRL 1651): human 
embryonic kidney line (293 or 293 cells subcloned for growth in suspension culture. Graham « al.. J.QgnVirol. . 
2fi:59 (1^); Chinese hamster ovary ceUs/-DHFR (CHO. Urlaub and Chasm. Pfor Nml Aw4, S^l, U$A . n:42l6 
(1980)); mouse Sertoli ceils (TM4. Mather. fiiaLBSBBlsL. 22:243-251 (1980)); human lung cells (W138. ATCC CCL 
75): human liver cells (Hep 02. HB 8065); and mouse mammary mmor (MMT 060562. ATCC CCL51). The 
20 selection of the appropriate host cell is deemed to be within the skill in the art. 

C. selection and H'ii' nf a Replicable Vector 
nucleic acid {e.g.. cDNA or genomic DNA) encoding a desired PRO polypeptide may be inserted into 
a replicable vector for cloning (amplification of the DNA) or for expression. Various vectors are publicly available. 
25 TTic vector may. for example, be in the form of a plasmid. cosmid. viral particle, or phage. TTie appropriate nucleic 
acid sequence may be inserted into the vector by a variety of procedures. In general. DNA is inserted into an 
appropriate restriction endonuclease site(s) using techniques known in the art. Vector components generally include, 
but are not limited to. one or more of a signal sequence, an origin of replication, one or more marker genes, an 
enhancer element, a promoter, and a transcription termination sequence Construction of suiuble vectors containing 
30 one or more of these components employs standard ligation techniques which arc known to the skilled artisan. 

The PRO polypeptide of interest may be produced recombinanily not only direcUy. bui also as a fusion 
polypeptide with a heterologous polypeptide, which may be a signal sequence or other polypeptide having a specific 
cleavage site at the N-terminus of the mamre protein or polypepude. In general, the signal sequence may be a 
component of the vector, or i, may be a part of the PRO polypepude DNA that is inserted into the vector. THe signal 
35 sequence may be a prokaryotic signal sequence selected, for example, from the group of the alkaline phosphatase, 
penicillinase. Ipp. or heat-stable enterotoxin 11 leaders. For yeast secretion the signal sequence may be, e.g.. the 
yeast invertasc leader, alpha factor leader (including Saccharomyces and Kluweromyces a-factor leaders, the laner 
described in U.S. Paiem No. 5.0I0.I82), or acid phosphaosc leader, the C. albicans glucoamylase leader (EP 
362.179 published 4 April 1990). or the signal described in WO 90/13646 publbhed 15 November 1990. In 
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mammalian cell expression, mammalian signal sequences may be used to direci secretion of the protc.n. such as signal 
sequences from secreted polypeptides of the same or related species, as well as viral sccretoi>' leaders. 

Both expression and cloning vectors contain a nucleic acid sequence that enables the vector to replicate m 
one or more selected host cells. Such sequences are well known for a variety of bacteria, yeast, and vintses The 
odginof repUcauon from the plasmid pBR322 ,s suiuble for most Gram-negative bacteria, the 1, plasmtd origin is 
suitable for yeast, and vanous vinU ongins (SV40. polyoma, adenovirus. VSV or BPV) are useful for cloning vectors 
in mammalian ceils. 

Expressmn and cloning vectors will typically contain a selecuon gene, also termed a selectable marker 
Typical selection genes encode proteins that (a) confer resistance to amibioiics or other toxms. e.g.. amplcillin. 
neomycin, methotrexate, or tetracycline, (b) complement auxotrophic deficiencies, or (c) supply critical nutncnts nol 
available from complex media, e.g.. the gene encoding D-alanine racemase for BacilU. 

An example of suitable selectable markers for mammalian cells are those that enable the idemificadon of 
cells competent to take up the PRO polypeptide nucleic acid, such as DHFR or thymidine kinase An appropriate 
host cell when wild-type DHFR is employed is the CHO cell line deficiem in DHFR activit)-. prepared and 
propagated as described by Urlaub et al.. Ptqc, NPtI Ars^l ,^1 HS^. 22.4216 (1980). A suitable selection gene 
for use in yeast is the trp\ gene presem in the yeast plasmid YRp7 fStinchcomb et al.. tiaaicfi. 252:39 (1979) 
Kingsman et al.. fifim. 7:141 (1979); Tschemper e. al.. fisnt. IflM57 (1980)). The trpX gene p,.,vides a selection 
marker for a mutant saain of yeast lacking the ability to grow in tryptophan, for example. ATCC No. 44076 or PEP4- 
1 (Jones, Genetics , 35:12 (1977)]. 

Expression and cloning vectors usually contain a promoter operably linked to the PRO polypeptide nucleic 
acid sequence to direct mRNA synthesis. Promoters recognized by a variety of potential host ceUs are well known. 
Promoters suitable for use with prokaryotic hosts include the (J-lactamase and lactose promoter systems [Chang et 
al.. mm, meiS Hmy, Goeddel et al.. Naiure. 2M:544 (1979)). alkaline phosphatase, a tryptophan (trp) 
promoter system [Goeddel. Nwleic Aci>l,^ Rn . 2:4057 (1980); EP 36.776J, and hybrid promoters such as the tac 
promoter [deBoer et al.. Prw . N a tl , Awd .Sf j V<y^ , 8Q:21-25 (1983)]. Promoters for use in bacterial systems also 
will contain a Shine-Dalgamo (S.D.) sequence operably linked to the DNA encoding the desired PRO polypeptide. 

Examples of suitable promoting sequences for use with yeast hosts include the promoters for 3- 
phosphoglyccrate kinase [Hiizemanet al.. f . B i P l , Qimi ,. 255:2073 (1980)] or other glycolytic enzymes [Hess et al.. 
J. My,fm Rrg .. 2:149 (1968): Holland. Biflchanisax. 12:4900 (1978)]. such as enolase. glyceraldchydc-^ 
phosphate dehydrogenase, hexokinase. pyruvate decaitoxylase. phosphofiuctokinase. glucosc.6-phosphate isomeiase. 
3i*osphoglycerate mutase. pyruvate kinase, triosephosphate tsomerase. phosphoglucose isomerase. and glucokinase. 

Other yeast promoters, which are inducible promoters having die additional advantage of transcription 
controlled by growd, conditions, are the promoter regions for alcohol dehydrogenase 2. isocytochromc C. acid 
phosphatase, degradadvc enzymes associated widi nitrogen metabolism, metallothionein. glyceraldehyde-S-pho^phate 
dehydrogenase, and enzymes responsible for maltose and galactose utilization. Suitable vectors and promoters for 
35 use in yeast expression are fiinher described in EP 73.657. 

PRO polypeptide transcription from vectors in mammalian host cells is controlled, for example, by 
promoters obuiined from the genomes of viruses such as polyoma virus, fowlpox virus (UK 2.2 1 1 .504 published 5 
July 1989). adenovirus (such as Adenovirus 2). bovine papiUoma vinu. avian sarcoma virus, cytomegalovirus, a 
retrovirus. hepautis-B vims and Simian Virus 40 (SV40). from heterologous mammalian promoters, e.g.. the acdn 
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promoter or an iiranunoglobulin promoter, and from heat-shock promoters, provided such promoters are companble 
with the host cell systems. 

Transcription of a DNA encoding the desired PRO polypeptide by higher cukaryoies may be increased by 
inserting an enhancer sequence into the vector. Enhancers are cis-acttng elements of DNA. usually about from 10 
to 300 bp. that act on a promoter to mcreasc its transcription. Many enhancer sequences are now known from 

5 mammalian genes (globin. elastase. albumin, a-fetoprotein. and insulin). Typtcally. however, one will use an 
enhancer fiom a eukaiyodc ceU virus. Examples include the SV40 enhancer on the late side of the replication origin 
(bp 100-270). the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the replication 
origin, and adenovirus enhancers. TTie enhancer may be spliced into the vector at a position 5' or 3' to the PRO 
polypeptide coding sequence, but is preferably located at a site 5' from the promoter. 

10 Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human, or nucleated 

cells from o^er multicellular organisms) will also contain sequences necessary for the termination of transcription 
and for stabilizing the mRNA. Such sequences are commonly avaUable from the 5' and. occasionally 3' . untranslated 
regions of eukaryotic or viral DNAs or cDNAs. These regions contain nucleotide segments transcribed as 
polyadenylated fragments in the untranslated portion of the mRNA encodmg PRO polypeptides. 

15 SiiU other methods, vectors, and host cells suitable for adaptation to the synthesU of PRO polypeptides in 

recombinant vertebrate cell culture are described in Gething et al.. mm. 222:620-(i25 (1981): Mantei et al.. 
I4ajui£. 221:40-46 (1979); EP 117.060: and EP 117.058. 

D. nptx-ting r.eii > Amnlineation/ExDressiffiD 
20 Gene amplification and/or expression may be measured in a sample directly, for example, by convcmional 

Southern blotting. Northern blotting to quantitate the transcription of mRNA ITT^omas. Pmr Nail A«d. Sti . USA . 
22-5201-5205 (1980)]. dot bloning (DNA analysis), or in situ hybridiiatioa using an appropriately labeled probe, 
based on the sequences provided herein. Alternatively, antibodies may be employed that can recognize specific 
duplexes, including DNA duplexes. RNA duplexes, and DNA-RNA hybrid duplexes or DNA-protein duplexes. Tbt 
25 antibodies in mm may be labeled and the assay may be carried out where the duplex is bound to a surface, so that 
upon the formation of duplex on ti« surface, die presence of antibody bound to the duplex can be detected. 

Gene expression, alternatively, may be measured by immunological methods, such as immunohistochemical 
staining of cells or tissue sections and assay of ceU culture or body fluids, to quantitate direcUy the expression of gene 
product. Antibodies vscM for immunohistochemical staining and/or assay of sample fluids may be eitiier monoclonal 
30 or polyclonal, and may be prepared in any mammal. Conveniently, the antibodies may be prepared against a native 
sequence PRO polypeptide or against a synti,etic peptide based on die DNA sequences provided herein or against 
exogenous sequence fused to a PRO polypeptide DNA and encoding a specific antibody epitope. 



35 



FoTOB Of PRO polypeptides may be recovered from cutane medium or from host cell lysaies. If membrane- 
bound. U can be released from the membrane using a suitable deiergem solution {e.g. Triton-X 100) or by enzymatic 
cleavage. Cells employed in expression of PRO polypeptides can be disrupted by various physical or chemical 
means, such as freezc-tiiaw cycling, sonicauon. mechanical disruption, or cell lysing agents. 

I, may be desired to purify PRO polypeptides from recombinant ceU proteins or polypeptides. The followmg 
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or small molecule inhibitors or agoms.s of ihc binding imeraciion. Screening assays can be designed to find lead 
compounds that munic Ok biological acnv^ of a native PRO polypepude or a hgaml for Uie PRO polypeptide. Such 
screening assays will include assays amenable to high-throughput screening of chemical libraries, making them 
particularly suitable for idenufying small molecule dmg candidates. Small molecules contemplated include synthetic 
organic or inorganic compounds. The assays can be performed in a variety of formats, including protcm-protein 
5 binding assays, biochemical screening assays, mimunoassays and cell based assays, which are well characterized m 
the art. 

Nucleic acids which encode a PRO polypeptide or its modified forms can also be used to generate either 
transgemc animals or "knock ouf animals which, in mm. are useful in the development and screemng of 
therapeutically useful reagents. A transgenic ammal (e.g.. a mouse or ra.) is an ammal havmg cells Uiat conuin a 
10 transgene. which transgene was introduced into ihc animal or an ancestor of the animal at a prenatal, e.g.. an 
embryonic stage. A transgene is a DNA which is integrated into the genome of a cell from which a transgenic animal 
develops, m one embodiment. cDNA encoding a PRO polypeptide of interest can be used to clone genomic DNA 
encoding the PRO polypepddc in accordance w.Ui established techniques and die genomic sequences used to generate 
transgenic animals that conuin cells which express DNA encoding the PRO polypeptide. Methods for generating 
15 transgenic animals, particularly animals such as mice or rats, have become conventional in the an and are described, 
for example, in U.S. Patent Nos. 4.736.866 and 4.870.009. Typically, parucular cells would be targeted for PRO 
polypeptide transgene incorporadon with tissue-speciftc enhancers. Transgenic animals Oia. include a copy of a 
tnuisgene encoding a PRO polypeptide introduced into the germ line of the animal at an embryonic stage can be used 
to examine the effect of increased expression of DNA encoding the PRO polypeptide. Such animals can be used as 
.0 tester ammals for reagents ihought to confer protection from, for example, pathological conditions associated with 
its overexpression. In accordance wid, this facet of the invention, an animal is treated with U,e reagent and a reduced 
incidence of the pathological condition, compared to untreated animals bearing the transgene. would indicate a 
potenual therapeutic intervention for the pathological condition. 

Alternatively, non-human homologues of PRO polypeptides can be used to construct a PRO polypeptide 
25 "knock out" animal which has a defective or altered gene encoding the PRO polypcptKle of interest as a result of 
homologous recombination between the endogenous gene encoding the PRO polypeptide and altered genomic DNA 
encodmg Ok PRO polypeptide introduced into an embryonic ceU of the animal. For example . cDN A encoding a PRO 
polypepude can be used to clone genomic DNA encoding the PRO polypepude in accordance with established 
techniques. A portion of the genomic DNA encodmg a PRO polypeptide can be deleted or replaced with anodter 
30 gene such as a gene encoding a selecuble marker which can be used to momtor mtegrauon. Typically, several 
kilobases of unaltered flanking DNA (both at the 5' and 3' ends) are included in die vector |see e.g.. Thomas and 
Capecchi. CfiU. 51:303 (1987) for a description of homologous recombination vectors! The vector is introduced into 
an embryonic stem cell line (e.g.. by electroporation) and cells in which the introduced DNA has homologously 
recombined with d« endogenous DNA are selected |see e.g.. Li et al.. £dl. 62:915 (1992)). The selected ceUs are 
35 dien injected into a blastocyst of an animal (e.g.. a mouse or rat) to form aggregation chimeras fsee e.g.. Bradley, 
in Termcarcinomcs and Embryonic Stem Cells: A Praaical Approach. E. J. Robertson, ed. ORL. Oxford. 1987). 
pp. 113-1521. Achimericembryocarthenbe implanted intoasuitablcpseudopregnant female foster anim^ 
embm brought to term to acate a "knock ou," animal. Progeny harboring die homologously recombined DNA in 
OKir germ cells can be identified by sumdard techniques and used to breed animals m which all cells of the animal 
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conuin *e homologous.y rccombincd DNA. K«H:kou. admals can be charac«Sffor imtance. for thc.r ab.,.. 
.odefe^agai„s.c«uinpaaK,,og.ca.co™,U^^^ 
of the PRO polypeptide. 

When m v,.. admiiustrauon of a PRO polypeptide .s employed, nonml dosage a^oums may vary from 
abou. 10 „g/Rg ,0 up ,0 100 mg/kg of mammal body weigh, or more per day. preferably about 1 ,g/kg/dav to .0 
> n^/kg/day. depending upon the route of administration. Guidance as to particular dosages ^ methods of delivery 
« provded m the U.ran.e; see. for example. U.S. Pat. Nos. 4.657.760: 5.206.344; or 5.225.212. ,t is anticipated 
c at different formulaUons wil, be effective for different treatmem compounds ^ different disorders that 
admrnastration targeting one organ or tissue, for ex«,ple. may necess.tate delivery in a manner different from that 
to another organ or tissue. 

Where sustained-release administration of a PRO polypept.de is desired in a formulation with release 
characterisdcs suitable for the treatment of any disease or disorder requtring administration of the PRO polypeptide 
microencapsulation of the PRO polypeptide is contemplated. Microencapsulation of recombinant proteins fo^ 
sustatned release has been successfully perforn«d with human growth hormone (rhOH). interferon- (rhlFN- ) 
interleukin-2, and MN rgpI20. Johnson et al. . fiaUdfisL 2" 795-799 noo^v y«,.h, n „T^ 
223 (1993): Hora « al., fiMidi«te. fi: 755-758 (1990): Cleland. "Design axKl Production of Single 
^don vaccines Us^ Poly^de Polyglycolide Microsphere Sys^n..- in ya..u^i2,.|g^l,,,,^ 
AtoLtoh. PoweU and Ne^. eds. (Plenum Press: New York. 1995). pp. 43<M62: WO 97/03692 WO 
96/40072. WO 96/07399: and U.S Pat. No. 5.654.010. 

The susuined-release formulations of mese proteins were developed using po.y-lacdc<og,ycolic acid 
(PWA) polymer due to its biocompatibility and wide range of biodegradable properties. The degradation products 
of PLGA. lacdc and glycolic acids, can be cleared quickly within d,e human body. Moreover, the degradability of 
U..S polymer can be adjusted from mond.s to years depending on its molecular weight and composition Uwis 
-Controlled release of bioactive agents from lactide/glycolide polymer." in: M. Chasin and R. Langer (Eds )' 
PlWtewajabh; Polymffl PniP rviivrrv (Marcel Dekker: New York. 1990). pp. 1-41. 

Forexample.foraformula.ion,hatcanprovideadosingofapproximately80 g/kg/day m mammals with 
a maximum body weight of 85 kg. the largest dosing wouM be approximately 6.8 mg of the PRO polypeptide per day 
In or^r to achieve this dosing level, a sustained- release fonnulation which conutins a nuximum possible protein 
toadmg (15-205. w/w PRO polypeptide) widt the lowest possible initial burst «20%) is necessary. A continuous 
(zero-order) release of the PRO polypeptide from micropanicles for 1-2 weeks is also desirable. I„ addition the 
encapsulated protein to be released should maintain iu integrity and stability over .he desired release period ' 

PR0213 polypeptides and portions hereof which possess the ability to regulate the growth induction cascade 
and/or the blood coagulation cascade may also be employed for such purposes bod, /„ v/^ therapy and in vara 
Those of ordinary skill in the an will well know how to employ PR0213 polypeptides for such uses 

PR0274 polypeptides and portions d,ereof which have homology to 7TM protein and Fn54 may also be 
useful for in wV. therapeutic purposes, as well as for various od,er applications. The identification of novel 7TM 
protem and Fn54.1ike molecules may have relevance to a number of human disorders which involve recogmtion of 
Lgands and U« subsequent sig«a transduction of information contained within those Ugands in order to control 
ceUular processes. Thus, the idenuficadon of new 7TM protein a^l Fn54-like molecules is of special in,portance 
mthatsuchptotei^mayserveaspotendal therapeutics foravariety of dtffer^^^ Such polypeptides 
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may also play miponant roles in biotechnolog.cal and medical research as well as m various .ndusirial applicauons 
As a result, there is particular scientific and medical interest in new molecules, such as PR0274. 

PRO300 polypeptides and ponions thereof which have homolog)- to Diff 33 may also be useful for m vivo 
dKrapeutic purposes, as weU as for various other applicanons. Ute identification of novel Diff 33-like molecules may 
have relevance to a number of human disorders such as the physiology of cancer. Thus, the identificauon of new 
5 Diff 33-lite molecules is of special importance in that such proteins may serve as potential therapeutics for a variety- 
of different human disorders. Such polypeptides may also play important roles m b.otechnological and medical 
research as well as various industrial applications As a result, there is particular scientific and medical interest in 

new molecules, such as PRO300. 

PR0296 polypeptides of the present invention which possess biological activity related to that of the 
10 sarcoma-amplified SAS protein may be employed both ,n v.vo for therapeutic purposes and m vuro. ntosc of 
ordinary skill in the art will well know how ,o employ the PR0296 polypeptides of the present invention for such 

purposes. 

PR0329 polypeptides of the present invention which possess biological activity related to that of 
immunoglobulin F, receptor protein or subunit thereof may be employed both in wvo for therapeutic purposes and 
15 in mro. Those of ordinary skill m the art will well know how ,o employ the PR0329 polypeptides of the presem 

invention for such purposes. 

PR0362 polypeptides of die present invention which possess biological activity related to that of the A33 
antigen protem. HCAR protein or the NiCAM related cell adhesion molecule may be employed both in vivo for 

therapeutic purposes and in vitro, 
20 PR0363 polypeptides of the presem invemion which possess biological activity related to that of the cell 

surface HCAR protein may be employed both in vivo for therapeutic purposes and ,n vara. Tliose of ordinary skill 
in ,he art will well know how to employ the PR0363 polypeptides of the presem invendon for such purposes. 
Specifically, extracellular domains derived from d« PR0363 polypeptides may be employed therapeudcally in vivo 
for lessening the effects of viral infection. 
25 PR0868 pdypepodes of the presem invention which possess biological activity related to that of the mmor 

xerosis factor protein may be employed both in v,vo for therapeutic purposes and in v,tro. Those of ordinary skill 
in the an will wcU know how to employ the PR0868 polypeptides of the present invention for such purposes. 

PR0382 polypeptides of the presem invention which possess biological acnvity related to that of the serine 
protease proteins may be employed both ,n vivo for therapeutic purposes and in vnro. Those of ordinary skill in the 
30 art will well know how to employ the PR0382 polypeptides of the presem invention for such purposes. 

PR0545 polypeptides and portions thereof which have homology to meltrm may also be useful for in vivo 
dierapeunc purposes, as well as for various other applicauons. The idemification of novel molecules associated with 
cellular adhesion may be relevam to a number of human disorders. Given that the meltrin proteins may play an 
important role in a number of disease processes, the identification of new meltrin proteins and meltrin-like molecules 
35 I, of special importance in that such proteins may serve as potential therapeutics for a variety of differem human 
disorders. Such polypeptides may also play important roles in biotechnological and medical research, as weU as 
various mdustrial applications. As a result, there is particular sciemif.c and medical interest m new molecules, such 
as PR0545. 
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PR0617 polypeptides of a« present invention which possess biological activity related ,o that of the CD24 
protein may be employed both ,n „«, for therapeutic purposes and .n .no. T^^ of ordinary skill in the an w.ll 
well know how to employ the PR0617 polypeptides of .he present invennon for such purposes. 

PRO700 polypeptides and ponions thereof which have homology to protein disulfide isomerase may also 
be useM for in .i.o therapeunc pun»ses. . weU as for various other applications. The identificanon of novel 
protein disulfide isomerases and related molecules may be relevant to a number of human disorders. Given that 
formation of disulfide bonds and protein folding p,ay imporum roles in a number of biological processes the 
.dent.fica.ion of new pro.dn disulfide isomerases and protein disulfide «omerase-like molecules is of sp«:i„ 
unpcrunce in that such proteins may serve as potential therapeutics for a variety of differem hun»„ disorders Such 
polypepudes may also play important roles in biotechnological and medical research, as well as various industrial 
apphcauons. As a result, there .s particular scientific and medical interest .n new molecules, such as PRO700 - 

PRO702 polypeptides of the present invention which possess biological activity related to du.. of the 
conglutinin protein may be employed bod, in vivo for d^erapeutic purposes and in ntro. Those of ordinary skill in 
d« an wiD well know how to employ d« PRO702 polypeptides of the present invention for such purposes. PRO702 
polypeptides having conglutinin activny would be expected to be capable of inhibiting haemagglutinin activity by 
.nfluenza viruses and/or function as tmmunoglobuUn-independem defense molecules as a result of a complement- 
mediated mechanism. 

PRO703 polypeptides of the present invention which possess biological activity related to flutt of Ae VLCAS 
protem may be employed both in Ww, for therapeutic purposes and in vitro. TTtose of ordinary skill in the art will 
weU know how to employ d,e PRO703 polypeptides of 4e presem invention for such purposes. 

PRO703 polypeptides and portions Aereof which have homology to VLCAS may also be usefiil for in vivo 
therapeutic purposes, as well as for various otiter applications. Tlie identification of novel VLCAS proteins and 
related molecules may be relevant to a number of human disorders. TT^us. d,e .dentification of new VLCAS proteins 
and VITAS protein-like molecules is of special importance in d,at such proteins may serve as potential therapeutics 
for a vadety of diflerem human disorders. Such polypeptides may also play important roles in biotechnological and 
medical research as weU as various mdustrial applications. As a result, diere is panicular scientific and medical 
interest in new molecules, such as PRO703. 

PR07Q5 polypeptides of tiie presem invention which possess biological activity related to that of the K- 
glypK:an protein be employed both in vivo for therapeutic purposes and in vnro. H.ose of ordinary skill in the 
an w.« wen know how to employ tite PRO705 polypeptides of ti« presem invention for such purposes. 

PRO708 polypeptides of die presem invention which possess biological activi^ related to that of the aryl 
sulfetase proteins may be emptoyed bod, in vivo for therapeutic purposes and in vttro. nose of ordinary skill in the 
art W.11 well know how to employ d,e PRO708 polypeptides of ti« presem invention for such purposes. 

PRO320 polypeptides of d,e presem invention which possess biotog,cal aaivity related to that of ti,e fibulin 
protem may be employed both ,n vivo for dterapeutic purposes ami in vino. TT,ose of ordinary skill in ti,e an will 
well know how to employ d,e PRO320 polypeptides of the presem invention for such purposes. 

PRO320 polypeptides and portions thereof which have homology to fibulin may also be useful for ,n vivo 
fterapeutic purposes, as weU as for various od«r applications. TT,e identification of novel fibulin proteins ami related 
nu,leeule, may be relevam to a number of human disorders such as cancer or ti,ose involving connective tissue 
atuchmem molecules am. related mechanisms. TT,us. d,e identification of new fibulin protein ami fibulin 
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pro«in.Uke molecules is oJ^ecial imponance in U,a. such pro.e.ns n>ay serve as po.enual U^erapeuucs for a varie,, 
of different human disorders. Such polypeptides may also play important roles in biotechnological and medial 
research as well as various industrial applications. As a result, there ts particular scientific and medical mteres, m 

new molecules, such as PRO320. 

PR0324 polypeptides of 0« present invention which possess biological acdvity related to that of 
5 oxidortductases may be employed both in vivo for therapeutic purposes and in vara. Tl.ose of ordinary slcill in the 
an will well know how to employ the PR0324 polypeptides of the present invention for such purposes. 

PR0351 polypeptides of the present invention which possess biological activity related to that of the 
prostasm protein may be employed both in vivo for therapeutic purposes and in vino. Those of ordinary skill in the 
art will well know how to employ the PR0351 polypeptides of the present invention for such purposes. 
10 PR0351 polypeptides and portions thereof which have homology to prostasin may also be usefiil for m wvo 

therapeutic purposes, as well as for various other applications, lite identification of novelprostasm proteins and 
related molecules may be relevant to a number of human disorders Thus, the identification of new prostasin protctns 
and prostasm -like molecules is of special importance in that such proteins may serve as potential therapcut.cs for 
a variety of diffe^m human dtsorders. Such polypeptides may also play mtportan, roles in biotechnological and 
15 medical research as well as various industrial apphcattonr As a result, there is particular scentiflc and medtcal 

interest in new molecules, such as PR0351. 

PR0352 polypeptides of tite presem invention which possess biological activity related to that of the 
butyn^Uin protein may be employed both in vivo for therapeutic purposes and in vitro. Tltose of ordinary skill in 
U« art will well know how to employ the PR0352 polypeptides of the present invention for such purposes. 

20 PR0381 polypeptides of the present invention which possess biological activity related to that of one or more 

of the FKPB immunophilin proteins may be employed boU, in vivo for ti«rapeutic purposes and in vitro, for example 
for enhancing immunosuppressant activity ^or for axonal regeneration. Those of ordinary skill in the art wiU well 
know how to employ the PR0381 polypeptides of the present invention for such purposes. 

PR0386 polypeptides of the presem invention which possess biological activity related to that of the bett.2 

25 subuni. of a sodium channel expressed in mammalian cells may be employed boti. /„ v.V. for therapeutic purposes 
and .« of ordinary skill in the an WiU weU know how to employ the PR0386 polypeptides of the presem 

invention for such purposes. 

PRO540 polypeptides of the present invention which possess biological activity related to tiiat of the LCAT 
protein may be employed both in vivo for therapeutic purposes and in vnro. Tltose of ordinary skill in *c an will 
30 well know how to employ the PRO540 polypeptides of the present invention for such purposes. 

PR0615 polypeptides of the presem mveniion which possess biological activity related to that of the 
synaptogyrin pro^in may be employed both in vivo for ti^rapeutic purposes and in vino. Those of ordinary skiU in 
the an will well know how to employ the PR0615 polypeptides of the presem invemion for such purposes. 

PR0615 polypeptides and ponions thereof which have homology to synaptogyrin may also be useful for m 
35 vivo therapeutic purposes, as weU as for various other applications, lite identification of novel synaptogyrin proteins 
and related molecules may be relevant toa«»nber of human disorders. Titus, the identification of new synaptogyrm 
proteins and synaptogyrin-like molecules is of special imponance in that such proteins may serve as potenual 
therapeutics for a variety of different human disorders. Such polypeptides may also play imponant roles m 
biotechnological and medical research as well as various industrial applications. As a result, titere ,s pan^ular 
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scientific and medical interest in new molecules, such as PR0615. 

PR06i8 polypepudes of U,e presem invention which possess biological aciv.ty related to that of an 
enteropepttdase may be employed both ,n Wv. for therapeutic purposes and in vara Uos. of ordinary skill in the 
an will well know how ,o employ the PR06I8 polypeptides of the presem invent-on lor such pun»ses 

PR0618 polypeptides and portions thereof which have homology to enteropeptidase may also be useful for 
. m v,v. therapeutic purposes, as well as for various other applicauons. TT,e identification of novel enteropeptidase 
protems and related molecules may be relevant to a number of human disorders. Thus, the identification of new 
enteropepudase proteins and emeropeptidase-like molecules is of special importance in that stKb proteins .my «rvc 
as potendal dK:n.peutics for a vanety of different human disorders. Such polypeptides may also play tmporuuu «,les 
m btotechnological and medical research as well as various industrial applications. As a result, there is particular 
sciendfic and medical interest m new molecules, such as PR0618. 

PR0719 polypeptides of the presem invention which possess biological activity related to that of die 
lipoprotein lipase H protein may be employed boUi /n wV. for dierapeutic purposes and ,n .no. Tl^ose of ordinary 
staU in the art will well know how to employ die PR07,9 polypepudes of the presem invention for such purposes 
PR0724 polypeptides of die presem invenuon which possess biological activity related to diat of die human 
LDL receptor protein may be employed bod. in ww, for diempeutic purposes and in .no Tiose of ordinary skill 
m d« art wi« well know how to employ d. PR0724 polypeptides of die presem invention for such purposes 

PR0772 polypeptides of d« presem im,e«io„ which possess biological activity related to diat of tf« human 
A4 protein may be employed bodi in nvo for dierapeutic purposes and in vara. TTiose of ordinary skill in the art will 
well know how to employ die PR0772 polypeptides of d« presem invention for such purposes. 

PR0852 polypepudes of die presem invemion which possess biological acnvity rtlated u, dial of cenain 
inotease protein may be employed bod. in wV. for dierapeutic purposes and in .,no Those of ordim«y skillln d« 
an will well know how to employ die PR0852 polypeptides of die presem invemion for such purposes. 

PR0853 polypeptides of die presem invention which possess biological activity related to dut of die 
«duc.asepro.einmaybeemployedbod./nw«.fordierapeuticpurposesandmv»r.^ Those of ordinary skill in die 
an will weU know how to employ die PR0853 polypeptides of die presem invention for such purposes. 

PR0853 polypeptides ami ponions diereof which have homology to reductase proteins may also be useful 
for ,n Vivo d«rapeutic purposes, as weU as for various odier applications. Given duit oxygen free radicals and 
antioxidants appear to play importam roles in a number of disease processes, die identification of new reductase 
proteins ami reductase-like molecules is of special importance in dut such proteins may serve a, potemial d«rapeutics 
for a variety of differem human disorders. Such polypeptides may aUo play important roles in biotechnological and 
medical research as well as various industrial applications. As a result, diere is particular scicmific and nodical 
mtercst in new molecules, such as PR0853. 

PRO860 polypepudes of die presem invention which possess biological activity related to d«. of d« 
neurofescin protein may be employed bodi in vivo for dierapeutic purposes and .n nno. Those of ordinary skill in 
d« art will weU know how to employ die PRO860 polypeptides of die presem invemion for such purposes 

PRO860 polypeptides and portions diereof which have homology to neurofascm may also be useful for in 
v^vo dcmpeutic purposes, as well as for various odier appUcations. TT« identification of novel neurofascin proteins 
andrelatedmoleculesmayberelevanttoanumberofhumandisorderswhichinvolvecellularadhes^ TT,„, ^ 
.denaficadonof new neurofascm protems and neurofascin protein-like molecules is of special importance in that iuch 
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proteins may serve as poteniial therapeutics for a variety of differetit human disorders. Such polypeptides may also 
play iraponant rotes in btotechnotogical and medical research as well as various industrial applications. As a resuU. 
there is panicular scientific and medical interest in new molecules, such as PRO860. 

PR0846 polypeptides of the present invention which possess biological activity related to that of the 
CMRF35 protein may be employed both in vivo for therapeutic purposes and in vuro. Those of ordinary sldll in the 
5 an will wcU know how to employ the PR0846 polypeptides of the present invention for such purposes. 

PR0846 polypeptides and portions thereof which have homology lo the CMRF35 protein may also be useful 
for in vivo therapeutic purposes, as well as for various other applications. The identification of novel CMRF35 
protein and related molecules may be relevant to a number of human disorders. Thus, the identification of new 
CMRF35 protein and CMRF35 protein-like molecules is of special importance in that such proteins may serve as 
10 potential therapeut.cs for a variety of different human disorders. Such polypeptides may also play important roles 
in biotechnological and medical research as well as various industrial applications. As a result, there is particular 
scientific and medical interest in new molecules, such as PR0846. 

PR0862 polypeptides of the present invention which possess biological activity related to that of the 
lysozymc protein may be employed both in vivo for therapeutic purposes and in vitro. Those of ordinary skill in the 
15 art will well know how to employ the PR0862 polypeptides of the present invention for such purposes. 

PR0862 polypeptides and portions thereof which have homology to the lysozyme protein may also be useful 
for in vivo therapeutic purposes, as well as for various other applications. The ideraificaiion of novel lysozymc 
proteins and related molecules may be relevant to a number of human disorders. Thus, the idemificaiion of new 
lysozymes and lysozyme-like molecules is of special importance in that such proteins may serve as potential 
20 therapeutics for a variety of different human disorders. Such polypeptides may also play important roles in 
biotechnological and medical research as well as various industrial applications. As a result, there is particular 
scientific and medical interest in new molecules, such as PR0862. 

PR0864 polypeptides of the present invention which possess biological activity related to that of the Wm-4 
protein may be employed bodi in vivo for therapeutic purposes and in vuro. Those of ordinary skiU in the art will 
25 well know how to employ the PR0864 polypeptides of die present invention for such purposes. 

PR0864 polypeptides and portions thereof which have homology to the Wnt-4 protein may also be useful 
for in vivo therapeutic purposes, as weU as for various otfier applications. The identification of novel Wm-4 proteins 
and related molecules may be relevant to a number of human disorders. Thus, the identification of new Wnt^ protein 
and WM.4 proiein-like molecules is of special importance in that such proteins may serve as potential therapeutics 
30 for a variety of different human disorders. Such polypeptides may also play important roles in biotechnological and 
medical research as well as various industrial applicanons. As a result. ti«re is particular scientific and medical 
interest in new molecules, such as PR0864. 

PR0792 polypeptides of the present invention which possess biological activity related to that of the CD23 
protein may be employed both in vivo for therapeutic purposes and in vitro. Those of ordinary skiU in the art will 
35 well know how to employ die PR0792 polypeptides of the present invention for such purposes. 

PR0866 polypeptides of the present invention which possess biological activity related to that of mindin 
and/or spondin protein may be empl<)yed both in vivo for therapeutic purposes and in vitro. Those of ordinary skill 
in the art will well know how to employ the PR0866 polypeptides of the present invention for such purposes. 



161 



. <WO_9M6281A2J_> 



20 



WO 99/46281 ^ ^ PCT/US99/05028 

PR0871 polypeptides of *e present .nvemion which possess biological activity related to tha. of the 
c3..ophih„ protein fanuly n»y entployed bod, .n for d^erapeutic purposes and ,n ..,ro. Tl.ose of ordinary 
skUI u, d,e art will weU know how to entploy d,c PR087I polypeptides of dte present invenUon for such purposes 
PR0873 polypepddes of dte present tnvenUon which possess biological activity rela«.d to dut. of 
carboxylesterases n.y be entployed bod, .„ for therapeutic purposes and .„ .aro. For example, .hey be used 
m conjunction with prodrugs to convert dte prodrug to its acdve form (see Danks « aLsupra). TT.y may be used 
.0 inhibit p^asite .nfecdon (see van Pelt « supra). Methods for employ the PR0873 polypeptides of the present 
mvention for these, and oO^er purposes will be readily apparent to those of ordinary stall in d« ar, 

PR09-W polypeptides of dte present invention which possess biological activity related to .hat of d« CD33 
protem and/or OB binding protein-2 may be employed bodt ir, .i.o for therapeutic purposes and in .itrc Those of 
ordinal, sldll in the ar. wil, well know how to employ dte PRO940 polypeptides of the presem invention for «Kh 
purposes. 

PR094I polypeptides of Uk present invemton which possess biological activity related to du>. of a cadherin 
protein may be employed both in .i.o for d,e,apeu,ic purposes and /„ wr... n^ose of ordtnaty skill in d« art will 
well know how to employ U« PR0941 polypeptides of *e presem invention for such purposes. 

PR0944 polypeptides of ftcprcsem mvennon which possess biological activity related to that of die CPE-R 
protein may be employed both /„ for dterapeutic purposes and in vi,n>. Those of ordmary skiU in d« art will 
weU know how to employ the PR0944 poijpeptutes of the present invention for such purposes. PR0944 polypeptides 
of d^e presem invention dtat function to bind to Clostridium perfnngens enterowxin (CPE) may fmd use for effectively 
treating infection by dte CPE endotoxin. 

PR0983 polypeptides of d«: presem invention which possess biological activity related to tiui, of fte veslcle- 
assoda^d membrane protein. VAP.33. may be employed bod, in n.o for titerapeutic purposes and in ^uo Those 
of ordiatry skill in d« an will well know how to employ d.e PR0983 polypeptides of dte present invemion for such 
purposes. 

PR01Q57 polypeptides of dK presem invemion which possess biological activity related to ti,at of protease 
protcms may be employed bod, in Ww, for ti«rapeutic punK«es and in vi,ro. Those of ordinary skill in dte art wil, 
well know how to employ d« PR01QS7 polypeptides of d« presem invention for such purposes. 

PRO1071 polypeptides of d« presem im,emion which possess biological activity related to Urn of .he 
tim,mbospondin protein may be employed bod, in wv. for d«rapeutic punH»es and in mro. Those of ordinary skai 
u. tiie art will well know how to employ d,e PRO1071 polypeptides of d,e presem invention for such purposes. 

PRO1072 polypeptides of iht present invention which possess biological activity relaied to that of reductase 
P«.«ins may be employed bod, /„ w,V. for d,erapeutic purposes and in vino. Those of ordinary skill in the an will 
well know how to employ d,e PRO1072 polypeptides of d,e presem invention for such purposes. 

PROI075 polypeptides of d,e presem invention which possess biological activity related to d,at of protein 
d»ulfide Isomerase may be employed bod, in vivo for dterapeutic purposes and in ntro. Those of ordinary skill in 
*c an will well know how to employ d,e PRO1075 polypeptides of d,e presem invention for such purposes. 

PR0181 polypeptides of d,e presem invention which possess biological activity related to d«t of d,e 
contichon prottin may be employed bod, m Ww, for d«rapeutic purposes and .„ W.ro. Those of ordim«y skill in d,e 
an Will well km,w how to employ d« PR0181 polypeptides of d,e presem invemion for such purposes 
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PR0827 polypeptides of the present invention which possess b.ological acdv.ty related «, that of various 
irtcgrin proteins mav be employed both m mo for therapeutic purposes and in varo. ^se of ordiiury skill in the 
art will well know how .0 employ the PR0827 polypeptides of the present tnvention for such purposes. 

PROl 1 14 polypepttdes of the presem invendon which possess b.ological acdvity related to that of the 
cyu,ldne receptor family of prote.ns may be employed both in vivo for therapeutic purposes and in v.,ro. Those of 
ordinary skill in the art will well taK>w how to employ the PROl 1 14 polypeptides of the present invention for such 

purposes. . 

to addition to the above, the PR01114 interferon receptor polypeptides may be employed m apphcattons. 
both in vivo and in vitro, where the ability to bind to an interferon ligand is desired. Such applications will be well 

within the skill level in the art. 

PR0237 polypeptides of the present invendon which possess biologtcal activity related to that of the carbomc 
ari^e protein may be employed both m vivo for therapeutic purposes and in vino. Those of ordmary skill in dte 
art will well know how to employ the PR0237 polypeptides of the present invention for such purposes. 

PR0541 polypeptides of the present invention which possess biologtcal activity related to that of a trypsm 
uWbitor protein may be employed both in vivo for therapeutic purposes and « wrro. Those of ordinary skill in the 
art will well know how ,0 employ the PR054I polypeptides of the presem invention for such purposes. 

PR0273 polypeptides can be used in assays that other chemokines wouW be used in to perform comparaave 

assays. The results can be used accordingly. 

PRO701 polypeptides of the present invention which possess biological activity related to that of the 
n^Bgin family may be employed both in vivo for therapeutic purposes and ,n vino. Those of ordinary skill in the 
art will well know how ,0 employ the PRO701 polypeptides of die present invemion for such purposes. 

PRO701 can be used in assays with neurons and its activity thereon can be compared with that of neurohguis 
1 2 and 3. The results can be applied accordingly. 

PR07M polypeptides of the presem invendon which possess biological activity related to that of vesicular 
i^egral membrane proteins may be employed both in vivo for therapeutic purposes and in vino. Those of ordmary 
25 skill in the art will well know how to employ the PRO704 polypeptides of the present invention for such purposes. 

PRO704 can be used in assays with the polypeptides to which they have identity with to dettrmme the 
relative activities. The results can be applied accordingly. PRO704 can be or measured for activity to 
measure endocytosis activity and titereby used to screen for agents which effect endocytosis. 

PRO706 polypeptides of ti« present invention which possess biological activity related to that of the 
30 endogenous prostatic acid phosphatase pn^cursor may be employed both in vivo for therapeutic purposes and in .no. 
Those of ordinary skill in die an will well know how to employ the PRO706 polypeptides of die present invention 

for such purposes. , ^ 

PRO706 can be used in assays with human prostatic acid phosphatase or human lysosomal actd phosphatase 
and its activity thereon can be compared with that of human prostatic acid phosphatase or human lysosomal acid 
35 phosphatase. The results can be applied accordingly. 

PR07(y7 polypeptides of die present invention which possess biological activity reUted to that of cadhenns 
«^beemployedbo,h«W.«forthenpeuticj«rposesandi.w^^^^ 
how to employ the PRO707 polypeptides of the present invention for such purposes. 
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PRO707 can be used m assays to deternime its activ.iy in relation .o other cadheruis. paniculariy cadherin 
F1B3. The results can be applied accordingly. 

PR0322 polypeptides of the present invention which possess biological activity related to that of neuropsin 
may be employed both in wV. for therapeutic purposes and .n n.ro. n,ose of ordinary skill ,n the an win well taow 
how ,0 employ the PR0322 polypeptides of the present mveraion for such purposes. 

PR0322 can be used in assays to determine its activity relative to neuropsin. ,ryps.«,gen, serine pro«ase 
and neurosin, and the results applied accordingly. 

PR0526 polypeptides of the present mvem.on which possess biological activity related to U,at of protein- 
protein binding proteins may be employed both w v. for therapeutic purposes and in vuro. Those of ordinary skill 
m the an will well know how to employ the PR0526 polypeptides of the presem mvenuon for such purposes 

Assays can be performed with growth factors and other proteins which are known to form complexes to 
determine whether PR0526 bimls ti^ereto and whed,er d,ere is increased half-life due to such binding. TT^ results 
can be used accordingly. 

PR0531 polypeptides of the present mvenoon which possess biological activity related to that of the 
piotocadherins may be employed bod, in vivo for therapeutic purposes and in vitro. Those of ordinary skill in die 
an wiU well know how ,o employ tite PR0531 polypeptides of me present invention for such purposes. 

PR053 1 can be used in assays agains, protocadherin 3 and other protocadherins. to deiermne .heir relative 
activities. The results can be applied accordingly. 

PR0534 polypeptides of die presem invention which possess biological activity related to diai of the protein 
disulfide isomerase may be employed bod, in nvo for dierapeutic purposes and in vuro. ^se of ordinary skiU in 
*e an will well know how to employ d,e PR0534 polypeptides of d,e presem invention for such punK«es. 

PR0534 can be used in assays wiU, protein disulfide isomerase to determine die relauve activities. Tht 
results can be applied accordingly. 

PR0697 polypeptides of d« present invention which possess biological activity related to diat of the sFRP 
fenuly may be employed bod, in vivo for d«rapeutic purposes and in vitro. 'n,ose of ordinary skill in d,e an will well 
know how to employ the PR0697 polypeptides of die present invention for such purposes. 

PR0697 can be used in assays wid, sFRPs and SARPs to detennine dK relative activities. TT^ results can 
be applied accordingly. 

PR0731 polypeptides of du present invention which possess biological activity related to dm of any 
protocadherin may be employed bod, in nvo for d,erapeutic purposes and in vitro. Tbosc of ordinary skill in .he an 
W.I1 well know how to employ d,e PR0731 polypeptides of d,e presem itivention for such purposes. 

PR0731 can be used in assays wid, the polypeptides to which dtty have identity wid, to determine d,e 
relative activities. The results can be applied accordingly. 

PR0768 polypeptides of die presem invention which possess biological activity related to d,at of integrins 
nay be employed bod, in vivo for dierapeuoc purposes and in vuro. TTiose of ordinary skill in d,e an will well know 
how to employ d,e PR0768 polypeptides of d,e present invemion for such purposes. 

PR0768 can be used in assays wid, d>e polypeptides to which dwy have idemity wid, to determine die 
relative activities. The result can be applied accordingly. 

PR0771 polypeptides of die present invention which possess biological activity related to du» of d,e tesncan 
protein may be employed bod, in vivo for dterapeutic purposes and in vttro. n,ose of ordinary skill m d,e an will 
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well know how to empire PR0771 polypeptides of the present invcmion for such purposes. 

PR0771 can be used in assays with the polypeptides to which ihey have idcnuty with to detemnne the 
relative acrivines. The results can be applied accordingly. 

PR0733 polypeptides of the present mvcntion which possess biological activity related to that of the proteins 
which bind the T1/ST2 receptor may be employed both m vivo for therapeutic purposes and in vitro. Those of 
5 ordinary skill in the art will well know how to employ the PR0733 polypepodes of the prcseM invention for such 
purposes. 

PR0733 can be used in assays with the polypeptides to which they have idemity with to determine the 
relauvc activities. The results can be applied accordingly. 

PR0162 polypeptides of the present invention which possess biological activity related to that of the 
10 pancreatitis-associated protem may be employed both in vivo for therapeutic purposes and in vitro. Those of ordinary 
skill m the art will well know how lo employ the PR0162 polypeptides of the presem invention for such purposes. 

PR0162 can be used in assays with the polypeptides to which they have identity with to determine the 
relative activities. The results can be applied accordingly. 

PR0788 polypeptides of the presem invention which possess biological activity related to diat of the anti- 
15 neoplastic urinary protein may be employed both in vivo for therapeutic purposes and in vitro. Those of ordinary 
sWn in the an will well know how to employ the PR0788 polypeptides of the present invention for such purposes. 

PR0788 can be used in assays with the polypeptides to which diey have identity with to determine the 
relative activities. The results can be applied accordingly. 

PRO1008 polypeptides of the present invention which possess biological activity related to that of dkk-l may 
20 be employed both in vivo for therapeudc purposes and in vitro. Those of ordinary skill in the art will well know how 
to employ the PRO1008 polypeptides of the presem invention for such purposes. 

PRO1008 can be used in assays with the polypeptides to which they have idemity with to determine the 
relative activiUes. The results can be applied accordingly. 

PRO1012 polypepodes of the presem im^eniion which possess biological activity related to that of the protein 
25 disulfide isometase may be employed both in vivo and in vitro purposes. Those of ordinary skill in the ait will well 
know how to employ the PRO1012 polypeptides of the presem invention for such purposes. 

PR01012 can be used in assays with the polypeptides to which they have idcnuty widi to determine the 
relative activities. The results can be applied accordingly. 

PRO1014 polypeptides of the present invention which possess biological activity related to that of reductase 
30 may be employed both in vi^ for therapeutic purposes and in vitro. Those of ordinary skill in the ar, will well know 
how to employ the PRO1014 polypeptides of the presem invention for such purposes. 

PR01014 can be used in assays widt the polypeptides to which titey have idemity with to determine the 
relative activities. Inhibitors of PRO10I4 arc panicularly preferred. The results can be applied accordingly. 

PRO10I7 polypeptides of the presem invemion which possess biological activity related to that of 
35 sulfotransferasc may be employed boU, in vivo for therapeutic purposes and in vttro. Those of ordinary skill in the 
art wm weU know how to employ the PRO10I7 polypeptides of the presem invention for such purposes. 

PROlon can be used in assays with the polypeptides to which they have identity witi. to determine tiie 
relative activities. The results can be applied accordingly. 
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PR0474 polypeptides of the present mventmn which possess biological activ.tj- related to that of 
dehydrogenase may be employed both in Ww, for therapeutic purposes and .„ ynro. Uosc of ordmary skill in the 
an will well know how to employ the PR0474 polypeptides of the present invention for such purposes. 

PR0474 can be used in assays with the polypeptides to which they have identity with lo determine the 
relative activities. The results can be applied accordingly. 

PRO1031 polypeptides of the present invention which possess biological activity related to that of IL-17 may 
be employed both m for therapeutic purposes and in »,ro. TlK.se of ordi«.o, skill in the an will well know how 
to employ the PRO103I polypeptides of the present mvenoon for such purposes. 

PRO103I can be used in assays with the polypeptides to which they have identity with lo detennine the 
relative activities. The results can be applied accordingly. 

PR0938 polypeptides of the present invention which possess biological activity related to that of protem 
d^ulfide isomerase may be employed both in n.o for therapeutic purposes and in vitro. Those of ordinary skill in 
*e an will well know how to employ the PR0938 polypeptides of the present invention for such purposes. 

PRO1082 polypeptides of die present invemion which possess biological activity related to that of the LDL 
receptor may be employed both in vivo for therapeutic purposes and in vino. Those of ordinary skill in dte an will 
well know how to employ the PRO1082 polypeptides of the present invention for such purposes. 

PRO1082 can be used in assays with the polypeptides to which they have identity with «, detennine d» 
rdanve activities. The results can be applied accordingly. PROl 082 can also be used in assays to identify ca^lidate 
agents which modulate the receptors. 

PRO1083 polypeptides of the present invemion which possess biological activity related to tiutt of 7TM 
nxeptors may be employed bodi in vivo for therapeutic purposes and in vuro. Those of ordinary skill in the an wiU 
well know how to employ the PRO1083 polypeptides of the present invention for such purposes. 

m parucular PRO1083 can be used in assays to determine candidate agents which comrol or modulate 
PRO1083, i.c., have an effect on Uk receptor. 

The VEGF-E molecules herein have a number of therapeutic uses associated with survival, proliferation 
and/or difTerention of cells. Such uses include ti>e treaunent of umbilical vein endotiieiial cells, in view of die 
demonstnued ability of VEGF-E to increase survival of human umbilical vein endoU«lial cells. Tream«m may be 
needed if fte vein were subjected to traumata, or simations wherein anificial means are employed to enhance ti« 
survtval of the umbilical vein, for example, where it is weak, diseased, based on an artificial matrix, or in an 
arnficial environmem. Odter physiological conditions that could be improved based on the selective mitogcnic 
character of VEGF-E are also included herein. Uses also include d« treatmem of fibroblast and myocy«s. in view 
of tite demonstrated ability of VEGF-E to induce proliferation of fibroblasts and hypen«,phy in myocytes In 
particular. VEGF-E can be used in wound healing, tissue growth and muscle generation and regeneration. 

For the indications refened to above. ti:e VEGF-E molecule will be fonnulated and dosed m a fashion 
conststem with good medical practice taking imo account tite specific disorder to be treated, the condition of the 
^dividual patient, dte site of delivety of ti« VEGF-E. tite method of administration, and odter factors known to 
pracunoners. Thus, for purposes herein. Uie "dterapeutically effective amount" of the VEGF-E is an amount tiutt 
« effective either to prevent, lessen the wonsening of. alleviate, or cure the treated condition, in particular tiutt amoum 
whKh u suflkiem to enhance die survival, proliferation and/or differemiation of tite treated cells in vivo 
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VEGF-E amino acid variant sequences and denvadvcs ihai are immunologically crossreacnve w.ih aniibodics 
raised against nadvc VEGF arc useful in immunoassays for VEGF-E as sumdards. or. when labeled, as compeunve 
reagents. 

The VEGF-E is prepared for storage or administration by mixing VEGF-E havmg the desired degree of 
punty wiii physiologically acceptable carriers, excpients, or stabilizers. Such materials are non-toxic to recipients 

5 at the dosages and concemrarions employed. If the VEGF-E is water soluble, it may be formulated in a buffer such 
as phosphate or other organic acid salt preferably at a pH of about 7 to 8. If the VEGF-E .s only panially soluble 
in water, it may be prepared as a microemulsion by formulating it with a nonionic surfactant such as Tween. 
Pluronici. or PEG. e.g.. Twccn 80. in an amount of 0.04^.05% (w/v), lo increase us solubility. 

OptionaUy other ingredients may be added such as antioxidants, e.g.. ascorbic acid: low molecular weight 

10 (less than about ten residues) polypeptides, e.g.. polyarginine or tripeptides; proteins, such as serum albumin, gelaun. 
or immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino acids, such as glycine, glutamic acid, 
aspartic acid, or arginine; monosaccharides, disaccharides. and other carbohydrates including cellulose or its 
derivatives, glucose, mannose. or dex.rins; chelating agents such as EDTA: and sugar alcohols such as mannitol or 
sorbitol 

15 nie VEGF-E lo be used for therapeutic administration must be sterile. Sterility is readily accomplished by 

(Utradon duough sterile filtrauon membranes (e.g. . 0.2 micron membranes) lite VEGF-E ordinarily will be stored 
in lyophilizcd form or as an aqueous solution if it is highly stable to thermal and oxidative denamration. The pH of 
the VEGF-E preparations typically will be about from 6 to 8. although higher or lower pH values may also be 
appropriate in certain instances. It will be understood that use of certain of the foregoing excipients. earners, or 

20 stabilizers will r«suh in die formation of salts of the VEGF-E. 

If the VEGF-E is to be used parenterally. dierapeutic compositions conuining the VEGF-E generally are 
placed into a container having a sterile access port, for example, an intravenous solution bag or vial having a stopper 
picrceable by a hypodermic injection needle. 

Generally, where the disorder permits, one should formulate and dose die VEGF-E for site-specific delivery. 

25 This is convenient in the case of wounds and ulcers. 

Sustained release formulations may aUo be prepar«l, and include the formation of microcapsular particles and 
implamableanicles. For preparing sustained-release VEGF-E compositions, the VEGF-E is preferably incorporated 
into a biodegradable matrix or microcapsule. A suiuble material for diis purpose is a polylacUde, although other 
polymers of poly-(a-hydroxycarboxylic acids), such as poly-D-(-)-3-hydroxybutyric acid (EP 133.988A). can be used. 

30 Other biodegradable polymers include polyOactones), poly(ace.als). poly(onhoesters). or poly(onhocarbonates). Tlie 
imtial consideration here must be diat the carrier itself, or its degradation products, is nontoxic in the target tissue 
and will not further aggravate the condition. TTiis can be determined by routine screening in animal models of the 
target disorder or. if such models are unavailable, in normal animals. Numerous sciemific publications documem 
such animal models. 

35 For examples of sustained release compositions, see U.S. Patent No. 3,773.919. EP 58.481A, U.S. Patem 

No. 3.887.699, EP 158,277A. Canadian Patem No. 1176565. U. Sidman « al. Biopolymers 22, 547 (19831. and 

R Langer el aL Chem. Tech. 12, 98 119821. 

When applied topically, the VEGF-E is suitably combined with oUier ingredienu. such as carriers and/or 
adjuvants. There are no limiutions on the nature of such odier ingredients, except that they must be pharmaceuucaUy 
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aocpu^le and cfficac»us for U,cir in«ndcd adminis^don. and canno. degrade the activity of the actwe .nerediems 
of *e con^idon. Exa„,p,cs of suiu^ble vehicles include oinnnencs. crean.. gels, or suspensions, with or without 
punficd collagen. TTie compositions also nuty be impregnated into transdermal patches, plasters, and bandages 
preferably in liquid or scmi-Iiquid form. 

For Obtaining a gel formulation, the VEGF-E fonnulated in a liquid composition may be nuxed witii an 
. effective amount of a water-soluble polysaccharide or synthetic polymer such as poIyeU,ylenc glycol to form a gel 
Of the proper viscosity to be appUed topically. TT.e polysacchande that may be used includes, for example, cellulose 
derivatives such as etherified cellulose derivatives, including alky, celluloses, hydroxyallcy. celluloses and 
alkymydroxyallcyi celluloses, for example, methy.cel.ulose. hydroxyethyl cellulose, carboxymethyl cell^osc 
hydroxypropyl mcthylcellulose. and hydroxypropyl cellulose: starch and fractionated starch; agar: algimc acid and 
alginates; gum arabic; pullullan; agarose; carrageenan; dextrans: dextrins; fn,ctans; inulin; rr^. .,yu^ 
arabtnam: chitosans; glycogens; glucans; and synthetic biopolymers; as well as gums such as xanthan gum- guar 
gum; locus, bean gum; gum arabic; tragacanth gum; and karaya gum; and derivatives and mixntres thereof' Tl,e 
preferr«l gelling agent herein is one that is inert to biological systems, nontoxic, simple to prepare, and not too rumty 
or viscous, and will not destabilize the VEGF-E held within it, 

Prefentbly the polysaccharide is an eUterified cellulose derivattve. more preferably one that ts well defined 
punfied. and listed in USP. e.g., methylcelhilose and the hydtoxyalkyl cellulose derivatives, such as hydroxypropyl 
ceMose. hydroxyethyl cellulose, and hydroxypropyl me.hylce.lu.ose. Most preferred herein is medtylcelhUose 

The polyethylene glycol useM for gelling is typically a m.xmre of low and high molecubr weight 
polyethylene glycols to obtain the proper viscosity. For example, a mixn.re of a polyed>ytene g.yco. of molecular 
weigh. 400^ wim of molecular weigh. 1500 would be effective for this purpose when nuxed m the proper ratio 
to obtain a paste. 

The .erm 'water soluble" as applied to the polysaccharides and polyed.y.ene glycols is meant to include 
colkndal sohidons and dispersions. In general, the solubility of the cellulose derivatives is determined by the degree 
of suhsntudon of eU«r groups, a«i the stabilizing derivatives useM herein should have a sufficient quantity of such 
efter groups per anhyd«,gtacose unit in Ae ceUulose chain to render the derivatives water soluble. A degree of ether 
subsduitionof a.leastO.35 e^er groups per anhydroglucose unit is generally suffident. AddiUo^Uly. the ceUulose 
denvanves may be in the form of aUcali meal salts, for exampte. ibt U. Na. K. or Cs salu. 

If me^fcellulose is employed in *e gel. prefemWy it comprises about 2-5%. more preferaWy about 3% 
of the gel and the VEGF is present in an amount of about 300-1000 mg per ml of gel. 

n« dosage ,0 be employed is dependent upon the factors described above. As a general proposition the 
VEGF-E is fonnulated and delivered to the target site or tissue at a dosage capable of esublishing in the muc a 
VEGF-E level greater dun about 0.1 ng/cc up to a maximum dose that is efHcacious bu, unduly toxic TOs 
.mra-t^ue co««mration should be maintained if possible by continuous infusion, sustained ttlease, topical 
application, or injection at empirically determined frequencies. 

I. is widdn the scope hereof to combine d,e VEGF-E therapy with other novel or conventional therapies 
(e.«.. growth factors such as VEGF. aFGF, bFGF. PDGF. IGF. NGF. anabolic steroids. EGF or TGF-a) for 
e|^ing .he activity of any of U« growm factors, including VEGF-E. in promoting cell proliferation, survival 
differentiation and repair. I, is no. necessary that such co.r«tmem drugs be inctaded per se in die compositions of 
*B mvention. aldiough titis will be convenient where such drugs are proteinaceous. Such admixmres ate suitably 
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administered in the same manner and for the same purposes as the VEGF-E used alone. The useful molar ratio of 
VEGF-E to such secondary growth factors is tv-pically 1:0.1-10. with about equimolar amounts being prcfened. 

The compounds of the present invention can be formulated according to known methods to prepare 
pharmaceuucally useful compositions, whereby the PRO polypeptide hereof is combined in admixture with a 
pharmaccutically acceptable earner vehicle. Suitable carrier vehicles and their formulation, inclusive of other human 
proteins, e.g.. human serum albumin, are described, for exan^le. in Remington's Pharmacmical Sciences, 16th cd.. 
1980, Mack Publishing Co.. edited by Oslo ei al. the disclosure of which is hereby incorporated by reference. The 
VEGF-E herein may be administered paremcrally to subjects suffering from cardiovascular diseases or conditions, 
or by other methods that ensure its delivery to the bloodstream m an effective form. 

Compositions panicularly well suited for the clinical administration of VEGF-E hereof employed in the 
10 practice of the present invention include, for example, sterile aqueous solutions, or sterile hydratable powders such 
as lyophilized protein. It is generally desirable to include further in the formulation an appropriate amount of a 
pharroaceutically acceptable salt, generally in an amount sufficient to render the foraiulaiion isotorac. A pH regulator 
such as arginine base, and phosphoric acid, are also typically inchided in sufficient quantities to maintain an 
appropriate pH. generally from 5.5 to 7.5. Moreover, for improvement of shelf-lifc or stability of aqueous 
15 formulations, it may also be desirable to include further agents such as glycerol. In this manner, variant t-PA 
foimulations are rendered appropriate for parenteral administration, and. in particular, intravenous administration. 

Dosages and desired drug concertrations of pharmaceutical con^ositions of the present invention may vary 
depending on the particular use envisioned. For example, in the treatment of deep vein thrombosis or peripheral 
vascular (fisease. "bolus" doses, will typically be preferred with subsequent administrations being given to maintain 
20 an approximately constant blood level, preferably on the order of about 3 /ig/ml. 

However, for use in connection with emergency medical care facilities where infusion capability is generaUy 
not available and due to the generally critical namre of the underlying disease (e.g.. embolism, infarct), it will 
generally be desirable to provide somewhat larger initial doses, such as an intravenous bolus. 

For the various therapeutic indications referred to for the compounds hereof, the VEGF-E molecules will 
25 be formulated and dosed in a fashion consistent with good medical practice taking into account the specific disorder 
to be treated, the condition of tiie individual patient, the site of dcBvery. the method of administration and other 
factors known to practitioners in the respective art Thus, for purposes herein, the "therapeutically effective amount" 
of the VEGF-E molecules hereof is an amount tiiat is effective either to prevent, lessen the worsening of, alleviate, 
or cure the treated condition, in particular that amount which is sufficient to enhance the survival, proliferation or 
30 diftreotiation of targeted cells in vivo. In general a dosage is employed capable of establishing in the tissue tiiat is 
the target for tiie therapeutic indication being treated a level of a VEGF-E hereof greater dian about 0. 1 ng/cm' up 
to a maximum dose that is efficacious but not unduly toxic. It is contemplated that intra-tissue administration may 
be the choice for certain of dlie therapeutic indications for the compounds hereof. 

The human ToU proteins of die present invention can also be used in assays to identify other proteins or 
35 molecules involved in ToU-mediated signal transduction. For example. PR0285 and PR0286 are usefiil in identifying 
the as of yet unknovm namral ligands of human Tolls, or otiier factors that participate (direcUy or indirccUy) in die 
activation of and/or signaling tfirough a human Toll receptor, such as potential Toll receptor associated kinases. In 
addition, inhibitors of die receptor/Ugand binding interaction can be identified. Proteins involved in such binding 
interactions can also be used to screen for peptide or small molecule inhibitors or agonists of the binding interaction. 
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Screening assays can be designed lo find lead compounds diat mimic ihe biological acuvity of a native Toll 
polypeptide or a ligand for a native Toll polypept.de. Such screening assays will include assays amenable to high- 
throughput screening of chemical libraries, makmg Uiem particularly suitable for identifying small molecule drug 
candidates. Small molecules contemplated include synthetic organic or inorganic compounds. The assays can be 
performed in a variety of formats, including protem-protem binding assays, biochemical screening assays, 
immunoassays and cell based assays, which are well characterized in die an. 

In vitro assays employ a mixture of components including a Toll receptor polypeptide, which may be part 
of fusion product with another peptide or polypeptide, e.g. . a tag for detecting or anchoring, etc. TTte assay mixmres 
may further comprise (for binding assays) a naural intra- or extracellular Toll binding target (,.e. a Toll ligand. or 
another molecule known to activate and/or signal through the ToU receptor). While native binding targets may be 
used, i, is frequently preferred to use portion of such naUve binding taigets (e.g. peptides), so long as the portion 
provides binding affmity and avidity to the subject Toll protein conveniently measurable in the assay. TTk assay 
tnixmre also contains a candidate pharmacological agent. Candidate agents encompass numerous chemical classes, 
through typically diey are organic compounds, preferably small organic compounds, and are obtained from a wid^ 
variety of sources, including libraries of synthetic or natural compounds. A variety of other reagents may also be 
included m dte mixmre. such as. salts, buffers, neutral proteins, e.g. albumin, detergents, protease iriubitors. 
nuclease inhibitors, antimicrobial agents, etc. 

Inm vitro binding assays, the resultant mixture is incubated under conditions whereby, but for the presence 
of die candidate molecule, the ToU protein specifically binds the ceUular binding target, portiin or analog, with a 
Ttferencc bindi>« afBniiy. Tbt mixture components can be added in any order d»t provides for the requisite bindings 
and incubations may be performed at any temperamre which facilitates optima] binding. Incubation periods are 
likewise selected for optimal binding but also minimized to facilitate rapid high-ihroughput screening. 

After incubation, the agent-biased binding between the Toll protein and one or more binding targets is 
detected by any convenient technique. For cell-free binding type assays, a separation step is often used to separate 
bound from unbound components. Separation may be effected by precipitation (e.g. TCA precipitation, 
immunoprecipitation. etc.). immobilization (e.g on a solid substrate), etc.. followed by washing by. for example! 
membrane filtration (e.g. Whatman's P-18 ion exchange paper. Polyfdtronic s hydrophobic GFC membrane, etc.)! 
gel chromatography (e.g. gel filtration, affinity, etc.). For ToU-dependem transcription assays, binding is detected 
by a change in tite expression of a Toll-dependent reponer. 

Detection may be effected in any convenicm way. For ceU-frce binding assays, one of the components 
usually comprises or is coupled to a label. TTm label may provide for direct detection as radioactivi luminescence, 
optical or election density, etc., or indirect detection, such as, an epitope ug, an enzyme, etc. A variety of methods 
may be used to detect the label depending on the nawre of the label and otiwr assay components, e.g. through optical 
or electron density, radiative emissions, nonradiative energy transfers, etc. or indirectly detected with antibody 
conjugates, etc. 

Nucleic acid encoding die Toll polypeptides disclosed herein may also be used in gene therapy. In gene 
therapy applications, genes are introduced into cells in order to achieve in vivo synthesis of a tiierapeutically effective 
genetic product, for example for replacement of a defective gene. "Gene dierapy " includes both conventional gene 
Uierapy where a lasting effect is achieved by a single treamiem. and die administration of gene dierapeutic agents, 
which involves tfie one time or repeated administration of a therapeutically effective DNA or mRNA. Antiscasi 
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RNAs and DNAs can be used as iherapeuiic agents for blocking the expression of certain genes in vivo, h has 
already been shown that shon antisense oligonucleotides can be imported into cells where they act as inhibitors, 
despite their low intracellular concentrations caused by their restricted uptake by the cell membrane. (Zamecnik et 
al pf^r Nat!. Acad. Sci. USA S2. 4143-4146 (1986)). The oligonucleotides can be modified to enhance their 
uptake, c,g. by subsumimg their negatively charged phosphodiester groups by uncharged groups. 

5 There arc a variety of techniques available for introducing nucleic acids into viable cells. The techniques 

vary depending upon whether the nucleic acid is transferred into cultured cells m vitro, or in \nvo in the cells of die 
intended host. Techniques suitable for the transfer of nucleic acid into mammalian cells m vitro include the use of 
liposomes, electroporation, microinjection, cell fusion, DEAE-dexiran, the calcium phosphate precipitaiion method, 
etc. The currently preferred in vivo gene transfer techniques include iransfeciion with viral (typically retroviral) 

10 vectors and viral coat protein- liposome mediated iransfeciion (Dzau et al. . Trrn<1s jn BiQ^gchnologY iL» 205-210 
(19931). In some situations ii is desirable to provide the nucleic acid source wiih an agent that targets the target cells, 
such as an antibody specific for a ceU surface membrane protein or the target cell, a hgand for a receptor on the urgct 
cell, etc. Where liposomes are employed, proteins which bind to a cell surface membrane protein associated with 
endocyiosis may be used for targeting and/or to facilitate uptake, e.g. capsid proteins or fragments dicreof tropic for 

15 a particular ceU type, antibodies for proteins which undergo iniemalization in cycling, proteins that target intracellular 
localization and enhance intracellular half-life. The technique of receptor-mediated endocytosis is described, for 
example, by Wu et al . J. Biol. Chem. 2£2, 4429^32 (1987); and Wagner et aL . PrfK , N^U . Ac^d. Sci . USA 82. 
3410-3414 (1990). For review of the cuncntly known gene marking and gene therapy protocols see Anderson et ai, 
S£isii££256, 808-813 (1992). 

20 The various uses listed in connection with the Toll proteins herein, are also available for agonists of the 

native Toll receptors, which mimic at least one biological function of a native Toll receptor. 

Neuroirimin as well as other members of die IgLON subfamily of the immunoglobulin supcrfamily have 
been identified to have effect upon neural paneming. differentiation, maturation and growth. As a result. PR0337 
the human neuroirimin homolog polypeptides would be expected to have utility in diseases which arc characterized 
25 by neural disfunction. For example, motoneuron disorders such as amyotrophic lateral sclerosis (Lou Gehrig's 
disease). Bell's palsy, and various conditions involving spinal muscular atrophy, or paralysis. NGF variant 
fonnulaoons of the invention can be used to treat human neurodegenerative disorders, such as Alzheimer's disease, 
Parkinson's disease, epilepsy, multiple sclerosis. Huntington's chorea. Down's Syndrome, nerve deafness, and 
Meniere's disease. Moreover PR0337 polypeptide may also be used as a cognitive enhancer, to enhance learmng 
30 particularly in dementia or trauma, such as those associated with the above diseases. 

Further. PR0337 may be employed to treat neuropathy, and especially peripheral neuropathy, ''Peripheral 
neuropathy- refers to a disorder affecting the peripheral nervous system, most often manifested as one or a 
combinadon of motor, sensory, sensorimotor, or autonomic neural dysfunction. The wide variety of morphologies 
exhibited by peripheral neuropathies can each be atuibuted uniquely to an equally wide number of causes. For 
35 example, peripheral neuropathies can be genetically acquired, can result from a systemic disease, or can be induced 
by a toxic agent. Examples include but arc not limited to diabedc peripheral neuropathy, distal sensorimotor 
neuropathy, or autonomic neuropathies such as reduced motility of the gastrointestinal u-act or atony of the urinary 
bladder. Examples of neuropadiies associated with systemic disease include post-polio syndrome or AlDS-associated 
neuropathy; examples of hereditary neuropathies include Charcoi-Marie-Tooth disease, Refsum's disease. 
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Abctalipoproteinemia. Tangier disease. Krabbe's disease. Metachromatic leulcodystrophy. Fabry's disease, and 
Dejerine-Sonas syndrome; and examples of neuropathies caused by a toxic agent include those caused by treatment 
with a chemotherapeutic agem such as vuKristine. cisplatin. methotrexate, or 3--a2ido-3 -deoxythvmidine 
Coaespondingly. neurotrimin antagonists would be expected to have utility m diseases characterized by excessive 
neuronal activity. 

Endothelin is generated from inactive intermediates, the big endothelins. by a unique processing evcm 
catalyzed by the zinc metalloprotease. endothelin comrerting enzyme (ECE). ECE was recently cloned, and its 
strucmre was shown to be a smgle pass transmembrane protein with a short intracellular N-,erminal and a long 
extracellular C-terminal that contains the catalytic domain and numerous N-glycosylation sites. ECEs cleave the 
endothelin propeptide between Trp73 and VaI74 producing the active peptide. ET. which appears to function as a 
local rather than a circulating hormone (Rubanyi. G.M. & Polokoff. M.A.. Pharmachological Reviews 46: 325415 
(1994) Thus ECE activity is a potential site of regulation of endothelin production and a possible urgci for 
therapeutic intervention in the endothelin system. By blocking ECE activity, it is possible stop the production of CT-l 
by inhibiting d* conversion of the relatively inactive precursor, big ET-1. to the physiologically active form. 

ECE-2 is 64% identical to bovine ECE-2 at die amino acid level. ECE-2 is closely related to ECE-1 (63% 
identical. 80% conserved), neutral cndopepudase 24. 1 1 and the Kell blood group protein. Bovine ECE-2 is a type 
n membrane-bound metalloproteinase localized in die trans-Golg, network where it acts as an intracellular enzyme 
convening endogenous big endothelin-l into acdve endodielin (Emoto. N. and Yanai^isawa. M.. J. Biol. Chem 270- 
15262-15268 (1995). TTie bovine ECE-2 mRNA expression is highest in parts of the brain, cerebral cortex, 
cerebellum and adrenal meduUa. It is expressed at lower levels in raymetrium. testes, ovary, and endoiheUai cells. 
Bovine ECE-2 and ECE-1 both are more active on ET-1 as a substrate compared to ET-2 or ET-3. Emoto and 
Yanangisawa» supra. 

Human ECE-2 is 736 amino acids in length with a 31 residue amino-.erminal uil. a 23 residue transmembrane helix 
and a 682 caitoxy-teiminal domain. It is 94% identical to bovine ECE-2 and 64% identical to human ECE- 1 . The 
predicted transmembrane domain is highly conserved between the human and bovine ECE-2 protems and between 
human ECE-1 and human ECE-2. as are the putative N-linked glycosylation sites. Cys residues conserved in the 
neutral endopeptidase 24.11 and the Kell blood group protein family and the putative zinc binding motif. The 
sequence suggests, that like other members of die NEP-ECE-Kell family, human ECE-2 encodes a type U 
transmembrane zinc-binding metalloproteinase. which, by extrapolation from what is known about bovuie ECE-2, 
is an imracellular enzyme located within the secretory pathway which processes endogenously produced big ET-l 
while it is still in die secretory vesicles. Emoto and Yanangisawa. supra. 

TTie expression pattern of ECE-2 differs from that observed for ECE-I. Northern blot analysis of mRNA 
levels indicated low levels of expression of a 3.3 kb transcript in adult brain (highest in the cerebelhim. putamen. 
medulla and lemporal lobe, and lower in the cerebral cortex, occipital lobe and frontal lobe), spinal cord, lung and 
pancreas and higher levels of a 4.5 kb transcript in fetal brain and kidney. Tht two transcript sizes probably 
represem the use of alternative polyadenylation sites as has been observed for bovine ECE-2 (Emoto and 
Yanangisawa. supra) and ECE-I (Xu et al.. Cell 78: 473-485 (1994). PCR on cDNA libraries indicated low levels 
of expression in fetal brain, feud kidney, fetal small intestine and aduh testis. Fetal liver, fetal lung and adult 
pancreas were all negative. 
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The endothelin (ED family of peptides have potent vascular, cardiac and renal acuons which may be of 
pathophysiological importance in many human disease states. ET-1 .s expressed as an mactive 212 amino acid 
prepropeptide. The prepropeptide .s first cleaved at Arg52-Cys53 and Arg92-Ala93 and then the carboxy terminal 
Lys9 1 and Arg92 are trimmed from the protem to generate the propeptide big ET- 1 ECEs then cleave the propeptide 
between Trp73 and Val74. producing the active peptide. ET, which appears to function as a local rather than a 
5 circulating hormone (Rubanyi and Polokoff. Pharma. R. 46: 325-415 (1994). 

ETKlothclins may play roles in the pathophysiology of a number of disease states including: 1) cardiovascular 
diseases (vasospasm, hypertension, myocardial ischemia; reperfusion injury and acute myochardial infarction, stroke 
(cerebral ischemia), congestive heart failure, shock, atherosclerosis, vascular thickening); 2) kidney disease (acute 
and chronic renal failure. glomerulonephriDs. cirrhosis); 3) lung disease (bronchial asthma, pulmonary hypertension); 
10 4) gastiointesrinal disorders (gastric ulcer, inflammatory bowel diseases); 5) reproducuve disorders (premature labor, 
dysmenorhca. preeclampsia) and 6) carcinogenesis. Rubanyi & Polokoff. supra. 

Diseases can be evaluated for the impact of ET upon them by examining: I) increased production of ETs: 
2) increased reactivity to ETs; and/or 3) efBcacy of an ET receptor antagonist, antibody or ECE inhibitor. Response 
to the previous criteria suggest that ETs likely play roles in cerebral vasospasm following subarachnoid hemorrhage. 
15 hypertension (fulmmant/complications). acute renal failure and congesnve hean failure While inhibitors of ET 
production or activity have not been used in models of coronary vasospasm, acute myocardial infarction, and 
atherosclerosis, they do have elevated ET levels and increase reactivity to ETs. Shock and pulmonary hypenension 
also exhibit elevated ET levels (Rubanyi and Polokoff. supra). Inhibition of ECEs in these conditions may be of 
ther^utic vahie. 

20 The expression pattern of ECE-2 differs from that observed for ECE-l . ECE-2 was observed at low levels m the 
adult brain, lung and pancreas and higher levels in fetal brain and kidney by Northern blot analysis (Fig. 8). PCR 
tevealed low levels of expression in additional ussues: fetal lung, feu.1 small intesune and adult testb. Fetal liver was 
negative. A similar pattern was reported for bovine ECE-2 (Emoto and Yanangisawa. supra). U is expressed in brain 
tissues (cerebral cortex. cerebeUum and adrenal medulla), myometrium and testis, and m low levels in ovary and vety 

25 low levels in many other tissues. Bovine ECE-l (Xu et al. supra) is more widely and more abundantly expressed. 
It is observed in vascular endothelial cells of roost organs and in some parenchymal cells. With tiie exception for 
brain, bovine ECE-2 mRNA was present at lower levels than ECE-l. Applicants believe ECE-2 to be a particularly 
good target for the therapeutic intervention for diseases such as cerebral vasospasm following subarachnoid 
hemorrhage and strdce. 

30 

92. ^|^«i-PRO Po |YP1T'f'''r Ant'hod'es 
TTK present invention further provides anti PRO polypeptide antibodies. Exemplary antibodies include 
polyclonal, monoclonal, humanized, bispecific, and heteroconjugate antibodies. 

35 A. PfflYTf"""' Antibodies 

The anii-PRO polypeptide antibodies may comprise polyclonal antibodies. Methods of preparing polyclonal 
antibodies are known lo flie skilled arusan. Polyclonal antibodies can be raised in a mammal, for example, by one 
or more injections of an immunizmg agent and. if desired, an adjuvant. Typically, die immunizing agem and/or 
adjuvant wiU be injected in the mammal by multiple subcutaneous or intraperitoneal injections. The immunizing agem 
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may mclude ihc PRO polypeptide or a fiision protein Uiereof. 1, may be useful to conjugate the inununizms aaent 
u. a protein known to be immunogenic in the mammal being .mmunized. Examples of such immunoeemc protetns 
include but aie no. limited «> keyhole limpe. hemocyanin. serum albumin, bovine thyroglobulin. and soybean trypsm 
inhibitor. Examples of adjuvants which may be employed include Freund's complete adjuvant and MPL-TDM 
adjuvam (monophosphoryl Lipid A. symhetic trehalose dicorynomycolate). The immunization protocol may be 
selected by one skilled in the an without undue experimentation. 
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B. Monoclonal An«^-^ft,|iyff 
TTie anti-PRO polypeptide antibodies may. altemaiively. be monoclonal antibodies. Monoclonal antibodies 
may be prepared using hybridoma methods, such as those described by KohJer and Milstein. Nmre, 25fi:495 (1975) 
In a hybridoma method, a mouse, hamster, or other appropriate host ammal. is typically immunized wtth an 
immunizing agent to elicit lymphocytes .hat produce or are capable of producing antibodies tha. will specifically bind 
to the immunizing agem. Alternatively, the lymphocytes may be immunized in vitro. 

The immunizing agent will typically include the PRO polypeptide of interest or a fusion protein thereof 
GeneiaUy. either peripheral btood lymphocytes ("PBLs") an: used if cells of human origin are desired, or spleen cells 
or lymph node cells are used if non-human mammalian sources are desired. The lymphocytes are then fused with 
an immortalized ceU line using a suitable fusing agent, such as polyethylene glycol, to form a hybridoma cell fOoding. 
Monoclonal Ani i hmligs: PrinciPln atld Pnwtict; . Academic Press. (1986) pp. 59-1031. Immortalized cell lines aie 
usuallytransfonnedmamnialiancells.panicularlymyeIomacellsofrodent.bovineandh^ Usually rat 

or mouse myeloma ceU lines are employed. TT,e hybridoma cells may be cuhured in a suitable culmre medium that 
preferably contains one or more substances that inhibit the growth or survival of the unfiised. tmmonalized cells. 
For example, if the parental cells lack the enzyme hypoxanthine guanine phosphoribosyl transferase (HGPRT or 
HPRT). the culmre medium for the hybridomas typically will include hypoxanthine. aminopterin. and thymidme 
("HAT medium"), which substances prevem the growth oif HGPRT-deficiem ccUs. 

Preferred immortalized cell lines are those that fuse efficienUy. support stable high level expression of 
antibody by d« selected antibody-producing cells, and are sensitive to a medium such as HAT medium. More 
preferred immonalized cell Unes are murine myeloma lines, which can be obtained, for instance, from the Salk 
Instimte Cell Distribudon Center. San Diego. California and the American Type Culmre Collection. RockviUe 
Maryland. Human myeloma and mouse-human heteromyeloma cell lines also have been described for the production 
of human monoclonal antibodies [Kozbor. / InvnunoL, 122:3001 (1984); Brodeur et al. Monoclonal Amibod. 
Production Techniques and AppUcations. Marcel Dekker, Inc., New York. (1987) pp. 5M3]. 

The culmre medium in which the hybridoma cells are cultured can then be assayed for the presence of 
monoclonal antibodies directed against the PRO polypeptide of interest. Preferably, the binding specificity of 
monoclonal antibodies produced by the hybridoma cells is determined by immunoprecipiution or by an m virro 
binding assay, such as radioimmunoassay (RIA) or enzyme-linked immunoabsorbent assay (ELISA). Such techniques 
and assays are known in ti,e art. TT^e binding affinity of the monoclonal antibody can. for example, be determined 
by die Scatchard analysis of Munson and PoUard. Anal. Biochem. , m220 (1980). 

Alter the desiitd hybridoma cells are identified, the clones may be subdoned by limiting dilution procedures 
and grown by sumdard methods [Coding. smsH. Suitable culmre media for this purpose include, for example 
Dulbecco's Modified Eagle's Medium and RPMM640 medium. Ahematively. the hybridoma cells may be grown 
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in vivo as ascites in a mammal. 

monoclonal antibodies secteied by the subclones may be isolated or purified from ihe culture medium 
or ascites fluid by conventional immunoglobulin purificauon procedures such as. for example, protein A-Sepharosc. 
hydroxylapatite chromatography, gel electrophoresis, dialysis, or affinity chromatography. 

The monoclonal anubodies may also be made by recombinant DNA methods, such as those described in 
5 U.S. Patent No. 4.816.567. DNA encoding the monoclonal antibodies of the invention can be readily isolated and 
sequenced using com^entional procedures (e.g.. by using oligonucleotide probes that are capable of binding 
specifically to genes encoding *e heavy and Ught chains of murine antibodies). The hybridoma cells of the invention 
serve as a preferred source of such DNA. Once isolated, the DNA may be placed into expression vectors, which 
are then transfecicd imo host cells such as simian COS cells. Chinese hamster ovary (CHO) cells, or myeloma cells 
10 that do not otherwise produce immunoglobulin protein, to obtain the synthesis of monoclonal antibodies in the 
recombinant host cells. The DNA also may be modified, for example, by substimiing the coding sequence for human 
heavy and light chain constant domains in place of the homologous murine sequences [U.S. Paiem No. 4.816.567; 
Morrison et al.. asBO] or by covalemly joining to the immunoglobulin coding sequence all or pan of the coding 
sequence for a non-immunoglobulin polypeptide. Such a non-immunoglobulin polypeptide can be subsrimted for the 
15 constam domains of an antibody of the invention, or can be substituted for ti« variable domains of orve antigen- 
combining site of an antibody of the invention to create a chimeric bivalent antibody. 

TTk antibodies may be monovalent antibodies. Methods for preparing monovalem antibodies are well known 
•m the art. For example, one method involves recombinant expression of immunoglobulin light chain and modified 
heavy chain. TTk heavy chain is truncated generally at any poim in the Fc region so as to prevem heavy chain 
20 crosslinking. Alternatively, the relevant cysteine residues are substimted with anodier amino acid residue or are 

deleted so as to prevent crosslinking. 

/„ vitro methods are also suiuble for preparing monovalent anubodies. Digestion of antibodies to produce 
fragments thereof, particularly. Fab fragments, can be accomplished using routine techniques kiKwn in the art. 

25 r Hitmaniaed Antibodies 

The anti-PRO polypeptide antibodies of the invention may ftirther comprise humanized antibodies or human 
antibodies. Humanized forms of non-human (e.g.. murine) antibodies are chimeric immunoglobulins, 
ta^obulin chains or fragmems thereof (such as Fv. Fab. Fab'. F(ab'), or oti^er antigen-binding subsequences 
of antibodies) which comain minimal sequence derived from non-human immunoglobulin. Humanized antibodies 
30 include human immunoglobulins (recipient antibody) in which residues from a complemenuiy determining region 
(CDR) of the recipient are replaced by residues liom a CDR of a non-human species (donor antibody) such as mouse, 
rat or rabbh having the desired specificity, affinity and capacity. In some instances. Fv framcworic residues of the 
human immunoglobulin are replaced by corresponding non-human residues. Humanized antibodies may also 
comprise residues which are found neither in the tecipietit antibody nor in the imported CDR or frimiework 
35 seqtiences. In general, tite humanized antibody wiU comprise substantially all of at least one. and typically two. 
variable domaim. in which ail or substantially aU of the CDR regions correspond to diose of a non-tauman 
i„m«UK«lobulin and all or substantially all of the FR regions are dK«e of a human immunoglobulin consensus 
sequence. TTie humanized amibody optimally also will comprise at least a portion of an immunoglobulin constant 
region (Fc). typically that ofahuman immunoglobulin IJoneserfl/.. Namre.m: 522-525 (1986); Riechmam, « a/.. 
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Nature, 222:323-329 (1988): and Presta. Curr. Op. Strua. Biol . 2:593-596 (1992)^ 

Mechods for humanizing nan-hunum antibodies are well known m ihc an. Generally, a humanized antibody 
has one or more anuno acid residues uuroduced imo it from a source which .s non-human. TTiese non-human ammo 
acid residues are often referred «, as 'unpon' residues, which are vpically uken from an variable domain 

Humanizadon can be essentially perfomied following the meduxl of Winter and co-workers f Jones « al. . Nature 221' 
5 522-525 (1986); Riechmann el al.. Nature, 322:323-327 (1988): Verhoeyen et al.. Science. 222:1534-1536 (1988)) 
by substituting rodent CDRs or CDR sequences for the corresponding sequences of a human amibody. Accordingly' 
such "humanized" antibodies are chimeric antibodies (U.S. Patem No. 4,816.567). wherein substamially less than 
an intact human variable domain has been substituted by the corresponding sequence from a non-human species. In 
practice, humanized antibodies arc typically human antibodies in which some CDR residues and possibly some FR 
10 residues are substinited by residues from analogous sites in rodent antibodies. 

Human antibodies can also be produced using various techniques known in the an. including phage display 
libraries (Hoogenboom and Winter. J Mol. Bid., 222:381 (1991); Marks « oL J. Mol. Biol. . 222:581 (1991)1 The 
techniques of Cole et al. and Boemer e, al. are also available for Uie preparation of human monoclonal antibodies 
(Cole et al.. Monoclonal Ambcdies and Cancer Therapy, Alan R. Liss. p. 77 (1985) and Boemer « al.. J Immunol 
15 i47ai:86-95 (1991)). 
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D. Bisnecinc Anti7f,H|j»j, 

Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that have binding 
specificities for at least two different antigens. In the present case, one of the binding specificities is for the PRO 
polypeptide, the other one is for any other antigen, and preferably for a cell-surface protein or receptor or receptor 



subunit 



Methods for making bispecific antibodies are known in the art. Traditionally, the recombinant production 
of bispectfic antibodies is based on the co-expression of two immunoglobulin heavy<hain/light-cham pairs, where 
the nvo heavy chains have different specificities fMilstein and Cuello. Nature, 2J25:537-539 (1983)). Because of the 
random assonmem of immunoglobulin heavy and light chains, these hybridomas (quadroraas) produce a potential 
nnxture of ten diflerem amibody molecules, of which only one has the correct bispecific smicmre. TTxe purification 
of the correct molecule is usually accomplished by affinity chromatography steps. Similar procedures arc disclosed 
in WO 93/08829. published 13 May 1993. and in Traunecker et al.. EMBO J. . lfl:3655-3659 (1991). 

Antibody variable domains with the desired binding specificities (amibody-antigcn combining sites) can be 
fused to immunoglobulin constant domain sequences. TTie fi«,on preferably .s with an immunoglobulin heavy<hain 
constant domain, comprising at least pan of the hinge. CH2. and CH3 regions. It is preferred to have the first heavy- 
chain constant region (CHI) containing the site necessary for light-chain binding present in at least one of the fusions. 
DMAs encoding the immunoglobulin heavy^ feions and. if desired, the immunoglobulin light chain, are inserted 
into separate expression vectors, and are co-transfected into a suitable host organism. For fimher details of 
35 generating bispecific antibodies see. for example. Suresh et al.. Methods in Emymology, i21:2IO (1986). 

E. Heteroconiupaic Antihodi>« 
Hetcroconjugate antibodies are also within the scope of the presem invemion. Hetcroconjugaie antibodies 
are composed of two covalemly joined antibodies. Such antibodies have, for example, been proposed to target 
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immune system celU to unwanted cells (U-S. Patent No. 4,676.9801, and for treaimem of HIV infection [WO 
91/00360; WO 92/200373; EP 03089). It is contemplated that the amibodies may be prepared m iwro using known 
methods in synthetic protein chemistry, including those involving crosslinking agents. For example, immunotoxins 
may be constructed using a disulfide exchange reaction or by forming a thioether bond. Examples of suitable reagents 
for this purpose include iminothiolate and methyM-mercaptobuiyrimidate and those disclosed, for example, in U.S. 
5 Patent No. 4.676.980. 

»« Vfiff, frnrt-PttO Polvnfntidf AntH)0<l|« 
The ami-PRO polypeptide antibodies of the invention have various utilities. For example. anti-PRO 
polypeptide antibodies may be used in diagnostic assays for a PRO polypeptide, e.g.. detecting its expression in 
10 specific cells, tissues, or serum. Various diagnostic assay techniques known in the art may be used, such as 
competitive binding assays, direct or indirect sandwich assays and immunoprecipitation assays conducted in either 
heterogeneous or homogeneous phases IZola. MoiVKlonal AffliMi'-V ft Mf^TWMfff TwiUliWCS. CRC Press. Inc. 
(1987) pp. 147-1581. The antibodies used in the diagBosric assays can be labeled with a detectable moitty. The 
detectable moiety shouW be capable of producing, either directly or indirectly, a detectable signal. For example, the 
15 detectable moiety may be a radioisotope, such as 'H. "C.- P." S, o.-" 1. a fluorescent or chemiluminescent 
compound, such as fluorescein isothiocyanate, rhodamine, or taciferin. or an enzyme, such as alkaline phosphatase, 
beo-galactosidasc or horseradish peroxidase. Any method known in the art for conjugating the antibody to the 
detecuble moiety may be employed, including those methods described by Humcr et al. Nature. JM:945 (1962); 
David « al.. Biochemistry. 12:1014 (1974): Pain et al., J. Imnmnol. Meth., 4Q:219 (1981): and Nygren. J. 
20 HiaoOiem. and Cytochem. , 2fi:407 (1982). 

Anti-PRO polypeptide antibodies also are useftil for the affinity purification of PRO polypeptide from 
recombinant cell culture or naniral sources. In this process, the antibodies against the PRO polypeptide are 
immobilized on a suitable support, such a Sephadex resin or filter paper, using methods well known in the art. TTic 
immobilized antibody then is contacted with a sample containing the PRO polypeptide to be purified, and tiiereafter 
25 the support is washed with a suitable solvent that wiU remove substantially aU the material in the sample except the 
PRO polypeptide, which is bound to the immobilized antibody. Finally, the support is washed with another suitable 
solvem that wiU release the PRO polypeptide from the antibody. 

Arti-ToU receptor (i.e., anti-PR0285 and anti-PR0286 antibodies) may also be useful in blocking the 
biological activities of the respective Toll receptors. Tbc primary function of die family of Toll receptors is believed 
30 to be to act as patiwgen pattern recognition receptors sensing the presence of conserved molecular pattern presem 
on microbes. Lipopolysaccharidcs (LPS. also known as endotoxins), potentially letiial molecules produced by 
various bacteria, bind to die lipopolysaccharide binding protein (LBP) in the blood. The complex formed then 
activates a receptor known as CD14. TTiere is no consensus in the art about what happens next. According to a 
hypothesis, CD14 does not directiy instruct macrophages to produce cytokines, cell adhesion proteins and enzymes 
35 involved in the production of lower molecular weight proinflammatory mediators, rather enables LPS to activate a 
second receptor. Alternatively, it has been suggested that LPS may activate certain receptors directly, without betp 
from LBP or CD14. The dau disclosed in the presem application indicate that the human toll-like receptors are 
signaling receptors that are activated by LPS in an LBP and CDU responsive manner. As tins mechanism, under 
pathophysiologic conditions can lead to an often fatal syndrome called septic shock. ami-ToU receptor antibodies (just 
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as «her Toll receptor anagon.su) mgta be useful in *c trearoen. of scp.ic shock. Ic .s foreseen mat Oie different 
ToU recepu,^ inigh. recognize different pathogens, e.g. . various strains of Gram-negat.ve or Gram-positive bactena 
Accordingly, in ceruun situations, combinataon therapy with a nurture ofant.bod.es specif.cally binding different Toll 
receptors, or the use of bispecific anti-ToU antibodies may be desirable. 

It is specifically denumstrated that anu-huTLR2 antibodies are believed to be speciHcally useful in blocking 
a,e induction of this receptor by LPS. As it has been shown that LPS exposure can lead to septic shock (Parrillo 
V. . 228. 1471-1477 [1993]). anti-huTLR2 antibodies are potentially uscfi.1 in the treatment of septK: 



shock. 



nic foregoing merapeutic and diagnostic uses listed in collection with the anti-Toll receptor antibodies are 
also applicable to o^er ToU antagonist. ..e.. other molecules (proteins, peptides, snu.ll organic molecules, etc.) that 
10 block ToU receptor activation and/or signal transduction mediated by Toil receptors. 

In view of their therapeutic potentials, the Toll proteins (including variants of the native Toll homologues) 
and their agonists and antagonists (including but not limited to anti-Toll antibodies) are incorporated in compos.uoni 
suitable for therapeutic use. TT^tapeutic compositions are prepared for storage by mixing the active ingrediem 
havtng the desired degree of purity with optional physiologically acceptable carriers, excipients or stabilizers 
(Remu^ton-s Pharmaceutical Sciences 16th Edition, Osol. A. Ed. 1980) in the form of lyophilized formulations or 
aqueous solutions. Acceptable carriers, excipients or stabilizers are nontoxic to recipients at the dosages and 
concentrations employed, and include buffers such as phosphate, citrate and other organic acids: antioxidants 
mcludmg ascorbic acid; low molecular weight (less than about 10 residues) polypepddes; proteins, such as senm, 
albumin, gelatin or immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone, amino acids such as 
glycine, glutamine. asparagine. arginine or lysine; monosaccharides, disaccharides and other carbohydrates Including 
glucose, mamM,se. or dextrins; chelating agents such as EDTA; sugar alcohols such as mannitol or sorbitol: salt- 
forming counterions such as sodium; and/or nonionic surfactants such as Tween. Pluronics or PEG. 

The active ingredients may also be entrapped in microcapsules prepared, for example, by coacervation 
techniques or by interfacial polymerization, for example, hydroxymethylcellulose or gelatin-microcapsules and poly- 
(methylmedmcylate) microcapsules, respectively), in colloidal dmg delivery systems (for example, liposomes 
albumin microspheres, microemulsions. nano-panicles and nanocapsules) or in macroemulsions. Such techniques 
are disclosed in Remington's Phani»r>...tir«^ ^r\mn supra. 

The formulations to be used for in vivo administration must be sterile. TOs is readily accomplished by 
miration through sterile filtration membranes, prior to or following lyophilization and reconstitudoa. 

TTierapeutic compositions herein generally are placed into a container having a sterile access port, for 
example, an intravenous solution bag or vial having a stopper pierccable by a hypodermic injection needle. 

The route of administration is in accord with known methods, e.g. injection or infiision by intravenous 
umperitot^al. intracerebral, intramuscular, intraocular, intraarterial or intralesional routes, topical admlnistrauon, 
or by sustained release systems. 

Suitable examples of sustained release preparations include semipermeable polymer matrices in the form 
of shaped articles, e.g. fihns. or microcapsules. Sustained release matrices include polyesters, hydrogels 
polylaciides(U.S. Patent 3.773.919. EP 58.481). copolymers of L-glutamic acid and gamma ethyl-L-glutamate (u' 
Sidman a HL. fiiflBSfaoDCIS 22 d): 547-556 [19831), poly (I-hydroxyeihyl-methacrylate) (R. Langer. a al JL 
B i omcd , Man Rf l li-. 167-277 (I9811 and R. Langer. CbcBUislu 12: 98-105 |1982I). ethylene vinyl aceiate 
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(R. Langer si al.. Id.) or poly-D-(-)-3-hydroxybutyric acid (EP 133.988). Sustained release compositions also include 
liposomes. Liposomes containing a molecule within the scope of the present invention are prepared by methods 
known Eeiss: DE 3.2L8.121; Epstein g gj-. PfPT Nsil Sci, USA 82= 3688-3692 (1985); Hwang a/-, 
p,,. Ar.H .Sci. USA 71: 403(M034 (1980): EP 52322: EP 36676A: EP 88046: EP 143949: EP 142641: 

Japanese patent application 83-118008; U.S. patents 4.485.045 and 4.544.545; and EP 102.324. Ordinarily the 
5 liposomes are of the smaU "(about 200-800 Angstroms) tmilamelar type m which the lipid content is greater than about 
30 mol. % cholesterol, the selected proportion being adjusted for the opumal NT-4 therapy. 

An effecnve amount of the active ingredient will depend, for example, upon the therapeutic objectives, the 
route of administration, and the condition of the patiem. Accordingly, it will be necessary for the therapist to uter 
the dosage and modify the route of administration as required to obuin the optimal therapeuuc effect. A typical daily 
10 dosage might range from about 1 ,g/kg to up to 100 mg/kg or more, depending on the factors menuoned above. 
Typically, the clinician wUl administer a molecule of the present invention until a dosage is reached that provides the 
required biological effect. The progress of this therapy is easily monitored by conventional assays 

The following examples are offered for illustrative purposes only, and arc not intended to limit the scope 

of the present invention in any way. 
15 AU patent and literamre references cited in die present specification are hereby incorporated by reference 

in their entirely. 

Commercially available reagents refened to in the examples were used according to manufacturer's 
20 instructions unless otherwise indicated. The source of toe ceUs identified in the following examples, and throughout 
dte specification, by ATCC accession numbers is the American Type Culmre Collection. Rockville. Maryland. 

Therefor 

25 TTK extraceUular domain (BCD) sequences (including the secretion signal sequence, if any) from about 950 

taown secreted protems from the Swiss-Pro. public database were used to search EST daubases . TTte EST databases 
included public databases (e.g.. Dayhoff. GenBank). and proprietary daubases (e.g. LIFESEQ™. Incyte 
Pharmaceuticals. Palo Alto. CA). Tl« search was perfomied using dte computer propam BLAST or BLAST2 
(Altschul and Gish. m,.h»h, in t:nrvmologv m 460-480 (1996)) as a comparison of the ECD protein sequences 
30 to a 6 frame translanon of .he EST sequences. Those comparisons with a Blast score of 70 (or in some cases 90) or 
greater that did not encode known proteins were clustered and assembled into consensus DNA sequences with the 
program "phrap- (Phil Green. University of Washington. Seattle. WA; 
(htq>://bozeman.mbt.washington.edu/phrap.docs/phrap.hmil). 

Using this extraceUular domain homology screen, consensus DNA sequences were assembled relative to 
35 AeotheridentifiedECTsequencesusingphrap. l„ addition, the consensus DNA sequences obtained were often (but 
M always) extended using repeated cycles of BLAST and phrap to extend the consensus sequence as far as possible 
using the sources of EST sequences discussed above. 

Based upon the consensus sequences obtained as described above, oligonucleotides were then synthesized 
a«i used to identify by PGR a cDNA library that contained the sequence of interest and for use as probes to isolate 
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a Clone of the fuU-lcngd, coding sequence tor a PRO polypeptide. Forward ( 0 and reverse (.r) PCR pnmers 
generally range from 20 to 30 nucleotides and ate often destgned to g,ve a PCR product of about 100-1000 bp in 
length. TlK probe (.p) sequences are typically 40-55 bp in length. In some cases, additional oligonucleotides are 
synthestzed when the consensus sequence ,s greater than about .-I.5kbp. In order to screen several libraries for a 
full-length clone, DNA from the libraries was screened by PCR amplification, as per Ausubel et al £^ 
in Molr r ulur mm . with d« PCR primer pair. A positive library was *en used to isolate clones 
encodmg the gene of interest using the probe oligonucleotide and one of die primer pairs. 

The cDNA libraries used to isolate the cDNA clones were constructed by standard method, usi«r 
conm^raaliy available reagents such as those from Invitrogen. San Diego. CA. Tl« cDNA was prtmed wiU, oligo 
dT contanung a Nod site, linked with blun, to Sail hemikinased adaptors, cleaved with Notl. stzed appropriately by 
gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or pRKD- 
PRK5B ,s a precursor of pRK5D d.at does not contain d,e Sfil site; see. Holmes e, al.. Sl^, 252:1278-1280 
(1991)) in the unique Xhol and Notl sites, 

EXAMPIF, 2 : Isolation of cDNA dnn... hy ^ mvla^t S^o-n inf 

PrCPararion of Olieo dT nr;nv.,| cDNA library 

mRNA was isohted from a human tissue of interest using reagents and protocols from Invitrogen San 
Diego. CA (Fast Track 2). Hus RNA was used to generate an oligo dT primed cDNA Ubrary in the vector pRK5D 
ustng reagems and protocols from Life Technologies. Gaithersburg. MD (Super Script Plasmid System) fa this 
procedure. d,e double stnuKled cDNA was sized to greater thanlOOO bp and the Sall/NoU linkered cDNA was cloned 
mto Xhol/Notl Cleaved vector. pRK5D is a cloning vector that has an sp6 tnutscriptlon inidation s.te followed by 
an Sfil restnciion enzyme site preceding the XhoI/NotI cDNA cloning sites. 

2- Preparation of random pric ed cDNA 

A secondary cDNA library was generated in order to preferenUally represem the 5" ends of the primary 
CDNA clones. Sp6 RNA was generated from die primary library (described above), and dus RNA was used to 
generate a random primed cDNA library in Ute vector pSST-AMY.O using reagents and protocols from Life 
Technologies (Super Script Plasmid System, referenced above), fa .his procedure the double stranded cDNA was 
sued to 500-1000 bp. linkered wid, blum to Nod adaptors, cleaved with Sftl. and cloned into Sfil/NoU cleaved 
vector. pSST-AMY.0 Is a cloning vector d»t has a yeast alcohol dehydrogenase promoter preceding die cDNA 
clomng snes and the mouse amylase sequence (the mature sequence without the secr«io„ sigmtl) followed by .he yeas, 
alcohol dehydrogenase terminator, after *e clomng sites. Ttm. cDNAs cloned into dus vector U». are fused in 
frame w.th amylase sequence will lead to die secretion of amylase from appropriately ,ransfec«d yeas, colonies. 

3- Transfnrmarion and Defcrrinp 

DNA from the library described in paragraph 2 above was chilled on ice to which was added 
electrocompetent DHIOB bacteria (Ufe Technologies. 20 ml). Tl,e bacteria and vector mixnire was d«n 
electr^porated as recommended by d« mamtfacturer. Subsequendy, SOC media (Ufe Technologies. 1 ml) was added 
and U.e m«nire was incubated at 37-C for 30 mimites. TTte ,ransfom«ms were dten plated onto 20 standard 150 
mm LB plates containing ampicUlin and incubated for 16 hours (37X). Positive colonies were scraped off the plates 
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and the DNA was isolated from the bacterial pellet using standard protocols, e.g. CsCl-gradicm. The purified DNA 
was then carried on lo the yeasi protocoU below. 

TTie yeast tneihods were divided into three categories: (1) Transfomution of yeast with the plasmid/cDNA 
combined vector; (2) Detection and isolation of yeast clones secreting amylase: and (3) PGR amplification of the 
insert directly from the yeast colony and purification of the DNA for sequencing and further analysis. 
5 The yeast strain used was HD56-5A (ATCC-90785) This stram has the following genotype: MAT alpha. 

ura3-52. lcu2-3. Ieu2-112. his3-1 1. hisS-lS. MAL*. SUC*. GAL* Preferably, yeast mutants can be employed that 
have deficiem post-translational pathways. Such mutants may have translocation deficiem alleles in seel 1 . secll, 
secfa. with truncated «c71 being most prefencd. Aliemaiively. antagonists (including antisense nucleotides and/or 
ligantis) which interfere with the normal operation of these genes, other proteins implicated in this post translation 
10 pathway (e.g.. SEC61p. SEC72p. SEC62p. SEC63p. TDJ Ip or SSAlp4p) or the complex formation of these proteins 
may also be preferably employed in combination with the amylase-expressing yeast. 

Transformation was performed based on the protocol outlined by Gieu et al.. Nwl , Acid- Res. . ai:»425 
(1992). Transformed cells were then inoculated from agar into YEPD complex media broth (100 ml) and grown 
overnight at 20°C. The YEPD bioth was prepared as described in Kaiser et al.. Mflhod^ in YMMCcnctic? . Cold 
15 Spring Harbor Press. Cold Spring Harbor. NY. p. 207 (1994). TTie overnight culture was then diluted to about 2 
X lO* cells/ml (approx. OD«„=0.I) into fresh YEPD broth (500 ml) and regrown to I x' 10 cells/ml (approx. 
OD«»=0.4-0.5). 

TlK cells were then harvested and prepared for transformation by transfer into GS3 rotor botdes m a Sorval 
GS3 rotor at 5.000 rpm for 5 minutes, the supernatant discarded, and then resuspended into sterile water, and 
20 centrifuged again in 50 ml falcon mbes at 3.500 rpm in a Beckman GS-6KR centrifuge. The supenuum. was 
discarded and the cells were subsequemly washed with UAc/TE (10 ml. 10 mM Tris-HCl. 1 mM EDTA pH 7.5. 
100 mM Li,OOCCH0. and resuspended into LiAc/TE (2.5 ml). 

Transformation took place by mixing the prepared celte (100 m») with freshly denatured single stranded 
salmon testes DNA (Lofstrand Ubs. Gaithersburg. MD) and transforming DNA (1 ;xg. vol. < 10 ,\) in microfugc 
25 tubes. mixnire was mixed briefly by vortexing, then 40% PEG/TE (600 ,1. 40% polyethylene glycoMOOO. 10 
mM Tris-HCl. 1 mM EDTA. 100 mM Li^OOCCH,. pH 7.5) was added. TOs muture was gemly mixed and 
ireubaied at 30-C while agitating for 30 minutes. TTe cells were then heat shocked at 42°C for 15 minutes, and the 
reaction vessel centrifiiged in a microfiige at 12.000 rpm for 5-10 seconds, decanted and resuspended into TE (500 
;il. 10 mM Tris-HCl, 1 mM EDTA pH 7.5) foUowed by reccmrifugauon. TTic ceUs were then diluted into TE (1 ml) 
30 and aliquots (200 mD were spread onto the selective media previously prepared in 150 mm growth plates (VWR). 

Alternatively, instead of multiple smaU reactions, the transformation was perfomwd using a single, large 
scale reaction, wherein reagem amounts were scaled up accordingly. 

The selective media used was a synthetic complete dextrose agar lacking uracil (SCD-Ura) prepared as 
described in Kaiser et al.. ^^^^ i«y.^^C^6c.. Cold Spring Harbor Press, Cold Spring Harbor. NY. p. 208- 
35 210(1994). Transformanis were grown at SO'C for 2-3 days. 

The detection of colonies secreting amylase was performed by including red surch in the selective growth 
media. Staich was coupled to the red dye (Reactive Red.l20. Sigma) as per the procedure described by Biely et ai. . 
Anal Biochem. . 122: 176-179 (1988). The coupled starch was incorporated imo the SCD-Ura agar plates at a final 
concentration of 0.15% (w/v), and was buffered with potassium phosphate to a pH of 7.0 (50-100 mM fmal 

181 



\VO«WJ628l W A PCT/US99/05028 

concentration). 

nc posidve colonies were picked and streaked across fresh selective n^dta (onto ISO mm plates) m order 
to obtatn well isolated and identifiable single colonies Well isolated single colonies posnive for amylase secretion 
were detected by direct inco^Joration of red starch mto buffered SCD-Ura agar. Pos.ti vc colonies were determined 
by their ab.hty to break down starch resulong m a clear halo around die positive colony visualized directly. 

4- Matiffn of DNA bv ppr Ampiip^an^n 

When a positive colony was isolated, a portion of it was picked by a toothpick and diluted into sterile water 
(30 Ml) m a 96 well pla.c. A, this time, the positive colonies were either frozen and stored for subsequent analysis 
or immediately amplified. An aliquot of cells (5 ,1) was used as a template for the PCR reaction in a 25 ^1 volume 
contammg: 0.5 ,1 Kientaq (Clomech. Palo Al.o. CA): 4 0 ^1 10 mM dNTP s (Perkin Elmer-Cetus) 2 5 mI Kenuq 
buffer fClontcch): 0.25 ^1 forward oligo 1: 0.25 ,1 reverse oligo 2; 12.5 ,1 distilled water TTie sequence of the 
forward oligonucleotide i was: 

5- -TGTAAAACGACGGCCAGTTAAATAr.ArrTr.r^^TTATTArT-Tr (SEQ ID NO:324) 
The sequence of reverse oligonucleotide 2 was- 

5'-CAGGAAACACCTATGACCACCTr^ArArrTr.r^^^Trrimr (SEQ ID NO:325) 
PCR was then performed as follows: 



a. 




Denature 


92*»C. 


5 minutes 


b. 


3 cycles of: 


Denature 




30 seconds 






Anneal 


59°C, 


30 seconds 






Extend 




60 seconds 


c. 


3 cycles of: 


Denature 




30 seconds 






Anneal 


SVC, 


30 seconds 






Extend 


72 ''C, 


60 seconds 


d. 


25 cycles of: 


Denature 


92 •'C, 


30 seconds 






Anneal 




30 seconds 






Extend 


72*'C. 


60 seconds 


e. 




Hold 


4**C 





The underlined regions of the oligonucleotides annealed to die ADH promoter region and the amylase 
region, respectively, and amplified a 307 bp region from vector pSST-AMY.O when no insen was present. Typically 
Ihe first 18 nucleotides of the 5' end of these oligonucleotides contained annealing sites for the sequencing primers' 
Thus, the total product of the PCR reaction from an empty vector was 343 bp. However, signal sequence-fused 
cDNA resulted in considerably longer nucleotide sequences. 

FoUowing the PCR. an aliquot of the reacnon (5 mD was examined by agarose gel electrophoresis in a 1 % 
agarose gel usi.« a Tris-Borau^-EDTA (TBE) buffermg system as described by Sambrook et al.. sum- Clones 
resulting in a single strong PCR product larger than 400 bp were farther analyzed by DNA sequencing after 
purification with a 96 Qiaquick PCR clean-up column (Qiagen Inc.. Chatswonh. CA). 
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pXAMPLE 3 'r'riliffn ftf tPN* FHrodiny Human PRQ213 

A consensus sequence was obtained relarivc to a variety of EST sequences as described in Example 1 above, 
wherein the consensus sequence obtained is herein designated DNA28735. Based on the DNA28735 consensus 
sequence, oligonucleotides were syndiesized: 1) to identify by PGR a cDNA library thai contained the sequence of 
interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR02I3. 
5 A pair of PGR primers (forward and reverse) were synthesized: 

f»n.,rHPrB nrimer 5*-TGGAGGAGGAATATGGCAGCC-3* (SEQIDN0:3) 
rT-w^ryP^" 5'-TnTCGAGTCGTGTCGGGTTGG-3- (SEQ ID N0:4) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA28735 
sequence which had the following nucleotide sequence 
10 hy^^'^1ir'"■"" 

5'-GGTGACACTrGCCAGTCAGATGTGGATGAATGCAGTGCTAGGAGGG-3' (SEQ ID N0:5) 

In order to screen several Ubraries for a source of a full-length done. DNA from the libraries was screened 
by PGR amplification with the PGR primer pair idemificd above. A positive library was then used to isolate clones 
encoding the PR0213 gene using the probe oligonucleotide and one of the PGR primers. RNA for construction of 
15 the cDNA libraries was isolated from human fetal lung tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR0213 
(herein designated as UNQ187 (DNA30943-1 163)1 (SEQ ID N0:1) and the derived protein sequence for PR0213. 

The eniiic nucleotide sequence of UNQ187 (DNA30943-1 163) is shown in Figure 1 (SEQ ID NO: 1). Clone 
UNQ187 (DNA30943-1163) contains a single open reading frame with an apparem translational initiauon site at 
20 nucleotide positions 336-338 and ending at the stop codon at nucleotide positions 1221-1223 (Figure 1). Ibt 
predicted polypepude precursor is 295 amino acids long (Figure 2). Glone UNQ187 (DNA30943-1 163) has been 
deposited with ATCC. 

Analysis of the amino acid sequence of the fiill-length PR02I3 polypeptide suggests that a portion of it 
possesses significam homology to the human growth arrest-specific gene 6 protein More specifically, an analysis 
25 of the Dayhoff database (version 35.45 SwissProt 35) evidenced significant homology between the PR0213 ammo 
acid sequence and the following Dayhoff sequences. HSMHC3W5A_6 and B48089. 

EXAMPLE 4 : lijnlarinn nf cHNA C lnnes Encoding Human PR(K7.4 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 above, 

30 wherein the consensus sequence obtained is herein designated DNA36469. Based on the DNA36469 consensus 
sequence, oligonucleotides were synthesized: 1) to identify by PGR a cDNA library that contained the sequence of 
imerest. and 2) for use as probes to isolate a clone of the full-length coding sequence for PR0274. ESTs proprietary 
to Genentech were employed in the consensus assemWy. The ESTs are shown in Figures 5-7 and are herein 
designated DNA17873. DNA36157 and DNA28929. respectively. 

35 Pain of PGR primers (forward and reverse) were synthesized: 

f^rv^j. ^ i.rp primer lAMfig m S'.CTGATCCGGTrCTTGGTGCCCCTG-3' (SEQ ID NO:!!) 
^^ y^ PTP prin^r 1 i%MM.m S'-GCTCTGTCACTCACGCTG-3- (SEQIDN0:!2) 
f^r^,^ prPnHn,.,inMfi9 0^ 5'.TCATCTCTTCCCTCTCCC-3' (SEQ ID N0:13) 
fnr«.rH PTP nrimer i/iM69.f4^ 5'-CCTTCCGCGACGGAGTrG-3' (SEQ ID N0:14) 
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revcrsff prR rriip^r i f^fHfjg r jj 5--ggcaaagtccactccgatgatgtc-3' (seq id no:I5) 

revpffC PCR primtr 2 r}64(i9 r?J 5 -GCCTGCTGTGGTCACAGGTCTCCG-3' (SEQ ID NO: 16) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA36469 
sequence which had the following nucleotide sequence 
hybridization pwhe nM^yOpp 

5 -TCGGGGAGCAGGCCTTGAACCGGGGCATTGCTGCTGTCAAGGAGG-3' (SEQ ID N0:17) 

In order to screen several Ubraries for a source of a full-length clone. DNA from the libraries was screened 
by PGR amplificanon with the PGR primer pair identified above. A positive library was then used to isolate clones 
encoding the PR0274 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of 
the cDNA libraries was isolated from human fetal liver ussue (L1B229). 

DNA sequencing of the clones isolated as described above gave the Ml-lengdi DNA sequence for PR0274 
[herein designated as UNQ24I (DNA39987-1 184)] (SEQ ID N0:1) and the derived protein sequence for PR0274. 

The entire nucleotide sequence of UNQ241 (DNA39987-1 184) is shown in Figure 3 (SEQ ID NO 6) Clone 
UNQ241 (DNA39987-1 184) contains a single open reading frame wim an apparem transUtional initiation site a, 
nucleotide posinons 83-85 and ending at the stop codon at nucleotide positions 1559-1561 (Figure 3). TTie predicted 
polypepdde precursor is 492 ami™, acids long (F.gure 4). has an estimated molecular weight of about 54.241 daltons 
and an csdmaied pi of about 8.21 . Clone UNQ241 (DNA39987-1 184) has been deposited with ATCC and is assigned 
ATCC deposit no. 209786. 

Analysis of the amino acid sequence of die fiill-length PR0274 polypeptide suggesB that ii possesses 
significant homology to the Fn54 protein. More specifically, an analysis of die Dayhoff database (version 35.45 
SwissProt 35) evidenced significant homology between the PR0274 amino acid sequence and the foUowing Dayhoff 
sequences. MMFN54S2J. MMFN54SI_1. CELF48C1_8. CEF38B7.6. PRP3_RAT. INL3 PIG. MTCY07A7 13 
YNAX_ICLEAE. A47234 and HME2 MOUSE. 
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EXAMPLF S : Isolation rT>[^^ nffllH F p codinp Hum^r Po nr^^ 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 above, 
wherein the consensus sequence obtained is herein designated DNA35930. Based on the DNA35930 consensus 
sequence, oligonucleotides were synthesized: 1) to idenrify by PCR a cDNA library dut contained die sequence of 
imerest. and 2) for use as probes to isotote a clone of die ftilHengdi coding sequence for PRO300. 
Forward and reverse PCR primers were synthesized: 
30 forward Prp primrr I CIJ S'-GCCGCCTCATCTrCACCTTCTrCC-S- (SEQ id NO:20) 

forward PTF Primer 2 f3g?30 , f2) 5'-TCATCCAGCTCGTGCTGCTC-3' (SEQ ID N0:21) 
fgrwardPTF Primer 3 f3 .- ^930n } 5'-CTrCTTCCACTrCTGCCTGG.3' (SEQ ID N0:22) 
i^mn Prp Primer 4 f3S930 f4) 5'-CCTGGGCAAAAATGCAAC-3' (SEQ ID NO:23) 

reverse Prp primrr I O^m rl) 5'-caggaatgtagaaggcacccacgg-3' (seq id no:24) 
reverse PCR Primer 2 (3?930 , r?) 5'-tggcacagatcttcacccacacgg-3- (SEQ id no:25) 

Additionally, a syndietic oligonucleotide hybridization probe was constructed ftom die consensus DNA35930 
sequence which had the following nucleotide sequence 

hvbrirfiMiin^ p^fT^f n'r°^ft nn 

5'.TCTCCATCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTG.3- (SEQ id N0:26) 
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In order lo screen several Ubraries for a source of a ftill-lengih clone. DNA from the libraries was screened 
by PCR aniplificanon with the PCR pnmer pair ideniificd above. A positive library >vas then used to isolate clones 
encoding the PRO300 gene using the probe oligonucleotide and one of the PCR primers RNA for construction of 
ihe cDNA libraries was isolated from human fetal kidney tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PRO300 
5 {herein designated as UNQ263 (DNA40625-1 189)) (SEQ ID NO: 18) and the derived protein sequence for PRO300 
The entire nucleotide sequence of UNQ263 {DNA40625.il 89) U shown in Figure 8 (SEQ ID N0:18). 
Clone UNQ263 (DNA40625-1 189) contains a single open reading fnune with an apparent translational initiation site 
at nuclcoude positions 45-»7 and ending at the stop codon at nucleotide positions 1416-1418 (Figure 8). The 
predicted polypeptide precursor is 457 am.no acids long (Figure 9). Clone UNQ263 (DNA40625.1189) has been 
10 deposited with ATCC and js assigned ATCC deposit no. 209788. 

Analysis of the amino acid sequence of the full-length PRO300 polypeptide suggests that ponions of it 
possess significant homology to the Diff 33 protein. More specifically, an analysis of the DayhofT daubase (version 
35.45 SwissProt 35) evidenced significant homology between the PRO300 amino acid sequence and the following 
Dayhoff sequence. HSU49188_l. 
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FXAMPLE 6 - nn l niiTTn nf fPNft r'""''" E"'-"'lini> Htiman PRQ284 

Two cDNA sequences were isolated in the amylase screen described in Example 2 and those cDNA 
sequences are herein designated DNA12982 (see Figure 12; human placenta-derived) and DNA15886 (see Figure 
13; human salivary gland-derived). n>e DNA12982 and DNA15886 sequences were then clustered and aligned. 
20 giving rise to a consensus nucleotide sequence herein designated DNA18832. 

Based on the DNA18832 consensus sequence, oligonucleotide probes were generated and used to screen 
a human placenta Ubrary (LIB89) prepared as described in paragraph 1 of Example 2 above. n« cloning vector was 
pRKSB (PRK5B is a precursor of pRK5D that does not contain the Sfil site; see. Holmes ei al.. SciaiEfi. 25i.l278- 
1280 (1991)). and the cDNA size cut was less than 2800 bp. 
25 PCR primers (forward and reverse) were synthesized: 

f,r-,ri ?r? r^-' ' '"'^ 5-.TCGTACAGTrACGCTCTCCC-3- (SEQ1DN0:31) 

f^.. ,,.orPnnn,.rin8«2.n 5'-CTrGAGGAGCGTCAGAAGCG-3- (SEQ ID NO:32) 
P„ ,r.>PrPnnn^rn8832 rt 5'-ATAACGAATGAAGCCTCGTG-3- (SEQ ID NO:33) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA18832 sequence which 
30 had the following nucleotide sequence 

5--GCTAATATCTGTAAGACGGCAGCTACAGCAGGCATCATrG-3' (SEQ ID NO:34) 

to Older to screen several Ubraries for a source of a hiU-length clone. DNA from the libraries was screened 
by PCR amplification with the PCR primer pairs identified above. A positive library was then used to isotate clones 
35 encoding the PR0284 gene using the probe oligonucleoude and one of the PCR primers. 

A fidl length clone was identified that contained a single open reading frame with an apparent translational 
initiation site at nucleoride posiuons 167-169 and ending at the stop codon found at nucleoude positions 1022-1024 
(Figure 10; SEQ ID NO:27). The predicted polypepude precursor is 285 amino acids long, has a calculated 
nwlecular weight of approximately 32.190 daltons and an estimated pl of approxmiately 9.03. Analysis of the fiill- 
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length PR0284 sequence shown in Figure 1 1 (SEQ ID NO:28) evidences the presence of the following, a signal 
peptide from about ammo acid 1 u, about amino acid 24. transmembrane domains from about amino acid 76 to about 
amino acid 96 and from about amino acid 171 to about ami™ acid 195 and a potendal N-glycosyla.ion site from about 
ammo acid 153 to about amino acid 156. Clone UNQ247 a)NA23318-121 1) has been deposited with ATCC on April 
21. 1998 and is assigned ATCC deposit no. 209787. 

Analysis of the amino acid sequence of the fiill-Iength PR0284 polypepdde suggests Aa, „ possesses no 
significant sequence similarity to any known protein. However, an analysis of the Dayhoff database fversion 35 45 
SwissProt 35) evidenced some degree of homology between the PR0284 amino acid sequence and the foUowing 
Dayhoff sequences. JQ0124. CELE04A4_5. AB006451_1. AF030162J. 1M23_YEAST, S71194. NIA CUCMA 
IM17_yEAST. 150479 and HUM2FHP_1. 

EXAMPLE 7 : Isolation nf ^;nfM^ r\ ^ ^^ E ncoding Hi.mr^p PPmoA 

A cDNA sequence isolated in the amylase screen as described in Example 2 above was found, by BLAST 
and FastA sequence alignment, to have sequence homology to a nucleotide sequence encoding sarcoma-associated 
pioiein SAS. TTus cDNA sequence is herein designated DNA23020 (see Figure 16). TTie DNA23020 sequence was 
then compared to a variety of expressed sequence tag (EST) daubases which included public EST databases (e g 
GenBank) and a proprieun- EST DNA database (UFESEQ™. Incy«: Pharmaceuticab. Palo Alto. CA) to idemily 
existing homologies. Tbt homolt^y search was peribrmed using die computer program BLAST or BLAST2 (Alishul 
et al.. Metho4?inPn ^ vmol nej f 2fiS:460^ (1996)). Those comparisons resulting in a BLAST score of 70 (or in 
some cases 90) or greater that did no. encode known proteins were clustered and assembled mto a consensus DNA 
sequence with the program -phrap" (Phil Green. University of Washington. SeatUe. Washington; 
http://bo2eman.mbt.washington.edu/phrap.docs/phrap.hm1]). The consensus sequence obtained thereftom is herein 
designated DNA35858. Two proprietary Genentech ESTs were employed in the assembly wherein those EST 
sequences are herein identified as DNA21971 (Figure 17: SEQ ID NO:38) and DNA29037 (Figure 18- SEO ID 
NO:39). 

Based on die DNA35858 consensus sequence. oUgonucleotide probes were generated and used to screen 
a human kidney library (UB228) Ubrary prepared as described in paragraph 1 of Example 2 above. TTie cloning 
vector was pRKSB (pRK5B is aprccursor of pRK5D that does not coraain the Sfil site; see. Hotaics et 3i..Sa£m 
252:1278-1280 (1991)). and the cDNA size cut was less than 2800 bp. 

PGR primers (forward and reverse) were synthesized: 
forward PCP pritngr 1 Hf/^^ff fl) 5'-ACCCACGTCTGC0TTGCTGCC-3' (SEQ ID NO:40) 
forward PCR Primer 2 niSlrS f?) 5'-gagaatatgctggagagg-3' (SEQ ID N0:4i) 

gvcrSCPCR Primgffg f ^l^SSr l ) S -AGGAATGCACTAGGATTCGCGCGC-S' (SEQ id N0:42) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA35858 
sequence which had the following nucleotide sequence 
35 hybridization nrnh.. nsBsff 

5'-GGCCCCAAAGGCAAGGACAAAGCAGCTGTCAGGGAACCTCCGCCG-3' (SEQ id N0:43) 

In Older to screen several fibraries for a source of a fiill-Iength clone. DNA from the libraries was screened 
by PGR amplification wid, the PCR primer pair idemified above. A positive library was then used to isolate clones 
encoding the PR0296 gene using the probe oligonucleotide and one of the PCR primers. 
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A full length clone was identified that contained a single open reading frame w.th an apparent iranslational 
initiation site at nucleotide positions 174-176 and ending at the stop codon found at nucleotide positions 786-788 
(Figure 14; SEQ ID NO:35). The predicted polypeptide precursor is 204 amino acids long, has a calculated 
molecular weight of approximately 22. 147 daltons and an estimated pi of approximately 8.37. Analysis of the full- 
length PR0296 sequence shown in Figure 15 (SEQ ID NO:36) evidences the presence of the following: a signal 
peptide from about amino acid 1 to about amino acid 34 and transmembrane domains from about amino acid 47 to 
about amino acid 63. from about amino acid 72 to about amino acid 95 and from about ammo acid 162 to about amino 
acid 182. Clone UNQ260 (DNA39979-1213) has been deposited with ATCC on April 21. 1998 and is assigned 

ATCC deposit no. 209789. 

Analysis of the amino acid sequence of the full-length PR0296 polypeptide suggests that it possesses 
significam sequence similarity to the sarcoma-amplifed SAS protein, thereby indicating that PR0296 may be a novel 
SAS homolog. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced 
significant homology between the PR0296 amino acid sequence and the following Dayhoff sequences. 158391. 
GEN11061. SSC2B04.1. HSU81031_2. CD63_RAT. CD63_MOUSE. CD63_HUMAN. AF022813_1. 
CD63_RABIT and CO02_HUMAN. 

15 

FX AMPLE 8 - l-tTlnii-r rPNi^ Q"^* Enrorting Hnirmn PRQ329 

Aconsensus sequence was obtained relative toavarietyofEST sequences as described in Example 1 above. 

wherein the consensus sequence obtained is herein designated DNA35612. Based on the DNA35612 consensus 
sequence, oligonucleotides were synthesized: 1) to identify by PGR a cDNA library that contained the sequence of 
20 interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR0329. 
PGR primers (forward and reverse) were synthesized: 
f , p..,,.prPnH^.rinS612.fn 5'-TGGGCTGTGTCCTCATGG-3- (SEQ ID NO:46) 
..p.,, ^ nHn,., 7 nS612.f2^ 5--nTCGAGGGCGAATrCTG.3- (SEQIDNO:47) 
,......prn.w,.m6.2.rl^ 5--AG-rrGTrGGAGTGTGATAGCCAG-3- (SEQ ID NO:48) 

, prPnHn...in5612.r2> 5-.AAACTrGGTTGTGGTCAGTGGGTG-3- (SEQ ID NO:49) 

Additionally, a synthedc oUgonudeotide hybridization probe was constr^ted fiom the consensus DNA3S612 
sequence which had the following nucleotide sequence 

5 .GTGAGGGACCTarcTCCACTGAGGAGAGCAGGTGCCACACGGAGG.3- (SEQ ID NO:50) 

In order to screen several libraries forasource of a full-length clone. DNA from the libraries was screened 
by PGR ampUficadon with the PGR primer pairs identified above. A positive library was dien used to isolate clones 
encoding the PR0329 gene using the probe oligonucleotide and one of the PGR primers. RNA for consuuction of 
the cDNA libraries was isolated from human fetal liver tissue (LIB6). 

DNA sequencing of the clones isolated as described above gave the fulWength DNA sequence for PR0329 
35 (herein designated as UNQ291 (DNA40594-1233)1 (SEQ ID NO:44) and the derived protein sequence for PR0329. 

The entire nucleotide sequence of UNQ291 (DNA40594.1233) is shown in Figure 19 (SEQ ID NO:44). 
CloiK UNQ291 (DNA40594.1233) contains a single open reading frame with an appartm translational initiation site 
at nucleotide positions 9-11 and ending at the stop codon a, nucleotide positions 1086-1088 (Figure 19). The 
predicted polypeptide precursor is 359 ammo acids long (Figure 20). The full-length PR0329 protein shown in 
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Figure 20 has an «,u„a.cd molecular we,gh. of about 38.899 dahons and a pi of abou. 5.21 Clone UNQ-'9, 
(DNA40594-1233) has been deposited with ATCC on Februan- 5. 1998 and .s assigned ATCC deposit no. 209617 
Analysts of the amino acid seque«e of the full-length PR0329 polypeptide suggesu that possesses 
s.gndicam sequence similarity ,o a high affinity immunoglobulin F. receptor protein. More specifically an analysis 
of Uie Dayhoff daubase (version 35.45 SwissPro. 35) evidenced sigmficam homology between the PR03^9 amim, 
5 acid sequence and die following Dayhoff sequences. FCGl.HUMAN. FCGO HUMAN. P R91439 P R22549 
P_R91438. P_W00859. P_R208II. P_R22550. HUMCD6406J and FCC1_M0USE. 

EXAMPLE 9 : Isolation nf rp>i^ r| » p .. c.ncodinp Hn^i p n PP ^^f> 

A consensus sequence was obained relative ,o a vanery of EST sequences as described in Example I above 
10 wherein the consensus sequence obtained is herein designated DNA42257. Based on die DNA42257 consensus 
sequence, oligonucleotides were synthesized: 1) to idemily by PCR a cDNA library that contained the sequem* of 
interest, and 2) for use as probes to isolate a clone of the ftiU-length coding sequence for PR0362. 
PCR primers (forward and reverse) were synthesized; 

f9rwartPCRnrim>Tlf^^^'i7fi) 5-tatccctccaattcagcaccctgg-3- (seq id no-53) 
'5 fPrwar^ PCR nrimrr ^ n 5 -gtcggaagacatcccaacaag-3- (Seqidno:54) 
averse PCR primrr I f4??s7 ri) 5--CTrcACAATGTCGCTGTGCTGcrc-3' (Seq id no.-ss) 

BverS^PCR primrr?(4??^7r^) 5 -AGCCAAATCCAGCAGCTGGCrrAC-3- (SEQ ID NO-56) 
Additionally, a synthetic ohgonucleo.ide hybridization probe was constructed from the consensus DNA42257 
sequence which had die following nucleotide sequence 
20 hvbridizafinti nrohe f4??S7 p]) 

5'-TGOATOACCGOAGCCACTACACGTGTGAAGTCACCTGGCAGACTCCTGAT-3- (SEQ ID NO:57) 

border ,0 screen several Ubnmes forasource of a full-length clone. DNA from the libraries was screened 
by PCR amplification with d« PCR primer pairs identified above. A positive library was tiien used to isolate clones 
encodmg die PR0362 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of 
-5 the cDNA hbraries was isolated from human fetal brain tissue (UB153). 

DNA sequencing of the clones isolated as described above gave die full-length DNA sequence for PR0362 
Ihereindesignated as UNQ317 (DNA45416-1251)) (SEQ ID N0:51) and die derived protein sequence for PR036. 

The entire nucleotide sequence of UNQ317 (DNA454I6-I251) is shown in Figure 21 (SEQ ID N0-5I) 
Clone UNQ317 (DNA454.6-1251) contains a single open reading frame witi, an apparent translational initiation site 
50 at nucleotide positions 119-121 and ending at the stop codon a, nucleotide positions I082-I084 (Figure 21) TTie 
predicted polypeptide precursor is 321 amino acids long (Figure 22). The full-length PR0362 p««ein shown in 
F.gu« 2 has an estimated molecular weight of about 35.544 daltons and a pi of abou. 8.51 . Analysis of the ftilHengU, 
PR0362 polypeptide as shown in Figure 22 evidences the presence of a glycosaminoglycan anachmem site at abou. 
ammo acid 149 » about ami«, acid 152 and a tnmsmembrane domain from about amino acid 276 to about amino acid 

5 306. CloneUNQ317(DNA454,6-125.)hasbeendeposi.edwid.ATCConFebnuiry5. 1998 and U assigned ATCC 
deposit no. 209620. 

Analysis of d« amino acid sequence of ti« full-length PR0362 polypeptide suggests da. i. possesses 
significant seque^re similarity to die A33 antigen p„«i„ a«l ti« HCAR protein. More specifically, an analyse of 
the Dayhoffdatabase (version 35.45 SwissPro. 35) evidenced significan. homology between Uk PR03d2 amino acid 
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sequence and the following Dayhoff sequences. AB00234,_1. HSUS5258.1. HSC7NRCAM.1. RNU81037_1. 
A33 HUMAN. P_W14158. NMNCAMRIJ. HSTITINNZJ. S71824.1 and HSU6304!_1 

PXA f^P' g tQ ^ kniaiionof cHNA rioncs Enco^ l inf H<mnPR03W 

A consensus sequence was obtained relative to a vaneiy of EST sequences as described in Exan^le 1 above. 
5 wherein the consensus sequence obuuned is herein designated DNA42828. Based on the DNA42828 consensus 

sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library d«t contamed the sequence of 

interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR0363. 
A pair of PCR primers (forward and reverse) were synthesized: 

^,^.., ,^orPnrin.W42«28.fn S'-CCAOTGCACAGCAGGCAACGAAGC-B' (SEQ ID NO:60) 
10 ,,..,n- r"-^ <^^^^^^^ S-.ACTAGGCTGTATGCCTGGGTGGGC.3- (SEQ ID N0:6l) 

Additionally, a synthetic oligonucleotide hybridizaiion probe was constmcted from the consensus DNA42828 

sequence which had the following nucleotide sequence 

^y^^i'1iya^iff" p*^"*^ uisii.oi) 

5 -GTATGTACAAAGCATCGGCATGGTTGCAGGAGCAGTGACAGGC.3- (SEQ ID NO:62) 
15 In order to screen several Ubraries for a source of a fiill-length clone. DNA from the libraries was screened 

by PCR ainpUficaaon with the PCR primer pair identified above. A positive library was then used to isolate clones 
encoding the PR0363 gene using the probe oligonucleotide and one of die PCR primers. RNA for construction of 
the cDNA libraries was isolated from human fetal kidney ussue (UB227). 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR0363 
20 [herein designated as UNQ318 (DNA45419-1252)] (SEQ ID NO:58) and the derived protein sequence for PR0363. 

The entire nucleotide sequence of UNQ318 (DNA45419-1252) is shown b Figure 23 (SEQ ID NO:58). 
Clone UNQ318 (DNA45419-1252) contains a single open reading frame with an apparent transUtional initiation site 
at nucleotide positions 190-192 and ending at the stop codon at nucleotide positions I309-I311 (Figure 23). Tbt 
predicted polypeptide precursor is 373 amino acids long (Figure 24). Tlie full-length PR0363 protein shown m 
25 Figu,e24hasa«estimatedmolecularweightofabou,41.281daItonsandapIofabout8.33. A transmembrane 
domain exists ai amino acids 221 to 254 of the amino acid sequence shown in Figure 24 (SEQ ID NO:59). TTie 
PR0363 polypeptide also possesses at least two myelin PO protein domains from about amino acids 1 5 to 56 and from 
about amino acids 87 to 116. Clone UNQ318 (DNA45419-1252) has been deposited with ATCC on Febnmy 5, 
1998 and is assigned ATCC deposit no. 209616. 
30 Analysis of the amino acid sequence of the full-length PR0363 polypeptide suggests that it possesses 

significant sequence similarity to die cell surface protein HCAR. diereby indicatmg that PR0363 may be a novel 
HCAR homolog. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced 
significant homology between the PR0363 amino acid sequence and the following Dayhoff sequences. HS46KDA_1 . 
HSU90716.1. MMCARHJ. Ml^CARHOM.l. MMU90715.1. A33_HUMAN. P W14146. P_W14l58. A42632 
35 and 842632. 

FVAMPi-E w - hiTlMiTm ftf rPH* f"^"^^ Fnrodini' Human PRQ8« 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 above, 
wherein the consensus sequence obtained is herem designated DNA38133. Based on die DNA38133 consensus 
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sequence. oligonudeoUde, were sy„thes«ed: 1) .o identify by PCR a cDNA libra^. conuined a« sequence of 
interest, and 2) for use as probes to isolate a clone of the fiiU-lcngth coding sequence for PR0868. 
A pair of PCR primers (forward and reverse) were synthesized: 

S aim<i?c;n mmrmv fi) s'-gtagcagtgcacatggggtgttggo- (seq id no:65) 

revwcPCffprimrrn^nVD S'-ACCGCACATCCTCAGTCTCTCTCC-3- (SEQ ID NO:66) 
Additionally, a synthetic oligonucleot.de hybridization probe was constructed from the consensus DNA38I33 
sequence which had the following nucleotide sequence 

hvbridi2atinn prnt^ pi) 

S'-ACGATGATCGCGGGCTCCCrrCTCCTGCTrGGATTCCTTAGCACCACCAC-B- (SEQ ID MO:67) 

In order to screen sevctal Ubranes for a source of a full-length clone. DNA from the libraries was screened 
by PCR amplification with the PCR prm«r pair identified above. A positive library was then used to isolate clones 
encodmg the PR0868 gene using the probe oligonudeoUde and one of the PCR primers. RNA for construction of 
the cDNA libraries was isolated from human fetal kidney tissue <LIB227). 

DNA sequencing of the clones isolated as described above gave the fiilHength DNA sequence for PR0868 
[herein designated as UNQ437 (DNA52594..270)) (SEQ ID NO:63) and the derived protein sequence for PR0868 
The entire nucleotide sequence of UNQ437 (DNA52594-1270) is shown in Figure 25 (SEQ ID NO-63> 
Clone UNQ437 (DNA52594-,270) contains a single open reading frame with an apparent translaHonal initiation site 
at nucleotide positions 325-327 and ending at die stop codon at nucleodde positions 2290-2292 (Figure 25) The 
predicted polypeptide precursor is 655 amino acids long (Figure 26). The full-lengm PR0868 protein shown in 
Figure 26 has an estimated molecular weight of about 71.845 dal.ons and a pi of about 8.22. Analysis of the fiJI- 
20 length PR0868 pol>peptide sequence demonstrates the presence of conserved cysteine-comaining domains from about 
amtno add 66 to about ami™, acid 78 and from about amino acid 1 23 to about amino acid 1 34 of the seque^^e shown 
in Figure 26 (SEQ ID N0:3). a TNFR deadi domain from about amino acid 85 to about amino acid IIO a 
FASA.mousedeaUidomainblockfromabout amino acid 159 to about amino acid 175 and a transmembrane domain 
from about amim, acid 347 to about amino acid 375. Clone UNQ437 (DNA52594-1270) has been deposited with 
25 ATCC on Mardi 17. 1998 and is assigned ATCC deposit no. 209679 

Analysis of the ammo acid sequence of the full-length PR0868 polypeptide suggeso that i, po«es,es 
significant sequence similarity to die mmor necrosis factor receptor protein, thereby indicating that PR0868 may be 
a novd member of the tumor necrosis factor receptor family. More specifically, an analysis of die Dayhoff database 
(version 35.45 SwissProt 35) evidenced significant homology between die PR0868 amino acid sequence and the 
30 followmg Dayhoff sequences. RNU94330_1. P_R99933. P_R99945. P_R99950. HSU94332 1. CD40 HUMAN 
S63368_l, TNR2_HUMAN. MVU87844_| AND CVU87837_1. 

EXAMPlrP, 1? : iMlMion of rPNA rinnP^ f - n r ftdi"" Hnm^p f^ n^^ 

A consensus sequence was obtained refative to a variety of EST sequences as described in Example 1 above 
35 wherem .he consensus sequence obtained is herein designated DNA30892. Based on tiie DNA30892 consensus 
sequence. ohgomKleotides were synthesized: I) to identify by PCR a cDNA library tiiat contained die sequence of 
interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR0382. 
A pair of PCR primers (forward and reverse) were symhcsized: 

fc rvanl PCR primrr 5--tgacatcccccttatgaagctggc-3" (SEQIDno:70) 
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^.uf ^,.PrR nnmer 5-.TACACGTCCCTGTGGTTGCAGATC-3' (SEQ1DN0:71) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from ihc consensus DNA30892 

sequence which had the following nucleotide sequence 

t^y^iri^iTatinn nrobe 

5'.CGTrCAATGCAGAAATGATCCAGCCTGTGTGCCTGCCCAACTCTGAAGAG-3' (SEQID N0.72) 
5 In order to screen several libraries for a source of a full-length done. DN A from the libraries was screened 

by PGR amplification with the PGR primer pair identified above. A positive library was then used to isolate clones 
encoding the PR0382 gene using the probe oligonucleotide and one of the PGR primers. RNA for construction of 
the cDNA libraries was isolated from human fetal kidney tissue (LIB227), 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR0382 
10 Ihercin designated as UNQ323 (DNA45234-1277)1 (SEQ ID NO:68) and the derived protein sequence for PR0382. 

The entire nucleotide sequence of UNQ323 (DNA45234-1277) is shown in Figure 27 (SEQ ID NG:68). 
Clone UNQ323 (DNA45234-1277) contams a single open reading frame with an apparent translational initiation site 
at nucleotide positions 126-128 and ending at the stop codon at nucleotide positions 1485-1487 (Figure 27). The 
predicted polypeptide precursor is 453 amino acids long (Figure 28). Tlie full-length PR0382 protein shown in 
15 Figure 28 has an estimated molecular weight of about 49.334 daltons and a pi of about 6.32. Analysis of the native 
PR0382 amino acid sequence shown in Figure 28 (SEQ ID NO:69) indicates the presence of a putative 
oansmembrane domain from about amino acid 240 to about amino acid 284. a putative signal peptide at about amino 
add 1 to about amino acid 20. a puutivc apple domain at about amino acid 386 to about amino acid 419. a putative 
Kringlc domain at about amino acid 394 to about amino acid 406 and a histidine-containing protease active site at 
20 about amino acid 253 to about amino acid 258. Clone UNQ323 {DNA45234-1277) has been deposited with ATCC 
on March 5. 1998 and is assigned ATCC deposit no. 209654 

Analysis of the amino acid sequence of the full-lengtii PR0382 polypeptide suggests that it possess 
significant homology to serine protease proteins, thereby indicating tiiat PR0382 may be a novel serine protease. 
Specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced significant homology 
25 between the PR0382 amino acid sequence and the following Dayhoff sequences. HSU75329.1 . ENTK.MOUSE. 
HEPS_HUMAN. AFO30065_l. HEPS.RAT. PU^N.PIG. P_R89430. P_R89435. PLMN.HORSE. PLMN_BOVIN 
and P_R83959. 

PVAMPi J. 13 - Mmn ff rPN* FnrnHinp Human PRQ?4g 

30 A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 above, 

wherein the consensus sequence obtained is herein designated DNA44706. An EST proprietary to Genemech was 
employed in the consensus assembly and is herein designated DNA13217 (Figure 31: SEQ ID NO-.75). Based on 
the DNA44706 consensus sequence, oligonucleotides were syntiiesized: I) to identify by PGR a cDNA library that 
comained the sequence of in«:rest. and 2) for use as probes to tsolate a clone of the fuU-length coding sequence for 

35 PR0545. 

Forward and reverse PCR primers were synthesired: 
f»^,rHPrBnrin.erl S'-GTCTCAGCACGTGTTCTGGTCTCAGGG-S' (SEQ ID NO:76) 
f»,^.,HPrRnrimer2 5 -CATGAGCATGTGCACGGC-3- (SEQ ID NO:77) 
f»T~f,Mi»rPnr.mer3 5 -TACCTGCACGATGGGCAC-3" (SEQ ID NO:78) 
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for^vardPCRnnmffr a 5 -CACTGGGCACCTCCCnTC-3' (SEQ id N0:79) 
rever?^ PC>^ primffr I S'-CTCCAGGCTGGTCTCCAAGTCCTTCC-S- (SEQ id NO-80) 
ffvcniC PCR pnmgr S -TCCCTGTTGGACTCTGCAGCTTCCO' (SEQ id N0:81) 
rcvcrs? FCR nrimcr ^ 5 -CTTCGCTGGGAAGAGTTTG-3' (SEQ id N0:82) 

Addhionally. a synthetic oUgonudcotide hybridization probe was constricted fn,m the conscnsiB DNA44706 
sequence which had the following nucleotide sequence 

hybridization pri^hf 

J-GTGCAACCAACAGATACAAACTCTTCCCAGCGAAGAAGCTGAAAAGCGTC-3- 
(SEQ ID N0:83) 

In crier to screen several Ubraries for a source of a full-length clone. DNA from the libraries was scrcei«d 
by PGR amplification with one of the PCR primer pa,rs .demified above. A positive library was then used to isolate 
clones encoding the PR0545 gene usmg the probe oligonucleotide and one of the PCR primers. RNA for 
construction of die cDNA libraries was isolated from human placenta tissue (UB90). 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR0545 
(herein designated as UNQ346 (DNA49624-1279)1 (SEQ ID NO:73) and dte derived protein sequence for PR0545 

The emire nucleotide sequence of UNQ346 {DNA49624-1279) is shown in Figure 29 (SEQ ID NO-73) 
Clone UNQ346 (DNA49624-I279) contains a single open reading frame with an apparem translational initiation siu: 
at nucleotide positions 311-313 and ending at the stop codon at nucleotide positions 2516-2518 (Figure 29) Tht 
predicted polypeptide precursor is 735 amino acids long (Figure 30). TTte fall-length PR0545 protein shown in 
Figure 30 has an estimated molecular weight of about 80.177 daltons and a pi of about 7.08. Important regions of 
the PR0545 amino acid sequence include the signal peptide, corresponding to amino acids 1-28. five potential 
N-glycosylarion sites, from about amino acid 1 11-1 14. amino acids 146-149. amino acids 348-351 , amino acids 449- 
452. and amino acids 648-651. and a neutral zinc metallopeptidase. zmc-binding region signature sequence from 
about am.no acids 344-353. Clone UNQ346 (DNA49624-1279) has been deposited wiU. ATCC and is assigned 
ATCC deposit no. 209655. 



EXAMPI.F. 14 : Isolation of cDNA rinnP« Fnr^ing H..n.an Pp r^f iiy 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 above, 
wherein the consensus sequence obtained is herein designated DNA42798. Based on tht DNA42798 sequence 
oligonucleotides were synthesi^d: 1) to identify by PCR a cDNA library ti,at contained the sequence of interest, and 
2) for use as probes to isolate a clone of Uie fiill-lengdi coding sequence for PR0617. 

A pair of PCR primers (forward and reverse) were syndiesized: 
fgPVard PTR nrimfr 5"-ACGGGCACACTGGATCCCAAATG-3' (SEQ id N0:86) 

rcvcrsf PCR primf r 5'-ggtagagatgtagaagggcaagcaagacc-3' (seq id no:87) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from tiie consensus DNA42798 
35 sequence which had die following nucleotide sequence 
hvbririiTfit^ftn pfn^frp 

5'-GCTCCCTACCCGTGCAGGTrrCTTCATTrGTrCCTrrAACCAGTATGCCG-3- (SEQ ID NO:88) 

In Older to screen several libraries for a source of a full-length clone. DNA from the libraries was screened 
by PCR amplification witii tfie PCR primer pair identified above. A positive libraty was tfien used to isolate clones 
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encoding the PR0617 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of 
the cDNA libraries was isolated from human fetal kidney tissue (LIB227) 

DNA sequencing of the clones isolated as described above gave the fiiU-length DNA sequence for PR0617 
(herein designated as UNQ353 (DNA48309-1280)) (SEQ ID N0:1) and the derived protein sequence for PR0617 
The entire nucleotide sequence of UNQ353 (DNA48309-1280) is shown in Figure 32 (SEQ ID NO:84) 
5 Clone UNQ353 (DNA48309-1280) contains a single open reading frame wiih an apparent translauonal initiation site 
at nucleotide positions 723-725 and ending at the stop codon at nucleotide positions 924-926 (Rgure 32). The 
predicted polypeptide precursor is 67 amino adds long (Figure 33). The full-length PR0617 protein shown in Figure 
33 has an esnmatcd molecular weight of about 6,981 daltons and a pi of about 7.47. Analygs of the PR0617 amino 
acid sequence also evidences the existence of a putative signal peptide from about amino acid 15 to about amino acid 
10 27 and a putative protein kinase C phosphorylation site from about amino acid 41 to about amino acid 43. Clone 
UNQ353 (DNA48309-1280) has been deposited on March 5. 1998 with ATCC and is assigned ATCC deposit no. 
209656. 

Analysis of the amino acid sequence of the full-length PR0617 polypeptide suggests that it possesses 
significant homology to the CD24 protein, thereby indicating that PR0617 may be a novel CD24 homolog. More 
15 specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced significant homology 
between die PR06I7 amino acid sequence and the following Dayhoff sequences. CD24_HUMAN. CD24_MOUSE. 
S15785. CD24_RAT, VGE BPG4. MSE5_HUMAN. HSMHC3W36A_2. MLU15184_8. P R85075. SEPL_HUMAN 
and MTCY63_13. 

20 FY AMPLE IS : T«nlariftn of cDNfr rinng!! FncodinC Hliman PR07W 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 above, 
wherein the consensus sequence obtained is herein designated DNA30837. Based on the DNA30837 consensus 
sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of 
interest, and 2) for use as probes to isolate a clone of die hill-lcngth coding sequence for PRO700. 
25 Forward and reverse PCR primen were synthesized: 

fn»u.rri PCR nrimer 1 5'-ATGTTCTTCGCOCCCTGGTG-3' (SEQ ID NO:91) 
f»n«,rdPCRnrimer2 5 -CCAAGCCAACACACTCTACAG-3' (SEQ ID NO:92) 
r.v>,^PrR nrimer 1 S'-AAGTGGTCGCCTTGTGCAACGTGC-S- (SEQ ID NO:93) 
,.v.».PrR nrimer 2 5'-GGTCAAAGGGGATATATCGCCAC-3' (SEQ ID NO:94) 
30 AdditionaUy. a synthetic oligonucleotide hybridization probe was constructed from die consensus DNA30837 
sequence which had the foltowing nucleotide sequence 
hvhridi/arion probe 

5'-GCATGOAAGATGCCAAAGTCTATGTCGCTAAAGTGGACTGCACGGCCCA-3' 
(SEQ ID NO:95) 

35 In order to screen several Ubraries for a source of a full-lengtii clone. DNA from the libraries was screened 

by PCR an^lification with one of the PCR primer pairs identified above. A positive Ubrary was then used to isolate 
clones encoding the PRO700 gene using the probe oligomideotide and one of the PCR primers. RNA for 
construction of the cDNA libraries was isolated from human fetal kidney tissue (UB227). 
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DNA sequencing of the clones isolated as descnbed above gave the full-length DNA sequence for PRO700 
[herein des.gnau»i as UNQ364 (DNA46776-1284)1 (SEQ ID NO:89) and the derived prote.n sequence for PRO700. 

The entire nucleotide sequence of UNQ364 (DNA46776-1284) is shown in Figure 34 (SEQ ID NO-89) 
Clone UNQ364 (DNA46776-1284) contains a single open reading frame with an apparent transiattonal initiation site 
at nucleotide positions 33-35 and ending at the stop codon at nucleotide positions 1329-1331 (Figure 34). Tbc 
predicted polypeptide precursor is 432 amino acids long (Figure 35). The full-length PRO700 protein shown in 
Figure 35 has an estimated molecular weight of about 47.629 daltons and a pi of about 5.90. Important regions of 
the amino acid sequence of PRO700 include the signal peptide, corresponding to amino acids from about 1 to 33, 
regions homologous to disulfide isomerase. corresponding to amino acids from about 82-99. 210-255. and 345-36o! 
a tyrosine kinase phosphorylation site, corresponding to amino acids from about 143-151. and an cndoplasmi<l 
reticulum targedng sequence, corresponding to amino acids from about 429-432. Clone UNQ364 (DNA46776.1284) 
has been deposited with ATCC and is assigned ATCC Deposit No. 209721. 



EXAMPLE 16 : Isolation nfcnN4 | rj^^ r ^ Encoriint> H^r^^n pp n-7ff^ 

A consensus sequence was obtained lelarivc to a vadety of EST sequences as descnbed in Example 1 above, 
wherein the consensus sequence obtained is herein designated DNA36623. Based on the DNA36623 consensus 
sequence, oligonucleotides were symhesized: 1) to identily by PCR a cDNA library that contained the sequence of 
interest, and 2) for use as probes to isolate a clone of die ftiU-length coding sequence for PRO702. 

A pair of PCR primers (forward and reverse) were synthesized: 
forward PCR primer nfff>7^ fp 5"-CGCTGACTATGTrGCCAAGAGTCG-3' (SEQ ID NO:98) 
reverse PCR primrr (366?^ rD 5 -GATGATGGAGGCTCCATACCTCA0-3' (SEQ ID NO:99) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA36623 
sequence which had the following nucleotide sequence 
hvbridiTannn ppt^y (^^?,},;t\) 

5'-GTGTrCATrGGCGTGAATGACCTrGAAAGGGAGGGACAGTACATGTrCAC-3' (SEQ ID NO: 100) 

In order to screen several libtar«s for a source of a full-length clone. DNA from the libraries was screened 
by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones 
encoding die PRO702 gene using the probe oligomicleotide and one of d« PCR primers. RNA for construction of 
the cDNA libraries was isolated from human fetal liver tissue (LIB229). 

DNA sequencing of the clones isolated as described above gave the fiiU-lcngth DNA sequence for PRO702 
30 [herein designated as UNQ366 (DNA50980-1286)) (SEQ ID NO:96) and the derived protein sequence Itor PRO702. 

The entire nucleotide sequence of UNQ366 (DNA50980-1286) is shown in Figure 36 fSEQ ID NO-96) 
Clone UNQ366 (DNA50980.1286) contains a single open reading frame with an apparem translational inidation site 
at nucleotide positions 22-24 and ending a, d«. stop codon at nucleotide positions 853-855 (Figure 36). TTie predicted 
polypeptide precunor is 277 amino acids long (Figure 37). TTie full-length PRO702 protein shown in Figure 37 has 
35 an estimated molecular weight of about 30.645 daltons and a pi of about 7.47. Analysis of the full-length native 
PRO702 amino acid sequence evidences the presence of a putative signal peptide from about amino acid 1 to about 
amino acid 25. potential N-glycosylation sites from about amino acid 230 to about amino acid 233 and from about 
amino acid 258 to about amino acid 261 and a C-type lectin domain signature sequence from about amino acid 248 
to about ammo acid 270. Clone UNQ366 (DNA50980-1 286) has been deposited with ATCC on March 3 1 . 1998 and 
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is assigned ATCC deposit no. 209717. 

Analysis of the amino acid sequence of the full-length PRO702 polypeptide suggests that it possesses 
sigmfican. sequence similarity to the conglutinin protein, thereby indicating that PRO702 nay be a iK>vel congluiinin 
homolog. More specifically, an analysis of the Dayhoff database (version 35.45 SwissPro. 35) evidenced significant 
homology between the PRO702 amino acid sequence and the following Dayhoff sequences. S32436. P.R75642. P 
5 W18780. P_W18781. A53330. AC002528_1. HSPPA21C0_1. CA21.HUMAN. CA14_HUMAN and A61262. 

pXAMPLE 17 : t«>l«i»" "f cI^N ft ^'""^ ^^^^^ 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 above, 
wherein .he consensus sequence obtained is herein designated DNA43047. Based on the DNA43047 consensus 
,0 sequence, oligonucleotides were synthesized: I) to identify by PCR a cDNA library that contained Ae sequence of 
interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO703. 

Forward and reverse PCR primers were synthesized: 
.. ^u^^rHPrR Primer S'-GAGAGCCATCGGGCTCCACCTG-B' (SEQ ID NO; 103) 
r.v.r..PrR primer 1 5'-GGAGAATOTGGCCACAAC-3' (SEQ ID NO:104) 
15 ..v.».PrR primer 2 5 -GCCCTGGCACAGTGACTCCATAGACG-3- (SEQ ID NO:105) 
PCR Primer 3 5 -ATCCACTTCAGCGOACAC-3' (SEQ ID NO:106) 
Additionally, a synthetic oligonuclcoudc hybridization probe was constructed from the consensus DNA40654 
sequence which had the following nucleotide sequence 
hy]fridi7arion probe 

20 5'-CCAGTGCCAGGATACCTCTCTrCCCCCCAGAGCATAACAGACACG-3- 
(SEQ ID NO: 107) 

In order to screen several Ubraries for a source of a full-length clone. DNA from the libraries was screened 
by PCR ampUfication with one of the PCR primer pairs identified above. A positive library was then used to isolate 
clones encoding the PRO703 gene using the probe oligonucleoUde and one of d» PCR primers. RNA for 
25 construction of the cDNA libraries was isolated from human fetal kidney tissue (UB227). 

DNA sequencing of the clones isolated as described above gave the lull-length DNA sequence for PRO703 
Iherein designated as UNQ367 (DNA509I3-1287)1 (SEQ ID NO:101) and the derived protein sequence for PRO703. 

m entire nucleotide sequence of UNQ367 (DNA50913-1287) is shown in Figure 38 (SEQ ID NO:101). 
Clone UNQ3d7 (DNA50913-1287) conains a single open reading frame with an apparent translational initiation site 
30 at nucleotide posidons 115-117 and ending at the stop codon at nucleotide positions 2305-2307 (Figure 38). Hie 
predicted polypepdde precursor is 730 amino acids long (Figure 39). The full-length PRO703 protein shown in 
Figure 39 has an estimated molecular weight of about 78,644 daltons. and a pi of about: 7.65. Important reg.ons of 
.he PRO703 amino acid sequence include the signal peptide, a cAMP- and cGMP^ependem protein kinase 
phosphorylation site, a CUB domain protein motif. N-glycosylation sites and a puutive AMP-binding domain 
35 signature. Clone UNQ367 (DN A50913.1287) has been deposited with ATCC and is assigned ATCC deposu no. 
209716. 
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Aco«cnsusseque«:ewasob,ai™dreladve«)avanc.>.ofESTsecH.cnc«asd«^^^^^ , 3bo,, 

wherein .he consensus sequence obtained .s herein designated DNA43437. B«ed on d,e DNA43437 consensu 
, sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contat^d the sequence of 
mterest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO705. 
5 A pair of PCR primers (forward and reverse) were synthesized: 

forw9r<i PCR primgr 5 -AAGCGTGACAGCGGGCACGTC.3' (SEQ id N0:I IO) 
K - V<?r?g PCR pripitr 5 -TGCACAGTCTCTGCAGTGCCCAGG-3- (SEQ ID NO lll) 
Additionally, a synthetic oligonucleotide hybndiza.ion probe was constructed from the consensus DNA43437 
sequence which had the foUowmg nucleotide sequence 
10 hybridization nmbe f4^^7,p|) 

5'-GAATGCTGGAACGGGCACAGCAAAGCCAGATACTTGCCTG-3- (SEQ ID NO:l 12) 

m order to screen several Hbrades for a source of a full-length clone. DNA from the librar.es was screened 
by PCR amplification with the PCR pnmer pair identified above. A positive librao^ was then used to isolate clones 
encoding the PRO705 gene using the probe oligonudeoUde and one of the PCR primers. RNA for construction of 
15 the cDNA libraries was isolated from human fetal kidney tissue (UB227). 

DNA sequencing of the clones isolated as described above gave the fi^ll-length DNA sequence for PRO70S 
Iherein designated as UNQ369 (DNA50914-1289)] (SEQ ID NO- 108) and the derived protein sequence for PRO705 
Tht entire nucleotide sequence of UNQ369 (DNA50914-1289) is shown in Figure 40 (SEQ FD NO-108) 
Clone UNQ369 (DNA50914-1289) contains a single open reading frame with an apparent translational iniUadon siu 
20 a. nucleotide positions 566-568 and ending at the stop codon a, nucleotide positions 2231-2233 (Figure 40) THe 
predicted polypeptide precursor is 555 amino acids long (Figure 41). n^e full-lengd, PRO705 protein shown in 
Figure 41 has an esdmated molecular weigh, of about 62.736 daltons and a pi of about 5.36. Analysis of the full- 
ict^th PRO705 sequence as shown in Figure 41 evidences the presence of the following: a signal peptide from about 
ammo acid 1 to about amino acid 23. a eukaryotic DNA topoisomerase I active site from about amino acid 418 to 
25 abou, amino acid 436. and various regions that show homology to various glypican proteins from about amino acid 
237 ,0 about amino acid 279. about ami«, acid 42 1 to about amino acid 458, abou. amino acid 53 to about amiiK, acid 
74. abou, ammo acid 466 u> about amino acid 504. about ammo acid 308 to abou. amino acid 355. abou. amino acid 
104 to about amino acid 156 and about amino acid 379 to abou. amino acid 410. Clone UNQ369 (DNA50914-1289) 
has been deposited with ATCC on March 31 . 1998 and is assigned ATCC deposit no.209722. 
30 Analysis of die amino acid sequence of the full-lengdi PRO705 polypeptide suggests that i. possesses 

stgmfican. sequence similarity to the K-glypican protein, thereby indicating Oiat PRO705 may be a novel glypican 
protem family member. More specifically, an analysis of the Dayhoff database (version 35.45 SwissPro. 35) 
evidenced significam homology between fte PRO705 ammo acid sequence and the following Dayhoff sequences 
GPCK.MOUSE. GLYP.CHICK. GLYP_RAT. GLYP.HUMAN. GPC2.RAT. GPC5 HUMAN. GPC3 HUMAn' 
35 GPC3_RAT.P_R30168andCEC03H12_2. 

EX AMPI F 19 : IsolatiffllofrnNA CU^ Bp o^ ^ i no H..n„n VJ^ ^n^ 

A consensus sequence was obuuned reladve u> a varies of EST sequence, as described in Example 1 above 
wherein the consensus sequence obuuned is herein designared DNA34024. Based on Ae DNA34024 consensus 
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sequence, oligonucleotideswere synihesizcd: I) lo idcmify by PCR a cDNA library that coniained the sequence of 
interest, and 2) for use as probes to isolate a clone of the fuH-lenglh coding sequence for PRO708. 

A pair of PCR primers (forward and reverse) were synthesized: 
ff ^p,.rH PrR primer 5--CCCAACCCAACTGTTTACCTCTGG-3- (SEQ ID N0:115) 
.^vfr«^prRnrimer 5'-CTCTCTGAGTGTACATCTGTGTGG-3' (SEQ ID NO: 1 16) 
5 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA34024 
sequence which had the following nucleotide sequence 

5'<iCCACCCTACCTCAGAAACTGAAGGAGGTTGGNTA1TCAACGCATATGGTCGG-3' (SEQ ID N0:117) 
In order to screen several Ubranes for a source of a full-length done. DNA from the libraries was screened 
10 by PCR amplificaoon with the PCR primer pair identified above. A positive library was then used to isolate clones 
encoding the PRO708 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of 
the cDNA libraries was isolated from human bone marrow tissue (LIB255). 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PRO708 
(herein designated as UNQ372 (DNA48296-1292)J (SEQ ID NO: 113) and the derived protein sequence for PROTOS. 
15 The entire nucleotide sequence of UNQ372 fDNA48296-1292) is shown in Figures 42A-B (SEQ ID 

N0:1 13). Clone UNQ372 (DNA48296-1292) contains a single open reading frame with an apparent translational 
initiacion site at nucleotide positions 891-893 and ending at the stop codon at nucleotide positions 2436-2438 (Figures 
42A-B). The predicted polypeptide precursor is 515 amino acids long (Figure 43). The full-length PRO708 protein 
shown in Figure 43 has an estimated molecular weight of about 56.885 daltons and a pi of about 6.49. Analysis of 
20 the PRO708 amino acid sequence shown in Figure 43 (SEQ ID NO: 1 14) evidences the existence of a putative signal 
peptide at about amino acid 1 to about amino acid 37. putative sulfatase signanire sequences at about amino acid 120 
to about amino acid 132 and about amino add 168 to about amino add 177. a putative tyrosine kinase phosphorylation 
site from about amino acid 163 to about amino acid 169 and potemial N-glycosylation sites from about amino acid 
157 to about amino add 160. about amino acid 306 to about amino acid 309 and about ammo acid 318 to about amino 
25 acid 321 . Clone UNQ372 (DNA4 8296- 1292) has been deposited with ATCC on March 1 1 . 1998 and is assigned 
ATCC deposit no. 209668. 

Analysis of the amino acid sequence of Ac full-length PRO708 polypeptide suggests that it possesses 
significam homology to the aryl sulfatase proteins, thereby indicating that PRO708 may be a novel aryl sulfatase 
homolog. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced significant 
30 homology between the PRO708 amino acid sequence and the following Dayhoff sequences. ARSB_HUMAN. 
CELC54D2_2. G02857. STS_HUMAN. 137186. 137187. GEN12648. CELD1014_7. GA6S_HUMAN and 
SPHM_HUMAN. 

FV AMPLE 20 - tirlari"" nf enuA ntme* FjKoding Human PRQ32Q 
35 A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 above, 

wherein the consensus sequence obtained is herein designated DNA28739. Based on the DNA28739 consensus 
sequence. oBgonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of 
interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO320. 
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A pair of PCR primers (forward and reverse) were synthesized: 

forward PCR prima 5-cctcagtggccacatgctcatg-3' (seq id no:120) 

reverse PCR primer 5'-GGCTGCACGTATGGCTATCCATAG-3' (SEQ id N0:121) 

Additionally, a synthetic oligonudeoude hybridization probe was constructed from the consensus DNA28739 
sequence which had the following nucleotide sequence 

S hvbridlTaiion pj-nh^ 

5'-GATAAACTGTCAGTACAGCTGTGAAGACACAGAAGAAGGGCCACAGTGCC-3' (SEQ ID NO.I22) 

In order to screen several Ubraries for a source of a fiill-length clone. DNA from the libraries was screened 
by PCR ampUficanon with the PCR primer pair identified above. A positive libraiy was then used to isolate clones 
encoding the PRO320 gene using the probe oligonucleotide and one of the PCR primers. RNA for construcnon of 
the cDNA libraries was isolated from human fetal lung tissue (LIB25). 

DNA sequencing of die clones isolated as described above gave the ftill-length DNA sequence for PRO320 
(herein designated as UNQ281 (DNA32284-1307)1 (SEQ ID N0:1 18) and the derived protein sequence for PRO320. 

TTie entire nucleotide sequence of UNQ28I (DNA32284-1307) is shown in Figure 44 (SEQ ID N0:1 18) 
Clone UNQ281 (DNA32284-13(J7) comains a single open reading frame with an appai^m translational initiauon site 
at nucleotide positions 135-137 and cndiiig at the stop codon at nucleotide positions 1 149-1 151 (Figure 44). The 
predicted polypeptide precursor is 338 amino acids long (Figure 45). The full-length PRO320 protein shown in 
Figure 45 has an estimated molecular weight of about 37.143 dalions and a pi of about 8.92. Important regions of 
the PRO320 amino acid sequence include the signal peptide, corresponding to amino acids 1-21. an EGF-like domain 
cysteine panem signamre. corresponding to amino acids 80-91. and three calcium-binding EGF-like domains, 
corresponding to amino acids 103-124. 230-151 and 185-206. respectively. Clone UNQ281 (DNA32284-1307) has 
been deposited with ATCC and is assigned ATCC deposit no. 209670. 



EXAMPt.R 21 : Isolation of cDNA rinnrs Encodinp H..man Pp m^^ 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 above, 

25 wherein the consensus sequence obtained is herein designated DNA34347. Based on the DNA34347 consensus 
sequence. oUgonucleotides were synthesized: 1) to identify by PCR a cDNA Ubraiy diat contained the sequence of 
interest, and 2) for use as probes to isolate a clone of the fiill-length coding sequence for PR0324. 

PCR primers (forward and reverse) were syndiesized: 
foryarrt PCR prillKr 1 5'-GCAATGAACTGGGAGCTGC-3" (SEQ id N0:125) 

30 foryard PCR primrr ? S'-ctgtgaatagcatcctggg-3' (SEQIdN0:I26) 
forward PCR ynmrl s'-cmrcAAGCCACTOGAGGG-s ■ (seq id no: 127) 
Tcvcryg PCR primrr 1 5"-ctgtagacatccaagctggtatcc-3' (seq id no:128) 
reverse PCR primrr ? 5 -aagagtctgcatccacaccactc-3' (Seq id no:129) 

Additionally, a synthetic oligonucleoUde hybridization probe was constructed from the consensus DNA34347 
35 sentence which had the following nucleotide sequence 
hvbridigaticm ppti^ 

S -ACCTGACGCTACTATGGGCCGAGTGGCAGGGACGACGCCCAGAATC-S' (SEQ id NO:130) 

In order lo screen several libraries for a source of a fiill-lengdi clone. DNA from the Ubraries was screened 
by PCR aniplificaiion widi one of the PCR primer pairs identified above. A positive library was dien used to isolate 
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clones encoding the PR0324 gene using the probe oligonucleotide and one of the PCR primers. RNA for 
construction of the cDNA libraries was isolated from hunun feul liver tissue (L1B6). 

DNA sequencing of the clones isolated as described above gave the fuU-length DNA sequence for PR0324 
(herein designated as UNQ285 (DNA36343-1310)1 (SEQ ID NO: 123) and the derived protcm sequence for PR0324. 
The entire nucleotide sequence of UNQ285 (DNA36343.1310) is shown in Figure 46 (SEQ ID NO:123). 
5 Clone UNQ285 (DNA36343-1310) comams a single open reading frame with an apparent iransUtional initiation site 
at nucleotide positions 144-146 and ending at the stop codon at nucleotide positions 101 1-1013 (Figure 46). The 
predicted polypeptide precursor is 289 amino acids loiig (Figure 47). Tlie full-lengdi PR0324 protein shown in 
Figure 47 has an estimated molecular weight of about 32.268 daltons and a pi of about 9.2 1 . Analysis of the PR0324 
polypepdde sequence shown in Figure 47 (SEQ ID NO: 124) evidence the presence of the foUowmg: a signal peptide 
10 from about amino acid 1 to about amino acid 3 1 . a transmembrane domain from about amino acid 136 to about amino 
acid 157. tyrosine kinase phosphorylation sites from about amino acid 106 or about amino acid 107 to about amino 
acid 1 13 and regions that are homologous to shon-chain alcohol dehydrogenase regions from about amino acid 80 
tt, about amino acid 90. from about amino acid 131 to about amino acid 168. from about amino acid I to about amino 
acid 13 and from about amino acid 176 to about amino acid 185. Clone UNQ285 (DNA36343-1310) has been 
15 deposited widi ATCC on March 30. 1998 and is assigned ATCC deposit no. 209718. 

Analysis of the *mino acid sequence of the full-length PR0324 polypeptide suggests that it possesses 
significant sequence similarity to oxidoreducuses. thereby indicating that PR0324 may be a novel oxidoreductase 
homolog. More speciiicaUy. an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced significant 
homology between the PR0324 amino acid sequence and the following Dayhoff sequences. B61209. A69965. 
20 YQJQ_BACSU. D69930. S76124. FABG.ECOLI. C70023. S77280. FABG.VIBHA and MTV013_6. 

FXAMPLE 22 : l<nlatinn of c D NA Tlones Encoding Human PRQ351 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 above, 
v^herein the consensus sequence obtained is herein designated DNA35950. Based on the DNA35950 consensus 
25 sequence, oligonucleotides were symhesized: 1) to identify by PCR a cDNA library that contained the sequence of 
interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR0351 . 

Forward and reverse PCR prtmen were synthesized: 
fr ry ^ rHPTBnrimer 5'-CCTGTGCTGTGCCTCGAGCCTGAC-3' (SEQ ID NO: 133) 

.PTRnrimer S'-GTGGGCAGCAGTTAGCACCGCCTC-S' (SEQ ID NO: 134) 

30 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA35950 
sequence which had the following nucleotide sequence 
hY^yr^'^"""" probe 

5'-GGCTGGCATCATCAGCnTGCATCAAGCTGTGCCCAGGAGGACGC.3- 
(SEQ ID NO:135) 

35 In order to screen several Ubraries for a source of a fuU-ler«ih clone. DNA from tiie libraries was screened 

by PCR anvliffcadon withone of the PCR primer pairs identified above. A positive library was ti«n used to isolate 
clones encoding the PR0351 gene using the probe oligonucleotide and one of the PCR pnmcrs. RNA for 
construction of the cDNA libraries was isolated from human fetal liver tissue (1JB230). 
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DNA sequencing of the clones isolated as described above gave the full-lengdi DNA sequence for PR035I 
(herein designated as UNQ308 (DNA40571-1315>] (SEQ ID N0:131) and the derived protein sequence for PR0351. 

TTic entire nucleotide sequence of UNQ308 (DNA40571-1315) is shown in Figure 48 (SEQ ID N0131) 
Clone UNQ308 (DNA40571-1315) contains .wo open reading frames with an apparent translattonal initiation site « 
nucleotide positions 189-191 and a second open reading frame beginning at nucleotide 470. with the two open reading 
frames ending at the stop codons at nucleoude positions 363-365 and 2009-2011. respectively (Figure 48). The 
pmiicied polypepdde precursor is 571 amino acids long (Figure 49). Imponant regions of the amino acid sequence 
of PR0351 include the signal peptide, regions having sequence similarity to serine proteases of the trypsin family, 
two N-glycosylation sites, and three Kringle domains. Clone UNQ308 (DNA40571.13I5) has been deposited with 
ATCC and is assigned ATCC deposit no. 209784. 

EXAMPLE 23 : Isolation of cDNA rinn,-. Fn f««ii"y Hnm^^ 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 above, 
wherein the consensus sequence obtained is herein designated DNA36950. Based on the DNA36950 consensus' 
sequence, oligonucleotides were synthesized: I) to identify by PCR a cDNA library that contained the sequence of 
interest, and 2) for use as probes to isolate a clone of the fiill-lengih coding sequence for PR0352. 

PCR primers (forward and reverse) were synthesized: 
forward PCR vnmr 1 5'-CTGGCACAGCTCAACCTCATCroG-3' (SEQ ID NO:138) 

fomardPCR mm 2 5'-octgtctgtctgtctcattg-3' (seq id no: i39) 
forward PCR nrimrr 3 5'-ggacacagtatactgaccac-3' (seq id no:140) 

rCYCrsC PCR primer I 5 -TGCGAACCAGGCAGCTGTAAGTGC-3' (SEQ id N0:I41) 

rgvcrsf PCR prim«-r -> 5 -TGGAAGAAGAGGGTGGTGATGTCG-3' (SEQ ID NO:142) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA36950 

sequence which had the following nucleotide sequence 

hybridization pri^e 

5'-CAGCTGACAGACACCAAACAGCTGGTGCACAGTTTCACCGAAGGC-3' (SEQ ID NO: 143) 

In Older to screen several Ubraries for a source of a fiill-length clone. DNA from the libraries was screened 
by PCR amplification with the PCR primer pair idemified above. A positive library was then used ,o isolate clones 
encoding the PR0352 gene using the probe oligonucleotide and one of the PCR primers. RNA for constniction of 
the cDNA libraries was isolated from human fetal kidney tissue (LIK27). 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR0332 
therein designated as UNQ309(DNA41386-1316)J (SEQ ID N0:136) and the derived protein sequeiKe for PR0352. 

The entire nucleotide sequence of UNQ309 (DNA41386.1316) is shown in Figure 50 (SEQ ID N0136) 
Clone UNQ309 (DNA4138(^1316) contains a single open reading frame with an apparem translational iaidation site 
at nucleotide positions 152-154 and ending at the stop codon at nucleotide positions 1100-1102 (Figure 50). The 
predicted polypeptide precursor is 316 amino acids long (Figure 51). The fiill-length PR0352 protein shown in 
Rgure 2 has an estimated pi of about 4.62. Analysis of the full-length PR0352 sequence evidences the presence of 
a signal peptide from about amino acid 1 to about amino acid 28. a transmembrane domain from about amino acid 
251 to about amino acid 270. potemial N-glycosylation sites from about amino acid 91 to about amino acid 94. about 
amino acid 104 to about ammo acid 107, about amino acid 189 to about amino acid 192 and about amino acid 215 
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to aboui amino acid 218 and a region having homology to immunoglobulins and MHC from about amino acid 217 
to about amino acid 234. Clone UNQ309 {DNA41386-1316) has been deposited wjih ATCC on March 26, 1998 and 
is assigned ATCC deposit no. 209703. 

Analysis of the amino acid sequence of ihe full-iengih PR0352 polypeptide suggests that it possesses 
significam sequence similarity to the buiyrophilin protein, thereby indicating that PR0352 ts a novel butyrophilin 
5 homolog. More specifically, an analysis of the Dayhoff database (venion 35.45 SwissProt 35) evidenced significant 
homology between the PR0352 amino acid sequence and the following Dayhoff sequences. BUTY_HUMAN. 
HSB73 I. GGCD80_1. 146690. A33_HUMAN. P_R67988. CD86_MOUSE, P_R71360. B39371 and D50558_l. 

FYAMPLE 24 - Mmr\ fff l-^NA HnnM Fjicodine Human PR0381 

10 A consensus sequence was obtained relative to a variety of EST sequences as described in Example I above, 

wherein the consensus sequence obtained is herein designated DNA39651. Based on the DNA39651 consensus 
sequence, oligonucleotides were synthesized: 1) lo identify by PCR a cDNA library that contained the sequence of 
interest, and 2) for use as probes lo isolate a clone of the full-length coding sequence for PR0381 . 
A pair of PCR primers (forward and reverse) were synthesized: 

15 f»ru,»rH PCR primer 5 ■CTTTCCTTGCTTCAGCAACATGAGGC-3- (SEQ ID NO: 146) 
r.v.r.P PCR nrimer 5'-GCCCAGAGCAGGAGGAATGATGAGC-3' (SEQ ID NO: 147) 
Additionally, a synUietic oligonucleotide hybridization probe was constructed from the consensus DNA39651 
sequence which had the following nucleotide sequence 
hYhridiration probe 

20 5 -GTGGAACGCGGTCTTGACTCTGTrCGTCACTTCTTrGATTGGGGCTTTG-3' (SEQ ID NO:148) 

In order to screen several libraries for a source of a full-length clone. DNA from the libraries was screened 
by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones 
encoding the PR0381 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of 
the cDNA libraries was isolated from human fetal kidney tissue (LIB227). 
25 DNA sequencing of the clones isolated as described above gave the ftill-length DNA sequence for PR0381 

[hciein designated as UNQ322 (DNA44194-1317)1 (SEQ ID NO:144) and the derived protein sequence for PR038I . 

The entire nucleotide sequence of UNQ322 (DNA44I94-1317) is shown in Figure 52 (SEQ ID NO:144). 
aone UNQ322 (DNA44194-1317) contains a single open reading frame widi an apparent translaaonal initiation site 
at nucleotide positions 174-176 and ending at the stop codon at nucleotide positions 807-809 (Figure 52). The 
30 predicted polypeptide precursor is 21 1 amino acids long (Figure 53). TTte full-length PR0381 protein shown in 
Figure 53 has an estimated molecular weight of about 24.172 daltons and a pi of about 5.99. Analysis of the full- 
lei«ih PR0381 polypeptide shown in Figure 53 (SEQ ID NO: 145) evidences ihe presence of the following: a signal 
peptide from about amino acid 1 to about amino acid 20, a potential N-glycosylation site from about amino acid 176 
to about amino acid 179, potential casein kinase 11 phosphorylation sites from about ammo acid 143 to about amino 
35 acid 146. from about amino acid 156 to about amino acid 159. from about amino acid 178 to about ammo acid 181. 
and from about amino acid 200 to about amino acid 203, an endoplasmic reticulum targeimg sequence from about 
amino acid 208 to about amino acid 21 1 , FKBP-type peptidyl-prolyl cis-trans isomerase sites from about amino acid 
•fS to about amino acid 114 and from about amino acid 118 to about amino acid 131. EF-hand calcium binding 
xxa from about amino acid 191 to about amino acid 203. from about amino acid 184 to about amino acid 203 
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and from about amino acid 140 .0 about amino acid 159. and an S-lOO/ICaBP type calcium binding domain from 
about amino acid 183 to about ammo acid 203 Clone UNQ322 (DNA44 194-13 17) has been deposited with ATCC 
on April 28. 1998 and is assigned ATCC deposit no. 209808. 

Analysis of the amino acid sequence of the full-length PR038I polypeptide suggests that it possesses 
significant sequence similaniy to FKBP immunophilin proteins, thereby indicating d>ai PR0381 may be a novel FKBP 
immunophilin homolog. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) 
evidenced significant homology between the PR0381 amino acid sequence and the following Dayhoff sequences. 
AF040252_1. 149669, P_R93551. S71238. CELC05C8.1. CEU27353J. MIP TRYCR. CEZC455 3 
FKB4 HUMAN and 140718. 
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10 EXAMPLE 25: Isolation of cDNA Clnne<; Encodinp Human PROIKf i 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example I above, 
wherein the consensus sequence obtained is herein designated DNA40674 Two proprietary Genentech EST 
sequences were employed in the consensus sequence assembly, wherein those EST sequences are herein designated 
DNA23350 (Figure 56; SEQ ID NO: 151) and DNA23536 (Figure 57; SEQ ID NO: 152). Based on the DNA40674 
consensus sequence, oligonucleotides were syndicsized: I) to identify by PCR a cDNA library that contained the 
sequence of interest, and 2) for use as probes to isolate a clone of the ftiU-length coding sequence for PR0386. 

A pair of PCR primers (forward and reverse) were synthesized: 
forward PCR nnniPr 5'-ACGGAGCATGGAGGTCCACAGTAC.3" (SEQ id N0:153) 
rgvgfw PCR Primer 5'-GCACGTTTCTCAGCATCACCGAC-3' (SEQ ID NO: 154) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA40674 
sequence which had the following nucleotide sequence 
hybridization prnj^ 

5"-CGCCTGCCCTGCACCTrcAACTCCTGCTACACAGTGAACCACAAACAG1T-3' (SEQ ID NO:155) 

In order w screen several Ubraries for a source of a Ml-lengdi clone. DNA from the libraries was screened 
by PCR amplificaiion with the PCR primer pair identified above. A positive library was then used to isolate clones 
encodmg the PR0386 gene using the probe oligonucleotide and one of the PCR primers. RNA for constniction of 
the cDNA libraries was isolated from human fetal brain tissue (IJB153). 

DNA sequencing of the clones isolated as described above gave die full-length DNA sequence for PR0386 
therein designated as UNQ326 (DNA45415-1318)] (SEQ ID NO: 149) and the derived protein sequence for PR0386. 

The entire nucleotide sequence of UNQ326 (DNA45415-I3I8) is shown in Figure 54 (SEQ ID NO:I49). 
Gone UNQ326 (DNA45415-1318) contains a single open reading frame with an apparent translational initiation site 
at nucleotide positions 146-148 and ending at die stop codon at nucleotide positions 791-793 (Figure 54). The 
predicted polypeptide precursor is 215 amino acids long (Figure 55). The ftill-length PR0386 protein shown in 
Figure 55 has an estimated molecular weight of about 24.326 daltons and a pi of about 6.32. Analysis of die full- 
length PR0386 sequence shown in Figure 55 (SEQ ID NO:150) evidences the presence of die following: a signal 
peptide from about amino acid I to about amino acid 20. a transmembrane domain from about amim> acid 161 to 
about amino add 179. an imnamoglobulin-lite fold from abom amino acid 83 to about amino acid 127 and potentia] 
N-glycosylation shes from about ammo acid 42 to about amino acid 45. from abom amino add 66 to about amino 
add 69 and from abom amino add 74 to about amino add 77. Clone UNQ326 (DNA45415-131 8) has been deposited 
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with ATCC on April 28. 1998 and is assigned ATCC deposit no. 209810. 

Analysis of the amino acid sequence of the full-length PR0386 polypeptide suggests that it possesses 
significant sequence similarity to the sodium channel bcta-2 subunit. thereby indicating that PR0386 is a novel 
homolog thereof. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced 
significant homology between the PR0386 amino acid sequence and the foUowing Dayhoff sequences. A57843. 
MYPO.HUMAN. GENI4531. JC4024. HS46KDA_1. HSU90716_I. D86996_2. MUSIGLVDJ. DMU42768_1 
andS19247. 
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F ^AMPLE 26 : Vtnlation of cP ^^ Tlnnes RnroHing Human PRQ54Q 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 above, 
wherein the consensus sequence obtained b herein designated DNA39631. Based on the DNA39631 consensus 
sequence. oJigonudeotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of 
interest, and 2) for use as probes to isolate a clone of the M-length coding sequence for PRO540. 

Forward and reverse PCR primers were synthesized; 
f ,^ p.»rri PCR nrimer 5'-CTGGGGCTACACACGGGGTGAGG-3' (SEQ ID NO:158) 
»..r.. PCR rrimei 5 -GGTGCCGCTGCAGAAAGTAGAGCG-3' (SEQ ID NO:159) 
Addiiionally. a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA40654 
sequence which had the following nucleotide sequence 

5'-GCCCCAAATGAAAACGGGCCCTACTrcCTGGCCCTCCGCGAGATG-3' 
(SEQIDNO:160) 

In order 10 screen several Ubraries for a souree of a full-length clone. DNA from the libraries was screened 
by PCR amplification with one of the PCR primer pairs identified above. A positive library was then used to isolate 
clones encoding the PRO540 gene using the probe oligonucleotide and one of the PCR primers. RNA for 
construction of the cDNA libraries was isolated from human feul kidney tissue (LIB227). 

DNA sequencing of the clones isolated as described above gave the fiiU-length DNA sequence for PRO540 
[herein designated as UN(3341 (DNA4418!»-1322)J (SEQ ID N0:156) and the derived protein sequence for PRO540. 

The emire micleoiide sequence of UNQ341 (DNA44189-I322) is shown in Figure 58 (SEQ ID NO:156). 
Clone UNQ341 (DNA44I89-1322) contains a single open reading frame with an apparent translational initiation site 
at nucleotide positions 21-23 and ending at the stop codon at nucleotide positions 1257-1259 (Figure 58). TTie 
predicted polypeptide precursor is 412 amino acids long (Figure 59). THe foil-length PRO540 protein shown in 
Figure 59 has an estimated molecular weight of about 46.658 daltons and a pi of about 6.65. Important regions of 
the amino acid sequence of PRO540 include the signal peptide, potential N-glycosylation sites, a potential lipid 
«dstrate binding site, a sequence typical of Upases and serine proteins, and a beta-transducin family Trp-Asp repeat. 
Clone UNQ341 (DNA44189-1322) has been deposited with ATCC and is assigned ATCC deposit no. 209699. 

EXAMPLE 27 - I'lrlntinn irf rPNA (^'""''^ Fnrndiiig Human PRQ615 

Aconsensus sequence was obtainedrelanvetoavarietyofEST sequences as described in Example 1 above. 

wherein the consensus sequence obtained is herein designated DNA42240. Based on the DNA42240 consensus 
sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of 
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interest, and 2) for use as probes to isolate a dotie of the full-length coding sequence for PR0615. 
Forward and reverse PCR primers were synthesized: 

forvart PCR nrim g r 5-tggtcttcgccttgatcgtgttct-3' (seq id no:163) 
fftpyard PCR primfr 5'-gtgtactgagcggcggttag-3' (seq id no:164) 

rcvcw PCR ffrimcr 5'-CTGAAGGTGATGGCTGCCCTCAC-3' (SEQ id N0:165) 
5 rtvtrse PCR prinm 5*-CCAGGAGGCTCATGGGAAAGTCC-3' (SEQ id NO: 166) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA42240 
sequence which had the following nucleotide sequence: 
hybridization p^hf 

5-CCACGAGTCTAAGCAGATGTACTGCGTGTTCAACCGCAACCAGGATGCCT-3- 
10 (SEQ id NO: 1 67) 

In order to screen several libraries for a source of a full-length clone. DNA from the libraries was screened 
by PCR ampUficarion with one of *e PCR primer pairs identified above. A positive library was then used to isolate 
clones encoding me PR06I5 gene using the probe oligonucleotide and one of the PCR pruners. RNA for 
construction of the cDNA libraries was isolated from human bone marrow tissue (LIB255). 

DNA sequencing of d« clones isolated as described above gave die full-length DNA sequence for PR0615 
Ihereindesignated as UNQ352 (DNA4830*-1323)] (SEQ ID N0:16I) and the derived protein sequence for PR06I5. 

TTie entire nucleotide sequence of UNQ352 (DNA48304-I323) is shown in Figure 60 (SEQ ID NO- 161) 
Clone UNQ352 (DNA48304.I323) conaiins a single open reading fhune with an apparent translational initiaUoa site 
at nucleotide positions 51-53 and endii« at the stop codon at nucleotide positions 723-725 (Figure 60). TTte predicted 
polypeptide precursor is 224 amino adds long (Figure 61). TTte fiill-length PR0615 protein shown in Figure 61 has 
anestimatedmolecularweightofabout24.810dal,onsandapl of about 4.75. Important regions of the amino acid 
sequence of PR0615 include a type II transmembrane domain, corresponding to about amino acids 24^3, other 
transmembrane domains, corresponding to about amino acids 74-90. 108-126. and 145-161. respectively, and a 
potential N-glycosylation site, corresponding to about amino acids 97-100. Clone UNQ352 (DNA48304-I323) has 
25 been deposited witfi ATCC and is assigned ATCC deposit no. 20981 1 

EXAMPLE 78 : isnlarinn rpfj^ i;' ^ ^ . ^ Fncodin» Hnman yf {r ^\f i 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 above, 
wherein the consensus sequence obtained is herein designated DNA30900. Based on the DNA30900 consensu^ 
sequence, oligonucleotides were synthesized: I) to identify by PCR a cDNA library that conuuned the sequence of 
interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR0618. 
Forward and reverse PCR primers were synthesized: 

fopvard PCR primftr 5'-taacagctgcccactgcttccagg-3' (seq id no:I7i) 
reverse PCR primr 5 -taatccagcagtgcaggccggg-3' (Seq id no:172) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from die consensus DNA30900 
sequence which had the following nucleotide sequence 
hvbridizarinn prgk 

5--atggcctccacggtgctgtggaccgtgttcctgggcaaggtgtggcagaa-3' 

(SEOIDNO:173) 
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Screening of ihe above described library gave rise to the panial cDNA clone designated herein DNA35597 
(SEQ ID NO: 170). Extension of this sequence using repeated cycles of BLAST and phrap gave rise to a nucleoiide 
sequence designated herein as DNA43335. Primers based upon the DNA43335 consensus sequence were then 
prepared as follows. 

f^ pvprd PCR primer 5^TGCCTATGCACTGAGGAGGCAGAAG>3' (SEQ ID NO: 174) 
5 fpvf «P PTR nrimer 5'-AGGCAGGGACACAGAGTCCA1TCAC-3' (SEQ ID NO:175) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA43335 
sequence which had the following nucleotide sequence 
^^vl^ridiTaf •inn nrobe 

5'-AGTATGATTTGCCGTGCACCCAGGGCCAGTGGACGATCCAGAACAGGAGG-3' 
10 (SEQ ID NO: 176) 

In order to screen several libraries for a source of a full-length clone. DNA from the libraries was screened 
by PCR amplification with one of the PGR primer pairs identified above. A positive library was then used to isolate 
full length clones encoding the PR0618 gene using the second probe oligonucleotide and one of the second set of PCR 
primers. RNA for construction of the cDNA libraries was isolated from human fetal liver tissue (LIB229). 

15 DNA sequencing of the clones isolated as described above gave the i\iU-iengih DNA sequence for PR061 8 

Pserein designated as UNQ354 (DNA49152-1324)1 (SEQ ID NO: 168) and the derived protein sequence for PR0618. 

The entire nucleotide sequence of UNQ354 (DNA49152-1324) is shown in Figure 62 (SEQ ID NO: 168). 
Ckme UNQ354 (DNA49I52-1324) contains a single open reading frame widi an apparent translational initiation site 
at nucleotide positions 73-75 and ending at the stop codon at nucleotide positions 2479^2481 (Figure 62). The 

20 predicted polypeptide precursor is 802 amino acids long (Figure 63). The ftill-length PR0618 protein shown in 
Figure 63 has an estimated molecular weight of about 88.846 daltons and a pi of about 6.41 . Important regions of 
the amino acid sequence of PR0618 include type II transmembrane domain, a sequence typical of a protease, trypsin 
family, hisddine active site, multiple N-glycosylation sites, two sequences typical of a Kringlc domain, two regions 
having sequence sinnlariiy to Kallikrein light chain, and a region having sequence similarity to low-density lipoprotein 

25 receptor. Clone UNQ354 (DNA49 152-1324) has been deposited with ATCC and is assigned ATCC deposit no. 
209813. 

FX AMPLE 29 ' Mmw (if ^P^A Clones Encoding Human PR0719 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 above, 
30 wherein the consensus sequence obtained is herein designated DNA44851. Based on the DNA44851 consensus 

sequence, oligonucleoudes were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of 

inierest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR0719. 
A pair of PCR primers (forward and reverse) were synthesized: 

fnnvard PPR nrimer 5*-GTGAGCATGAGCGAGCCGTCCAC-3' (SEQ ID NO: 179) 
35 r^vi^r^e PrR Primer 5'-GCTATTACAACGGTTCTTGCGGCAGC-3' (SEQ ID NO: 180) 

AdditionaUy, a synthetic oligonucleotide hybridization probe was constructed from die consensus DNA4485I 

sequence which bad the following nucleodde sequence 

hYfrrift^^n"'^ p^»^ 

5'-TrGACTCrrCTGGTGAATCAGGACAAGCCGAGTnTGCCTTCCAG-3' (SEQ ID NO: 181) 
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In order to screen several libraries for a source of a full-length clone. DNA from the libraries was screened 
by PCR ainplificanon with the PCR primer pair idemified above. A positive library was then used to isolate clones 
encoding die PR0719 gene using d>e probe oligonucleotide and one of die PCR pnmcrs. RNA for construction of 
die cDNA libraries was isolated from human placenta tissue (LIB90). 

DNA sequencing of the clones isolated as described above gave die fiilHengdi DNA sequence for PR0719 
(herein designated as UNQ387 (DNA49646.1327)1 (SEQ ID NO:177) and die derived protein sequence for PR0719. 

The entire nucleotide sequence of UNQ387 (DNA49646-1327) is shown in Figure 65 (SEQ ID NO: 177). 
Clone UNQ387 a)NA4964^l327) contains a smgle open reading frame widi an apparent translational initiaUon site 
at nucleotide positions 223-225 and ending ai die stop codon at nucleotide positions 1283.1287 (Figure 65). TTie 
predicted polypeptide precursor is 354 amino acids long {Figure 66). The ftiU-lcngdi PR0719 protein shown in 
Figure 66 has an estimated molecular weight of about 39,362 daltons and a pi of about 8.35, Analysis of die Aill 
lengdi PR0719 sequence evidences die presence of a signal peptide from about amino acid 1 to about amino acid 16 
as shown in Figure 66 (SEQ ID NO: 178). a lipase-associaied serine-containing active sue at about amino acid 163 
to about amino acid 172. and two potential N-glycosylation sites from about amino acid 80 to about amino acid 83 
and about amino acid 136 to about ammo acid 139. Clone UNQ387 fDNA49646-1327) has been deposited widi 
15 ATCC on March 26. 1998 and is assigned ATCC deposit no. 209705. 

Analysis of die amino acid sequence of die fiill-lengdi PR0719 polypeptide suggests diat it possesses 
significant sequence similarity to die lipoprotein lipase H protein, diercby indicating dut PR0719 may be a novel 
lipoprotein lipase homolog. More specificaUy. an analysis of die Dayhoff database (version 35.45 SwissProt 35) 
evidenced significant homology between die PR0719 amino acid sequence and dw following Dayhoff sequences. 
LIPL_HUMAN. UPH_HUMAN. D83548_l. A24059_l. P_R30740. D88666_l. A43357. A46696. B43357 and 
A49488. 
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EXAMPLE 30 : Isolaiion nf rnfiA Q ^ jfts RnnoHin;, Hn m an PRn7?4 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 above, 
wherein the consensus sequence obtained is herein designated DNA35603. Based on die DNA35603 consensus 
sequence, oligonucleotides were syndiesized: 1) to identify by PCR a cDNA Ubraiy diat contained die sequence of 
interest, and 2) for use as probes to isolate a clone of die full-lengdi coding sequence for PR0724. 
Pairs of PCR primers (forward and reverse) were syndiesized: 

forward PCR primer. 1 5-gcctgtcactgtggagacac-3' (SEQ id no:184) 
forward pcr primer ?. 5'-gcaaggtcattacagctc-3' (SEQ id NO: 185) 

reverse PCR primer 1 5--AGAACATAGGAGCAGTCCCACTC-3' (SEQ ID NO:186) 
tcvcrse PCR mmt ? 5*-TGCCTGCTGCTGCACAATCTCAG-3' (SEQ id N0:187) 

Additionally, a syndieuc oligonucleotide hybridization probe was constructed from die consensus DNA35603 
sequence which had die following nucleotide sequence 
35 hvbridiratinn pmlv 

5'-GGCTATTGCTrGCCTrGGGACAGACCCTGTGGCTrAGGCTCTGGC-3' (SEQ ID NO:188) 

In order to screen several Ubiaries for a source of a fiill-lengdi clone. DNA from die libraries was screened 
by PCR amplification widi die PCR primer pairs identified above. A positive library was dwn used to isolate clones 
encoding die PR0724 gene using die probe oligonucleotide and one of die PCR primers. RNA for constniction of 
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the cDNA libraries was isolated from human fetal lung ttssue (LIB26). 

DNA secjuencing of the clones isolated as described above gave the fuU-lengih DN A setjuencc for PR0724 
(herein designated as UNQ389 (DNA4963 1-1328)) (SEQ ID NO:182) and the derived protein sequence for PR0724 
The entire nucleotide sequence of UNQ389 (DNA49631-1328) ts shown in Figure 67 (SEQ ID NO: 182) 
aoat UNQ389 (DNA49631-1328) contains a single open reading frame with an apparent translational initiation sue 
5 at nucleotide positions 546-548 and ending at the stop codon at nucleotide posiuons 2685-2687 (Figure 67). The 
predicted polypeptide precursor is 713 amino acids long (Figure 68). The ftill-length PR0724 protein shown m 
Figure 68 has an estimated molecular weight of about 76.193 daltons and a pi of about 5.42. Analysis of the ftjll- 
length PRa724 amino acid sequence shown in Figure 68 (SEQ ID NO:183) evidences the presence of the following: 
a signal peptide from about amino acid 1 to about ammo acid 16. a transmembrane domain from about ammo acid 
10 442 to about amino acid 462 and LDL receptor class A domain regions from about amino acid 152 to about amino 
acid 171. about amino acid 331 to about amino acid 350. about amino acid 374 to about amino acid 393 and about 
amino acid 411 to about amino acid 430. Clone UNQ389 (DNA49631-1328) has been deposited with ATCC on April 
28, 1998 and is assigned ATCC deposit no. 209806 

Analysis of the amino acid sequence of the full-length PR0724 polypeptide suggests that it possesses 
15 significant sequence similanty to the human LDL receptor proiein. thereby indicating that PR0724 may be a novel 
LDL receptor homolog. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) 
evidenced significant homology between the PR0724 amino acid sequence and the following Dayhoff sequences. 
P_R48547. MMAM2R_l. LRP2.RAT. P.R60517. P.R47861. P_R05533. A44513J. A30363, P_R74692 and 
LMLIP0PH0_1. 
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RXAMPLE 31 : Kolatinn of cP NA Tlnnes Rncodine Hunwn PRQ772 

One cDNA sequence was isolated in the amylase screen described in Example 2. wherein that cDNA 
sequence is herein designated DNA43509 (see Figure 71). Based on the DN A43509 sequence, oligonucleotide ptobcs 
were generated and used to screen a human fetal lung library (LIB25) prepared as described in paragraph 1 of 
25 Example 2 above. Hie cloning vector was pRK5B (pRKSB is a precursor of pRK5D thai docs not contain the Sfil 
site: see. Holmes et al.. SocnSS. 252:1278-1280 (1991)). and the cDNA size cut was less than 2800 bp. 

A pair of PGR primers (forward and reverse) were synthesized based on the DNA43509 sequence. 
fnr^rriPTR Primer 5'-CGTnTGCAGAACCTACTCAGGCAG-3- (SEQ ID NO:192) 
r.v.«.PrR Primer 5 -CCTCCACCAACTGTCAATGTTGTGG-3' (SEQ ID NO:193) 
30 Additionally, a syntheuc oligonucleotide hybridization probe was constructed from the consensus DNA43509 
sequence which had the following nucleotide sequence 

5'-AAAGTGCTGCTGCTGGGTCTGCAGACGCGATGGATAACGT-3' (SEQ ID NO: 194) 

Using the above described primers and Ubrary. a fiiU length clone was identified that contained a single open 
35 reading frame with an apparem translational initiation site at nucleotide positions 131-133 and ending at the stop 
codon found at mwleotide positions 587-589 (Figure 69: SEQ ID NO:189). The predicted polypeptide precursor is 
152 amino acids long, has a calculated molecular weight of approximately 17.170 daltons and an estimated pi of 
approximately 9.62. Analysis of the full-length PR0772 sequence shown in Figure 70 (SEQ ID NO: 190) evidences 
the presence of the following: a potential type 11 transmembrane domain from about amino acid 26 to about ammo 
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add 42. other po^ntial rransmembrane domains from about amino acid 44 to about amino acd 65. from about amino 
acid 81 to about amino acid 101 and from about ammo acid 109 to about amino acid 129. leucme zipper pattern 
sequences from about amino acid 78 to about amino acid 99 and from about amino acid 85 to about amino acid 106 
Clone UNQ410 (DNA49645-1347) has been deposited with ATCC on April 28. 1998 and is assigned ATCC deposit 



no. 209809. 



Analysis of the amino acid sequence of the ftiU-Iengdi PR0772 polypeptide suggests that it possesses 
significant sequence similarity to the human A4 protein, thereby indicating that PR0772 may be a novel A4 protein 
homolog. More specificaUy. an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced significant 
homology between the PR0772 amino acid sequence and the following Dayhoff sequences. HSU93305 1 
A4P_HUMAN. CELB0454.2. VPU.JSRV. CELCI2D12.2. OCCM.AGRTl. LBPHIGIE 50. YIGK ECOLl' 



10 S76245 and P R50807. 
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EXAMPLE 32 : Isolation of rT^iy^ r| ^ n r T Friending H,.»,fn pp rif^o 

A consensus sequence was obtained relative lo a variety of EST sequences as described in Example 1 above 
wherein the consensus sequence obtained is herein designated DNA34364. Based on the DNA34364 consensus 
sequence, oligonucleotides were synthesized: 1, to .dentily by PCR a cDNA library that contained the sequem:e of 
interest, and 2) for use as probes to isolate a done of the full-length coding sequence for PR0852. 

PCR primers (forward and reverse) were synthesized: 
forward PCR Primer 1 5--CGCAGAAGCTACAGATrCTCG.3* (SEQ ID NO: 197) 
forward PCR Pnmfr ? 5*-GGAAATTGGAGGCCAAAGC-3' (SEQ id NO:198) 

20 forward pcR primrr ^ S'-ggatgtagccagcaactgtg-3- (seqidno:199) 
fpr>vard pcR primer i 5--gccttggctcgitctcttc-3' (seq id no:200) 
fon^ard pCR primer 1 5-ggtcctgtgcctggatgg-3' (seq id no;2oi) 

reverse PCR Primpr 1 5'-GACAAGACTACCTCCGTrGGTC-3' (SEQ id NO:202) 

reverse PCR Primer?. S'-tgatgcacagttcagcacctgttg-3' (seq id no:203) 

Additionally, a synthetic oUgomicleotidc hybridization probe was constructed from the consensus DNA34364 
sequence which had the following nucleotide sequence 
hvbridiMtion prffjiy 

5 -CGCTCCAAGGGC-nTGACGTCACAGTGAAGTACACACAAGCAAGCTG-3' (SEQ ID NO:204) 

In order to screen several libraries for a source of a full-length clone. DNA from the libraries was screened 
by PCR ampUficadon with the PCR primer pair identified above. A positive library was then used to isoUte clones 
encoding die PR0852 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of 
the cDNA libraries was isolated from human fetal kidney tissue (LIB228). 

DNA sequencing of dje clones isolated as described above gave the ftill-length DNA sequence for PR0852 
Iherein designated as UNQ418 (DNA45493-I349)) (SEQ ID NO: 195) and the derived protein sequence for PR0852. 

nie entire micleotide sequence of UN0418 (DNA45493-1349) is shown in Figure 72 (SEQ ID N0195) 
Clone UNQ418 (DNA45493-I349) contains a smgle open reading frame with an apparent translational initiation site 
at nucleotide positions 94-96 and ending at the stop codon at nucleotide positions 16748-1650 (Figure 72). The 
predicted polypeptide precursor is 518 amino acids long (Figure 73). The luIHength PR0852 protein shown in 
Figure 73 has an estimated molecular weight of about 56.180 dahons and a pi of about 5.08. Analysts of the fuU- 
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length PR0852 sequence shown in Figure 73 (SEQ ID NO: 196) evidences the presence of the following: a signal 
peptide from about amino acid 1 to about amino acid 20. a transmembrane domam from about amino acid 466 to 
about amino acid 494, potential N-glycosylauon sites from about amino acid 170 to about amino acid 173 and about 
amino acid 366 to about amino acid 369. leucine zipper sequence pattern blocks from about amino acid 10 to about 
amino acid 31 and from about amino acid 197 to about amino acid 218 and blocks of amino acids having sequence 
5 homology to cukaryotic and viral aspartyl proteases from about amino acid 109 to about amino acid 1 1 8. from about 
amino acid 252 to about amino acid 261 and from about amino acid 298 to about amino acid 310. Clone UNQ418 
(DNA45493-1349) has been deposited with ATCC on April 28. 1998 and is assigned ATCC deposit no. 209805. 

Analysis of the amino acid sequence of the full-length PR0852 polypeptide suggests that it possesses 
signiiicant sequence similarity to various protease proteins, thereby indicating that PR0852 may be a novel protease 
10 protein or homolog thereof. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) 
evidenced significant homology between the PR0852 amino acid sequence and the following Dayhoff sequences. 
PEPC.HUMAN, S66516. S66517. PEPE_CH1CK, CATD_HUMAN. P_R74207. CARP.YEAST. PEP2_RABrr. 
CATE_HUMAN and RENI_MOUSE. 

15 i^YAMPf K 33 - |»"i«""n nf rHNA Honcs Encodiflg Human PR085? 

A consensus sequence was obtained telanve to a variety of EST sequences as described in Eiuunple 1 above, 
wherein the consensus sequence obtained is herein designated DNA43050. Based on the DNA43050 consensus 
seq«nce. oUgonudeotides were synthesized: 1) to identify by PGR a cDNA library that contained the sequence of 
interest, and 2) for use as probes to isolate a clone of the fiill-lenglh coding sequence for PR0853. 

20 Forward and reverse PCR primers were synthesized: 

f n, y.rrtPrR Primer 5'-CTTCATGGCCTTGGACTTGGCCAG-3' (SEQ ID NO:207) 
j...r.. PTR nrimer 5 -ACGCCAGTGGCCTCAAGCTGGTTG-3- (SEQ ID NO:208) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA43050 

sequence which had die following nucleotide sequence 

25 hyliririi/arion iirobe 

5 -CnTCTGAGCTCTGAGCCACGGTTGGACATCCTCATCCACAATGC-3' (SEQ ID NO-.209) 

In order to screen several libraries for a source of a full-length clone. DNA from the libraries was screened 
lor PCR anvlification with one of die PCR primer pairs identified above. A positive library was dien used to isolate 
clones encoding the PR0853 gene using the probe oUgonucleotide and one of the PCR primers. RNA for construction 
30 of the cDNA libraries was isolated from human fetal kidney tissue (LIB228). 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR08S3 
[herein designated as UNQ419 (DNA48227-1350)) (SEQ ID NO:205) and the derived protein sequence for PR0853. 

TTie eraire nucleotide sequence of UNQ419 (DNA48227-1350) is shown in Figure 74 (SEQ ID NO:205). 
Clone UNQ419 (DNA48227-1350) contains a single open reading frame with an apparem translational initiation site 
35 at nucleotide positions 128-130 and ending at the stop codon at nucleotide positions 1259-1261 (Figure 74). The 
predicted polypeptide precursor is 377 amino acids long (Figure 75). n« fiill-length PR0853 protein shown in 
Figure 75 has an estimated molecular weight of about 40.849 dalions and a pi of about 7.98. Important regions of 
the amino add sequence of PR0853 include the signal peptide, corresponding to amino acids from about 1 to about 
16 of SEQ ff) NO:206, the glycosaminoglycan attachment site, corresponding to amino acids from about 46 to about 
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49 of SEQ ID NO:206. and two sequences typical of the short<hain alcohol dehydrogenase family, corresponding 
.0 atiuno acids from about 37 to about 49 and about 1 14 lo about 124 of SEQ ID NO:206. respecttvelv Clone 
UNQ419 (DNA48227-1350) has been deposited with ATCC and .s assigm^d ATCC deposit no. 209812. ' 

EXAMPLE 34 : Isolaiion of cDNA n^j ^ ^ Fr pd'np Humap P p^^^fip 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 above, 
wherein the consensus sequence obtained is herein designated DNA38137. Based on the DNA38137 consensu 
sequence, oligonucleotides were syndiesized: 1) to identify by PCR a cDNA library that contained the sequence of 
interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO860. 

Forward and reverse PCR primers were synthesized: 
forwartl PCR mmrr 5 -GAAGGGACCTACATGTGTGTGGCC-3' (SEQ ID NO:212) 
rtfversg PCR PrinifJ 5'-ACTGACCTTCCAGCTGAGCCACAC-3' (SEQ ID NO:213) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA40654 
sequence which had the following nucleotide sequence 
hybridization nrnhf 

5-AGGACTACACGGAGCCTGTGGAGCTTCTGGCTGTGCGAATrcAGCTGGAA-3' 
(SEQIDNO:214) 

In order to screen several libraries for a source of a fiilHenglh clone. DNA from the libraries was screened 
by PCR amplilicadon with one of die PCR primer pairs identified above. A positive library was then used to isolate 
clones encoding the PRO860 gene using the probe oligonucleotide and one of the PCR primers. RNA fbr 
construction of the cDNA libraries was isolated from human fetal lung tissue (LIB26). 

DNA sequencing of die clones isolated as described above gave the ftill-length DNA sequence for PRO860 
therein designated as UNQ42I (DNA41404-1352)J (SEQ ID NO:210) and the derived protein sequence for PRO860. 

The emire nucleotide sequence of UNQ421 (DNA41404-1352) is shown in Figures 76A-B (SEQ ID 
NO:2I0). Clone UNQ421 (DNA4 1404- 1352) contains a single open reading frame with an apparent translational 
initiation site at nucleotide positions 58-60 and endmg at the stop codon at nucleotide positions 3013-3015 (Figures 
76A-B). The predicted polypeptide precursor is 985 amino acids long (Figure 77). TTic fuU-length PRO860 protein 
shown in Figure 77 has an estimated molecular weight of about 105.336 daltons and a pi of about 6.55. Important 
regions of the amino acid sequence of PRO860 inch.de dtt transmembrane region corresponding to about amino acids 
448wt67. the extracellular domain, coirespondii« to amino acids about 1^7. several N-glycosylaUon sites, numerous 
N-myrisioylation sites and a sequence typical of phosphotyrosine interaction domain proteins.. Clone UNQ421 
(DNA4I404-I352) has been deposited with ATCC and is assigned ATCC deposit no. 209844. 

E XAMPWF, Isolation of cDNA CIotim Pnr '^\ ^ <, Hum^n PR Om^ 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example I above, 
wherein the consensus sequence obtained is herein designated DNA39949. Based on the DNA39949 consensu^ 
sequence. oligonucIeoUdes were symhesized. 1) to identify by PCR a cDNA library that conuined the sequence of 
iniercsi. and 2) for use as probes to isolate a clone of the fiill-lengdi coding sequence for PR0846. 

Forward and reverse PCR primers were synthesized: 
fOTVart PTR nrimrT 5'-CCCTGCAGTGCACCTACAGGGAAG-3' (SEQ id N0:2I7) 
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^^v^ H'' Pf^^ 5'-CTGTCTTCCCCTGCTTGGCTGTGG.3' (SEQ ID N0:218) 

Additionally, a synihetic oligonudeoiide hybridization probe was consir\jcicd from the consensus DNA39949 
sequence which had the following nucleotide sequence 
|]yt^pHi7atinn Dfobe 

5'.GGTGCAGGAAGGGTGGGATCCTCTTCTCTCGCTGCTCTGGCCACATC-3' 
5 (SEQ ID NO:219) 

In order to screen several Ubraries for a source of a fijll-length clone. DNA from the libraries was screened 
by PGR anq)lificaaon with one of the PCR primer pairs identified above. A positive library was then used to isolate 
clones encoding the PR0846 gene using the probe oligonucleotide and one of the PCR primers. RNA for 
construction of the cDNA libraries was isolated from human fetal kidney tissue (LIB227). 
10 DNA sequcnciiig of the clones isolated as described above gave the ftili-lengih DNA sequence for PR0846 

[herein designated as UN0422 (DNA44196-1353)1 (SEQ ID N0:215) and the derived protein sequence for PR0846. 

The entire nucleotide sequence of UNQ422 (DNA441964353) is shown in Figure 78 (SEQ ID N0:215). 
Clone UNQ422 (DNA44 196- 1353) contains a single open reading frame with an apparent translational initiation site 
at nucleotide positions 25-27 and ending at the stop codon at nucleotide positions 1021-1023 (Figure 78). The 
15 predicted polypeptide precursor is 332 amino acids long (Figure 79) The full-length PR0846 protein shown in 
Figure 79 has an estimated molecular weight of about 36,143 daltons and a pi of about 5.89. Iroportani regions of 
the amino acid sequence of PR0846 include the signal pepude. the transmembrane domain, an N-glycosylaiion site, 
a sequence typical of fibrinogen beta and gamma chains C-terminal domain, and a sequence typical of Ig like V-type 
domain as shown in Figure 79. Clone UNQ422 (DNA44196.1353) has been deposited with ATCC and is assigned 
20 ATCC deposit no. 209847. 

KXAMPLE 36 : Isnlation of cQ NA Clones Encoding Human PROg62 

A consensus sequence was obtamed reladve to a variety of EST sequences as described in Example 1 above, 
wherein the consensus sequence obtained is herein designated DNA47370. Based on the DNA47370 consensus 
25 sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of 
inrcrcsi. and 2) for use as probes to isolate a clone of the full-length coding sequence for PR0862. 

Forward and reverse PCR primers were synthesized: 
fonv^rHPTRtirimer 5*GGGATCATGTTGTTGGCCCTGGTC.3' (SEQ ID NO:222) 
..v.r^ PTR Primer S'-GCAAGGCAGACCCAGTCAGCCAG-3- (SEQ ID NO:223) 
30 Additionally, a syntheuc oligonucleotide hybridization probe was constructed from the consensus DNA47370 
sequence which had the following nucleotide sequence 
l^yl^riri^Tation probe 

5'.CrGCCTGCTACCCTCCAAGTGAGGCCAAGCTCTACGGTCGTTGTG-3' 
(SEQ ID NO:225) 

35 In Older to screen several libraries for a source of a fuIHenglh clone. DNA from the libraries was screened 

by PCR aiq>lificarion with one of the PCR primer pairs idcnufied above. A positive library was then used to isolate 
clones encoding the PR0862 gene usmg the probe oligonucleotide and one of the PCR primers. RNA for 
construction of the cDNA libraries was isolated from human pancreas ussue (LIB55). 
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DNA sequencing of the clones isolated as described above gave ihe (uIMength DNA sequence for PR0862 
(herein designated as UNQ424 {DNA52187-1354)1 (SEQ ID NO:220) and the derived protein sequence for PR0862. 

The entire nucleotide sequence of UNQ424 fDNA52I87-l354) is shown in Figure 80 (SEQ ID NO:220) 
Clone UNQ424 fDNA52 187- 1354) contains a single open reading frame with an apparent translational initiation site 
at nucleotide positions 410412 and ending at the stop codon at nucleotide positions 848-850 (Figure 80). TTie 
predicted polypeptide precursor is 146 amino acids long (Figure 81). The full-length PR0862 proteui shown in 
Figure 81 has an estimated molecular weight of about 16.430 dalions and a pi of about 5.05. taiportant regions of 
the amino acid sequence of PR0862 include the signal peptide, an N-myristoylation site, and sequences" having 
similarity to region to Alpha-lactalbumin/lysozyme C proteins as shown in Figure 81. Clone UNQ424 (DNA52I87. 
1354) has been deposited with the ATCC and is assigned ATCC deposit no. 209845. 



EXAMPLE 37 : Isolation nf rn|M/\ r | oncs Fnrf^ i p f > Human PROI^M 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example I above, 
wherein the consensus sequence obtained is herein designated DNA40666. Based on the DNA40666 consensus 
sequence. oUgonucleotides were synthesized: I) to identify by PGR a cDNA library that contained the sequence of 
15 interest, and 2) for use as probes to isolate a clone of die full-lengtii coding sequence for PR0864 
Forward and reverse PCR primers were synthesized. 

fprward PCR primer 5 -gctgcacctgcaaattccactgg-3* (SEQ id no:227) 

reverse PCR primer 5 -TGGTGGGAGACTGTTTAAATTATCGGCC-3' (SEQ ID N0:228) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA40666 
20 sequence which had die following nucleotide sequence 
hybridization prohe 

5'-TGCTTCGTCAAGTGCCGGCAGTGCCAGCGGCTCGTGGAGTT-3' 
(SEQ ID NO:229) 

to order lo screen several Ubraries for a source of a full-length clone. DNA from the libraries was screened 
by PCR ampUfication with one of die PCR primer pairs identified above. A positive libraiy was then used to isolate 
clones encoding the PR0864 gene using the probe oligonucleonde and one of the PCR primers. RNA for 
construction of the cDNA libraries was isolated from human fetal brain tissue (UB153). 

DNA sequendng of die clones isolated as described above gave the full-length DNA sequence for PR0864 
(herein designated as UNQ426 (DNA48328-1355)] (SEQ ID NO:225) and the derived protein sequence for PR0864. 

The entire nucleotide sequence of UNQ426 (DNA48328-1355) is shown in Figure 82 (SEQ ID NO:225) 
Clone UNQ426 (DNA48328-1355) contains a single open reading frame with an apparent translational iniuation site 
at nucleotide positions 37-39 and ending at the stop codon at nucleotide positions 1090-1092 (Figure 82) The 
predicted polypeptide precursor is 351 amino acids long (Figure 83). The full-lengtii PR0864 protein shown in 
Figure 83 has an estimated molecular weight of about 39.052 and a pi of about 8.97. Important regions of the amino 
35 add sequence of PR0864 include the signal peptide, two N-glycosylation sites, a Wnt-1 family signamre sequence, 
and sequence regions homologous to Wnt-1 family proteins as shown in Figure 83. Clone UNQ426 (DNA48328- 
1355) has been deposited with ATCC and is assigned ATCC deposit no. 209843. 
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FV AMPLE 38 - l r"'p'i" n nf 1 -P NA F.nrndino Human PRQ793 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example I above, 
wherein the consensus sequence obtained is herein designated DNA38106. Based on the DNA38106 consensus 
sequence, oligonucleotides were synthesized: 1) to idenuly by PGR a cDNA library that contained the sequence of 
iraerest. and 2) for use as probes to isolate a clone of the full-length coding sequence for PR0792. 

5 A pair of PGR primers (forward and reverse) were synthesized: 

fpp..rH PTR nrimer S'-GCGAGAACTGTGTCATGATGGTGCS' (SEQ ID NO:232) 

rr vrr.. PPB nrimer S'-GTTTCTGAGACTGAGGAGCGGTGG-B' (SEQ ID NO:233) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA38106 

sequence which had the following nucleotide sequence 

10 t1Y^r''^'^="■»" "fo*"' 

5 -CACCGTGTGACAGCGAGAAGGACGGCTGGATCTGTGAGAAAAGGCACAAC-3' (SEQ ID NO:234) 

In order to screen several Ubraries for a source of a full-length clone. DNA from the libraries was screened 
by PGR amplification with the PGR primer pair identified above. A positive library was then used to isoUte clones 
encoding the PR0792 gene using the probe oligonucleotide and one of the PGR primers. RNA for construction of 
15 the cDNA libraries was isolated from human bone marrow tissue (LIB255). 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR0792 
(herein designated as UNQ431 (DNA56352-1358)) (SEQ ID NO:230) and die derived protein sequence for PR0792. 

TTie entire nucleotide sequence of UNQ431 (DNA56352-1358) is shown in Figure 84 (SEQ ID NO:230). 
Glone UNQ431 (DNA56352-I3S8) contains a single open reading frame with an apparent translational iniuaiion site 
20 at nucleotide positions 67-69 and ending at the stop codon at nucleotide positions 946-948 (Figure 84). The predicted 
polypeptide precursor is 293 amino acids long (Figure 85). The full-length PR0792 protein shown in Figure 85 has 
an estimated molecular weight of about 32.562 dalions and a pi of about 6.53. Analysis of the full-length PR0792 
sequence shown in Figure 85 (SEQ ID NO:231) evidences the presence of the following: a type II transmembrane 
domain from about amino acid 31 to about amino acid 54. potential N-glycosylarion sites from about amino acid 73 
to about amino add 76 and from about amino acid 159 to aboui amino acid 162. a leucine zipper amino acid sequence 
pattern from about amino acid 102 to about amino acid 123. potential N-myristolation sites from about amino acid 
18 to about amino acid 23. from about amino acid 133 to about amino acid 138 and from about amino acid 242 to 
about amino acid 247 and a G-type lectin domain signature block from about amino acid 264 to about amino acid 287. 
Clone UNQ43I (DNA56352-1358) has been deposited with ATCG on May 6. 1998 and is assigned ATCC deposit 
30 no. 209846. 

Analysis of the amino acid sequence of the lull-length PR0792 polypeptide suggests that it possesses 
significant sequence similarity to die CD23 protein, thereby indicating that PR0792 may be a novel CD23 homolog. 
Mort specificany. an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced significant homology 
between ihe PR0792 amino acid sequence and the following Dayhoff sequences. S34198. A07100_l. A05303_l. 
35 P R41689, P_P82839. A10871_l . P_R12796. P_R47199. A46274 and P_R321 88. 
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A consensus sequence was obuined relative lo a variety of EST sequences as described in Example I above, 
wherein the consensus sequence obtained is herein designated DNA44708. Based on the DNA44708 consensus 
sequence, oligonucleotides were synthesized: 1) ,o identify by PCR a cDNA library that contained the sequence of 
interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR0866. 
PCR primers (forward and reverse) were synthesized: 

fopvari PCR prinwrr I 5*-cagcactgccaggggaagaggg-3' (Seq id no:237) 

foPm<JPCR mmr? S'-CAGGACTCGCTACGTCCG-3' (SEQ id N0:238) 
forwqrd PCR PTiny^r ^ 5'-CAGCCCCTTCTCCTCCnTCTCCC-3' (SEQ ID NO:239) 
rcvereg PCR priTper I S'-GCAGTTATCAGGGACGCACTCAGCC.B" (SEQ ID NO:240) 

revtrse PCR primer 2 5'-ccagcgagaggcagatag-3' (seq id nO:24I) 

f Cversc PCR Pripif r ^ 5'-CGGTCACCGTGTCCTGCGGGATG-3- (SEQ id N0:242) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA44708 

sequence which had the following nucleotide sequence 

hybridization pm^y 

5 .CAGCCCCTrcTCCTCCTTTCTCCCACGTCCTATCTGCCTCTC.3- (SEQ ID NO:243) 

In order to screen several libntnes for a source of a full-length clone. DNA from the libraries was screened 
by PCR amplification with one of the PCR primer pairs idemified above. A positive library was then used to isolate 
clones encoding the PR0866 gene using the probe oligonucleotide and one of the PCR primers. RNA for 
construction of the cDNA libraries was isolated from human fetal kidney (issue (LIB228). 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR0866 
[herein designated as UNQ435 (DNA5397M 359)J (SEQ ID NO:235) and the derived protein sequence for PR0866. 

The entire nucleotide sequence of UNQ435 (DNA53971-I359) is shown in Figure 86 (SEQ ID NG:235). 
Clone UNQ435 (DNA53971.1359) contains a single open reading frame with an apparent translational initiation site 
at nucleotide positions 275-277 and ending at the stop codon at nucleotide positions 1268-1270 (Figure 86). The 
predicted polypeptide precursor is 331 amino acids long (Figure 87). The full-length PR0866 protein shown in 
Figure 87 has an estimated molecular weight of about 35.844 dalions and a pi of about 5.45. Analysis of the full- 
leiigth PR0866 sequence shown in Figure 87 (SEQ ID NO:236) evidences the presence of d« following: a signal 
peptide fioffl about amino acid I to about amino acid 26. Clone UNQ435 (DNA53971-1359) has been deposited with 
ATCC on April 7, 1998 and is assigned ATCC deposit no. 209750. 

Analysis of the amino acid sequence of the fuU-length PR0866 polypeptide suggests that it possesses 
significant sequence similarity to the mindin/spondin family of proteins, thereby indicating thai PR0866 may be a 
novel mindin homolog. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProi 33) 
evidenced significam homology between the PR0866 amino acid sequence and the following Dayhoff 
sequences.AB006085_l. AB006084_1. AB006086_1. AF0I7267_I, CWU42213_1, AC004160 I. CPMICRP 1, 
S49108. A48569 and 146687. 

EXAMPLE 40 : Isolation of cDNA Clnn... Ft 'M I I 'w Human PBr>fl7| 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 above, 
wherein the consensus sequence obtained is herein designated DNA40324. Based on the DNA40324 consensus 
sequence, oligomicleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of 
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inicrest. and 2) for use as probes to isolate a clone of ihe full-length coding sequence for PR0871. 

PCR primers (forward and reverse) were synthesized: 
f...^ ^.PrR nrimerl 5 -TGCGGACATCCTACTGGCACAGGG-3- (SEQ ID NO:246) 
..p.„,HPrB nnmer2 5'-CGAGTTAGTCAGAGCATG-3' (SEQ ID NO:247) 
f.^.,-, rMPrR nnmer3 S'-CAGATGGTGCTGTTGCCG-S' (SEQ ID NO:248) 
5 r- rr^ pen primer I 5'-CAACTGGAACAGGAACTGAGATGTGGATC-3* (SEQ ID NO:249) 
^p r r'^prpnrimerl S'-CTGCTrCAGCAGTOCAAGGGTCTGO' (SEQ ID NO:250) 
yp rp,PrRnrin.er3 S -CCTCTCCGATTAAAACGCO- (SEQ ID NO:25l) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA40324 
sequence which had the following nucleotide sequence 
to hvWistP*""" 

5'^AGAGGACTGGTrGCCATGGCAAATGCTGGTTCTCATGATAATGG-3' (SEQ ID NO:252) 

In order to screen several Ubraries for a source of a full-length done. DNA from the libraries was screened 
by PCR amplification with one of the PCR primer pairs identified above. A positive library was then used to isolate 
clones encoding the PR0871 gene using the probe oligonucleotide and one of the PCR primers. RNA for 
15 construction of the cDNA libraries was isolated from human fetal kidney tissue (UB227). 

DNA sequencing of the clones isolated as described above gave the fiill-length DNA sequence for PR0871 
(herein designated as UNQ438 (DNA50919-136I)1 (SEQ ID NO:244) and the derived protein sequence for PR0871 . 

n,e entire nucleotide sequence of UNQ438 (DNA50919.1361) is shown in Figure 88 (SEQ ID NO:244). 
Clone UNQ438 (DNA50919-I361) contains a single open reading frame with an apparem translational initiation site 
20 at nucleotide positions 191-193 and ending at the stop codon at nucleotide positions 1607-1609 (Figure 88). TTie 
predicted polypeptide precursor is 472 amino acids long (Figure 89). The full-length PR0871 protein shown in 
Figure 89 has an estimated molecular weight of about 53.847 daltons and a pl of about 5.75. Analysts of the lull- 
lengdi PR0871 sequence shown in Figure 89 (SEQ ID NO:245) evidences the presence of the followmg: a signal 
peptide from about amino acid I to about amino acid 21. potential N-glycosylation sites from about ammo acid 109 
25 to about amino acid 1 12 and from about amino acid 201 to about amino acid 204. a cyclophilin-type peptidy-prolyl 
cis-trans isomerase signamre sequence from about amino acid 49 to about amino acid 66 and regions that are 
homologous to cyclophilin-iype peptidy-prolyl cis-trans isomerases fiom about amino acid 96 to about ammo acid 140. 
from about amino acid 49 to about amino acid 89 and from about amino acid 22 to about amino acid 51 . Clone 
UNQ438 (DNA50919-1361) has been deposited with ATCC on May 6. 1998 and is assigned ATCC deposit no. 
30 209848. 

Analysis of the amino acid sequence of the ftill-length PR0871 polypeptide suggests that it possesses 
significant sequence similarity to the cyclophilin family of proteins, thereby indicating that PR0871 may be a novel 
cyclophilin protein famUy member. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 
35) evidenced significant homology between U« PR0871 amino acid sequence and the following Dayhoff sequeiKes. 
35 SPBC16H5_5. S64705. YAL5_SCHP0. CYP4_CAEEL. CELC34D4_7. CYPA_CAEEL. HUMORF006_l. 
CYPI_MYCTU. AF043642_1 and HSSRCYP.l. 

FXAMPLE 41 - l-TTlnTini ftf fPNA f'""^ FnrndinP Human PRQ873 
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A consensus sequence was obtained relative to a vanety of EST sequences as described in Example 1 above, 
wherein the consensus sequence obtained ts herein designated DNA39621. Based on the DNA39621 consensus 
sequence, oUgonucIeotides were synthesized: 1) to identify by PCR a cDNA library that comai™:d the sequence of 
interest, and 2) for use as probes to isolate a clone of the fulHengUi coding sequence for PR0873. 
A pair of PCR primers (forward and reverse) were synthesized: 

forward PCR primer 5 -aggtgcctgcaggagtcctgggg-3* (seq id no:255) 
reverse pcR pr i mer 5'- ccacctcaggaagccgaagatgcc-3' (Seq id no:256) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA39621 
sequence which had the following nucleotide sequence: 

hvbridizailon prohe 

5'-GAACGGTACAAGTGGCTGCGCTTCAGCGAGGACTGTCTGTACCTG-3- (SEQ ID NO:257) 

In order to screen several libraries for a source of a full-length clone. DNA from die libraries was screened 

by PCR amplifkadon with the PCR primer pair identified above. A pos.uvc library was then used to isolate clones 

encoding the PRG873 gene using d« probe oligonucleotide and one of the PCR primers. RNA for construction of 

the cDNA libraries was isolated from human fetal liver tissue (L1B229). 

DNA sequencing of the clones isolated as described above gave the fiill-length DNA sequence for PR0873 

[herein designated as UNQ440 (DNA44I79-1362)J (SEQ ID NO:253) and d« derived protein sequence for PR0873. 
The emire nucleotide sequence of UNQ440 (DNA44179-1362) is shown in Figure 90 (SEQ ID NO-253) 

Clone VNCm (DNA44179-1362) contains a single open reading frame with an apparem translational imtiation site 

at nucleotide positions 139-141 and ending at the stop codon at nucleotide positions 1774-1776 (Figure 90). nie 
predicted polypeptide precursor is 545 amino acids long (Figure 91). TTie ftill-length PR0873 protein shown in 
Figure 91 has an estimated molecular weight of about 58.934 daltons and a pi of about 9.45. Analysis of the ftill- 
length PR0873 sequence shown in Figure 91 (SEQ ID NO:254) evidences the presence of the following features, 
a signal peptide fiom about amino acid I to about amino acid 29; a carboxylesierase type-B serme active site at about 
ami«, acid 312 to about ami«, acid 327; a carboxylesteiase type-B signahire 2 motif at about amino acid 218 to about 
amino acid 228; and three potemial N-glycosylation sites at about ammo acid 3 1 8 to about amino acid 321 . about 
amino acid 380 to about amino acid 383. and about amino acid 465 to about amino acid 468. Clone UNQ440 
(DNA44179.I362) has been deposited with ATCC on May 6. 1998 and is asstgned ATCC deposit no. 209851. 

Analysis of Ae amino acid sequence of Ac full-length PR0873 polypeptide suggests that it possesses 
significant sequence similarity to a human liver carboxylesterase. thereby indicating that PR0873 may be a novel 
carboxylesterase. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidcnt»d 
significant homology between the PR0873 amino acid sequence and the following Dayhoff sequences: ESIO RAT 
GEN12405. AB010633J. EST4_RAT. A48809. SASB_ANAPL. RNU41662 I. RNU22952 1. BAl'raT 
GEN13522. 



35 EXAMPLI;42 : Iwlarion of cDNa ciong. Pf^-r^ m Human PRnmp 

A consensus sequence was obuined relative to a variety of EST sequences as described in Example 1 above, 
wherein the consensus sequence obuiined is herein designated DNA47442. Based on the DNA47442 consensu^ 
sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that comained the sequence of 
interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO940. 
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A pair of PCR primers (forward and reverse) were synthesized: 
f^p..^r ^ prR nrimer 5 -CAAAGCCTGCGCCTGGTCTGTG-3- (SEQ ID NO:260) 
avsm-KSMSL 5 -1TCTGGAGCCCAGAGGGTGCTGAG-3' (SEQ ID NO:262) 
Addiiionally. a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA47442 
sequence which had the following nucleotide sequence 

5-.GGAGCTGCCACCCATTCAAATGGAGCACGAAGGAGAGTrCACCTG-3- (SEQ ID NO:263) 

In order to screen several Ubraries for a source of a full-length done. DNA from the libraries was screened 
by PCR ampUfication with the PCR primer pair identified above A positive library was then used to isolate clones 
encoding the PRO940 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of 
10 the cDNA libraries was isolated from human fetal liver tissue (LIB229). 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PRO940 
[herein designated as UNQ477 (DNA54002-1367)1 (SEQ ID NO:258) and the derived protein sequence for PRO940. 

The entire nucleotide sequence of UNQ477 (DNA54002-1367) is shown in Figure 92 (SEQ ID NO:258). 
Clone UNQ477 (DNA54002-1367) contains a single open reading frame with an apparcm translational initiation site 
15 at nucleotide positions 46^8 and ending at the stop codon at nucleotide positions 1678-1680 (Figure 92). The 
predicted polypeptide precursor is 544 amino acids long (Figure 93). The full-length PRO940 protein shown in 
Figure 93 has an estmiated molecular weight of about 60.268 daltons and a pl of about 9.53. Analysis of the full- 
length PRO940 sequence shown in Figure 93 (SEQ ID NO:259) evidences the presence of the following: a signal 
peptide from about amino acid 1 to about amino acid 15. potential N-glycosylation sites from about amino acid 100 
20 10 about amino acid 103. from about amino acid 297 to about amino acid 300 and from about amino acid 306 to about 
amino acid 309 and an immunoglobulin and major histocompatibility complex signamre sequence block from about 
amino acid 365 to about ammo acid 371. Clone UNQ477 (DNA54002-1367) has been deposited with ATCC on April 
7, 1998 and is assigned ATCC deposit no. 209754. 

Analysis of the amino acid sequence of the full-length PRO940 polypeptide suggests that it possesses 
25 significant sequence similarity to CD33 and the OB binding pro.c.n-2. More specifically, an analysis of the Dayhoff 
database (version 35.45 SwissProt 35) evidenced signifkant homology between the PRO940 ami»> acid sequence and 
the fonowing Dayhoff sequences. CD33_HUMAN. HSU71382.1. HSU71383_1. D86359_l, PGBM.HUMAN. 
MAGS.MOUSE. D86983_l. C22B_HUMAN. P_W01002 and HVU24n6_l. 

30 F-XAMPLE 43 : Isnlaiion of c^ N* Clones Encodinp Human PR0941 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 above, 
wherein the consensus sequence obtained is herein designated DNA35941. An EST sequence proprietary to 
Genentech was employed in the assembly and is herem designated DNA6415 (Figure 96; SEQ ID NO:265). Based 
on the DNA35941 consensus sequence. oUgonuclcoudes were synthesized: 1) to identify by PCR a cDNA library 

35 that contained the sequence of interest, and 2) for use as probes to isolate a clone of the fiiU-lcngth coding sequence 
forPR0941. 

A pair of PCR primers (forward and reverse) were synthesized: 
fa».,rHPrR nrimer 5--CTTGACTGTCTCTGAATCTGCACCC-3' (SEQ ID NO:266) 
r,^.r«^PrR Primer 5'-AAGTGGTGGAAGCCTCCAGTGTGG-3- (SEQ ID NO:267) 
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Additionally, a synthetic oligonucleotide hybridizauon probe was constructed from the consensus DNA35941 
sequence which had the following nucleotide sequence 
hvbridiTatiffi^ pr^hy 

5 -CCACTACGGTATTAGAGCAAAAGTTAAAAACCATCATGGTTCCTGGAGCAGC-3- (SEQ ID NO:268) 
to order to screen several libraries for a source of a full-length clone. DNA from the hbraries was screened 
by PCR amplificadon with the PCR pnmer pair identified above. A pos.uve libraor was then used to isolate clones 
encoding the PR0941 gene using the probe oligonucleotide and one of the PCR pruners. RNA for construction of 
the cDNA libraries was isolated from human fetal kidney tissue (LIB227). 

DNA sequencing of the clones isolated as described above gave the fiiU-length DNA sequence for PR094I 
(herein designated as UNQ478 (DNA53906.1368)) (SEQ ID NO:263) and the derived protein sequence for PR0941. 

The entire nucleotide sequence of UNQ478 (DNA53906-1368) is shown in Figure 94 (SEQ ID NO-263) 
Clone UNQ478 (DNA539«^1368) contains a single open reading frame with an apparent translational initiation site 
at nucleotide positions 37-39 and ending at die stop codon a. nucleotide positions 2353-2355 (Figure 94) TIk 
predicted polypeptide precursor is 772 amino acids long (Figure 95). TTie fiill-length PR0941 protein shown in 
Figure 95 has an estimated molecular weight of about 87.002 daltons and a pi of about 4.64. Analysis of the full- 
length PR0941 sequence shown in Figure 95 (SEQ ID NO:264) evidences the presence of the following: a signal 
peptide from about amino acid 1 to about amino acid 21. potential N-glycosylation sites from about amino acid 57 
to about amino acid 60. from about amino acid 74 to about amino acid 77. from about amino acid 419 to about amino 
acid 422. from about amino acid 437 to about amino acid 440. from about amino acid 508 to about amino acid 51 1 
from about amino acid 515 to about amino acid 518. from about amino acid 516 to about amino acid 519 and from 
about amino acid 534 to about amino acid 537. and cadherin extracellular repeated domain signamre sequences from 
about amino acid 136 to about amino acid 146 and from about amino acid 244 to about amino acid 254 Clone 
UNQ478 (DNA53906-1368) has been deposited with ATCC on April 7. 1998 and is assigned ATCC deposi, no 
209747. 

Analysis of the ami«, acid sequence of die full-length PR0941 polypeptide suggests that it possesses 
significam sequence similarity to a cadherin protein. ti«reby indicating Uu« PR094 1 may be a novel cadherin protem 
family member. More specifically, an analysis of the Dayholf database (version 35.45 SwissProt 35) evidenced 
significant homology between the PR0941 amino acid sequence and the following Dayhoff seque«:es 150180 
CADA.CHICK. I50I78. GEN12782. CADC.HUMAN. P_W25637. A38992. P.R49731. D38992 and G02678. ' 

EXAMPIrF. 44 : Isolation of cDNA rinnft. Fnr o ding Hiinyn 

Aconsensus sequence was Obtained relative toavarietyofEST sequences as described in Example 1 above, 
wherein the consensus sequence obtained is herein designated DNA47374. A variety of proprietary Genentech EST 
sequences were employed in the assembly and are shown in Figures 99-107. Based on the DNA47374 consensus 
sequence. oBgomKrleoudes were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of 
interest, and 2) for use as probes to isolate a clone of die full-lengtf» coding sequence for PR0944. 

A pair of PCR primers (forward and reverse) were synthesized: 
fgrvart PTR prim 5'-CGAGCGAGTCATGGCCAACGC-3' (SEQ id NO:280) 

reverse PCR nrimr r 5 -ctgtcacacgtagtctttcccgctgg-3- (seq id no:281) 
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Additionally, a synthetic oligonucleotide hybndization probe was constructed from the consensus DNA47374 
sequeiKC which had the following nucleotide sequence 
{,Y^i^iHj7apon nrobe 

5'-CTGCAGCTGTrGGGCTTCATTCTCGCCTTCCTGGGATGGATCG-3- (SEQ ID NO:282) 

In order to screen several libraries for a source of a full-length clone. DNA from the libraries was screened 
5 by PGR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones 
encoding the PR0944 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of 
the cDNA libraries was isolated from human fetal kidney tissue (UB227) 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR0944 
[herein designated as UNQ481 (DNA52185-1370)] (SEQ ID NO 269) and the derived protein sequence for PR0944. 
10 The entire nucleotide sequence of UNQ481 (DNA52185-1370) is shown in Figure 97 (SEQ ID NO:269). 

Clone UNQ481 (DNA52 185- 1370) contains a single open reading frame with an apparem iranslational initiation site 
at nucleotide positions 219-221 and ending at the stop codon at nucleotide positions 852-854 (Figure 97). The 
predicted polypeptide precursor is 211 amino acids long (Figure 98). The fUll-lcngth PR0944 protein shown in 
Figure 98 has an estimated molecular weight of about 22.744 daltons and a pi of about 8.51. Analysis of the full- 
15 length PR0944 sequence shown in Figure 98 (SEQ ID NO:270) evidences the presence of the following: a signal 
peptide from about amino acid I to about amino acid 21. transmembrane domains from about amino acid 82 to about 
amino acid 102. from about amino acid 1 18 to about amino acid 142 and from about amino acid 161 to about amino 
acid 187. a potential N-glycosylation site from about amino acid 72 to about amino acid 75. a sequence block having 
homology to PMP-22/EMP/MP20 family of proteins from about amino acid 70 to about amino acid 111 and a 
20 sequence block having homology to ABC-2 type transport system integral membrane protein from about amino acid 
119 to about amino acid 133. Clone UNQ481 (DNA52 185- 1370) has been deposited with ATCC on May 14. 1998 
and is assigned ATCC deposit no. 209861. 

Analysis of the amino acid sequence of the full-length PR0944 polypeptide suggests that it possesses 
signiftcam sequence similarity to the CPE-R prDtein. thereby indicating that PR0944 may be a novel CPE-R 
25 homolog. Mottspecifically.ananalysisofUttDayhoffdaubase(version35.45SwissProt35)evidencedsignific^^ 
homology between die PR0944 amino acid sequence and the following Dayhoff sequences. AB0007I3.1. 
AB00a714_l. AF035814_1. AF000959.I. HSU89916J. EMP2_HUMAN. JC5732. CELF53B3.6. PM22_MOUSE 
andCGU49797_l. 

30 F yAMPI.E4S : l^nlafinn of cDfJA T'""« Encoding Human PRQ983 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 above, 
wherein the consensus sequence obtained is herein designated DNA47473. Various proprietary Genentech EST 
sequences were employed in the assembly, wherein those EST sequences are shown in Figures 1 10-1 16. Based on 
the DNA47473 consensus sequence, oligonucleotides were synthesized: I) to identify by PCR a cDNA library that 

35 oamM the sequence of interest, and 2) for use as probes to isolate a clone of the full-lengdi coding sequence for 
PR0983. 

A pair of PCR primers (forward and reverse) were synthesized: 
fn,w,«lPrRnrimer 5 -GCACCACCGTAGGTACTrGTGTGAGGC-3' (SEQ ID NO:292) 
r.v.r«l»rRnrimer 5 -AACCACCAGAGCCAAGAGCCGGG -3' (SEQ ID NO:293) 
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Additionally, a synthetic oligonucleotide hybridization probe was consm,ctcd from the consensus DNA47473 
sequence which had the following nucleotide sequence 
hybridization probe 

5--CAGCGGAATCATCGATGCAGGGGCCTCAATrAATCTATCTGTCATCTrAC-3' (SEQ ID NO:294) 

in order to scn^n several Bbiaries for a source of a fiill-length clone. DNA from .he libraries was screened 
by PCR anvlificadon with the PGR primer pair identifKd above. A positive library was then used to isolate clones 
encodmg the PR0983 gene using the probe oligonucleodde and one of the PCR primers. RNA for constn^don of 
the cDNA libraries was isolated from human bone marrow (UB256). 

DNA sequencing of the clones isolated as described above gave the Mi-length DNA sequence for PR0983 
(herein designated as UNQ484 (DNA53977-1371)1 (SEQ ID NO:283) and the derived protein sequence for PR0983 
•me entire nucleotide sequence of UNQ484 (DNA53977-I37I) is shown in Figure 108 (SEQ ID NO-283) 
Clone UNQ484 (DNA33^-1371) contains a single open reading frame with an apparent translational inidation site 
at nucleotide positions 234-236 and ending at the stop codon a. nucleotide positions 963-965 (Figure 108) The 
predicted polypeptide precursor is 243 amino acids long (Figure 109). m full-lengd, PR0983 protein shown in 
Figure 109 has an estimated molecular weight of about 27.228 daltons and a pi of about 7.43. Analysis of the hill- 
length PR0983 sequence shown in Figure 109 (SEQ ID NO:284) evidences the presence of the followtng features^ 
a putative transmembrane domain from about amino acid 224 to about amino acid 239; a potential N-glycosylation 
stte from about amino acid 68 to about amino acid 71: and three potential N-myris,oylation sites from about amino 
acd 59 to about amino acid 64. from about amino acid 64 to about amino acid 69. and from about amino acid 235 
to about amino acid 240. Clone UNQ484 (DNA53977.1371) has been deposited with ATCC on May 14. 1998 and 
is assigned ATCC deposit no. 209862. 

Analysis of the amino acid sequence of the full-length PR0983 polypeptide suggests that it possesses 
sigmficant sequence similarity to die vesicle-associated protein. VAP.33. thereby indicating that PR0983 mav be a 
novel vesicle associated membrane protein. More specifKally. an analysis of the Dayhoff database (venion 35 45 
SwissProt 35) evidenced sigmficam homology between PR0983 amino acid sequence and die following Dayhoff 
sequences: VP33_APLCA. CELF33D11_12. CEIj:42G2_2. S50623. YDFC SCHPO. CELF54H5 2. CELZC196 8 
CEF57A10_3. MSP3_GL0R0. CECISHUJ. 



EXAMPLF 46 : Isolation t-nt^fi ^ Q^j^^ pncndin» H..n, n n ff \ ^ \ ^ 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 above, 
wherein the consensus sequence obtained is herein designated DNA49808. Based on dte DNA49808 eonsenn» 
sequence, oligonucleotides were syndtesized: 1) to identify by PCR a cDNA library dut contained d« sequence of 
interest, and 2) for use as probes to isolate a clone of the full-lcngtf, coding sequence for PROI057. 
PCR primers (forward and reverse) were synUiesized: 

fffpyard PCR primrr 5'-gcatctgcaggagagagcgaaggg-3' (seq id no:297) 

35 fcvcrsc FTP prinrr 5--CATCGTrCCCGTGAATCCAGAGGC 3- (SEQIDN0:298) 

Additionally, a symhetic oligonucleotide hybridization probe was constructed from the consensus DNA49808 
sequence which had the following nucleotide sequence 

hvbridi^;a[j^fp 

5 -GAAGGGAGGCCTrcCTITCAGTGGACCCGGGTCAAGAATACCCAC-3' (SEQ ID NO:299) 
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In order to screen several libraries for a source of a full-length clone. DNA from the libraries was screened 
by PCR amplification with the PGR pnmer pair idei>uf.ed above. A positive library was then used to isolate clones 
encoding the PRO1057 gene using the probe oligonucleotide and one of the PCR primers. RNA for coiBtrucnon of 
the cDNA libraries was isolated from human fetal Iddney tissue (UB227) 

DNA seqticncing of the clones isolated as described above gave the full-length DNA sequence for PRO1057 
5 (herein designated as UNQ522 (DNA57253-1382)] (SEQ ID NO:29S) and the derived protein sequence for PRO1057. 

The entire nucleotide sequence of UNQ522 (DNA57253-1382) is shown in Figure 1 17 (SEQ ID NO:29S1 
Clone UNQ522 (DNA57253-1382) contains a single open reading frame with an apparent translaUonal initiation site 
at nucleotide positions 275-277 and ending at the stop codon at nucleotide positions 1514-1516 (Figure 117). The 
predicted polypeptide precursor is 413 amino acids long (Figure 1 18) The full-length PRO1057 protein shown in 
10 Figure 1 18 has an estimated molecular weight of about 47.070 daltons and a pi of about 9.92. Analysis of the full- 
length PRO1057 sequence shown in Figure 118 (SEQ ID NO:296) evidences the presence of the following: a signal 
peptide from about amino acid I to about amino acid 16. potential N-glycosylaiion sites from about amino acid 90 
to about amino acid 93. from about amino acid 1 10 to about amino acid 1 13 and from about amino acid 193 to about 
amino acid 196. a glycosaminoglycan attachment site from about amino acid 236 to about ammo acid 239 and a serine 
15 protease histidine-containing active site from about amino acid 165 to about amino acid 170. Clone UNQ522 
(DNA57253-1382) has been deposited with ATCC on May 14. 1998 and is assigned ATCC deposit no. 209867. 

Analysis of die amino acid sequence of the full-length PRO1057 polypeptide suggests that it possesses 
significant sequence similarity to various ptotease proteins, thereby indicating that PRO1057 may be a novel protease. 
Mote specificaUy. an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced significant homology 
20 between the PRO1057 amino acid sequence and die following Dayhoff sequcnces.TRYE_DROER. P_R14159. 
A69660. EBNl.EBV. S65494. GEN12688. A51084_l. P_R9957l. A57514 and AF003200_l. 

FyAMPLE47 - I rl^ i rn "f tPNA r.lones F.ncoding Human PRQ107) 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 above. 
25 wherein the consensus sequence obtained is herein designated DNA53035. Based on the DNA53035 consensus 
sequence, it was determined that that consensus sequence shared significant sequence identity with Incyte EST clone 
no. 2872569. a clone that upon review appeared to encode a full length protein. As such. Incyte EST clone no. 
2872569 was purchased and its insert was obtained and sequenced so as to confirm the proper sequence. This 
sequence is herein designated UNQ528 or DNA58847-1383. 
30 DNA sequencing of die clone isolated as described above gave the fiill-length DNA sequence for PRO107 1 

Ihercin designated as UNQ528 (DNA58847-1383)) (SEQ ID NO:300) and die derived protein sequence for PRO1071. 

TlK emire nucleotide sequence of UNQ528 (DNA58847-1383) is shown in Figure 119 (SEQ ID NO:300). 
Clone UNQ528 (DNA58848-1383) conuins a single open reading frame widi an apparcm translauonal initiation site 
at nucleotide positions 133-135 and ending at the stop codon at nucleotide positions 1708-1710 (Figure 119). Hie 
35 predicted polypeptide precursor is 525 amino acids long (Figure 120). The full-length PRO1071 protein shown in 
Figure 120 has an estimated molecular weight of about 58.416 daltons and a pi of about 6.62. Analysis of the fiill- 
length PROI071 sequence shown in Figure 120 (SEQ ID NO:301) evidences the presence of the following: a signal 
peptide from about amino acid 1 to about ammo acid 25. a potential N-glycosylauon site from about amino acid 251 
to about amino acid 254. a thrombospondin-l homology block from about amino acid 385 to about amino acid 399 
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and von WUIibrands factor type C homology blocks from abom anuno acid 385 toS anuno acid 399. from about 
amino acid 445 to abom amino acid 459 and from about amino acid 42 to about ammo acid 56. Clone UNQ528 
(DNA58847-1383) has been deposited with ATCC on May 20. 1998 and is assigned ATCC deposit no. 209879 

Analysis of the amino acid sequence of the full-length PRO1071 polypeptide suggests that i, possesses 
stgnancan. sequence similarity to the thiombospondin protein, thereby indicating that PROI071 may be a «^el 
thrombospondin homolog. More specifically, an analyse of the Dayhoff dauibase (version 35.45 SwissPro. 35) 
evidenced signif.can. homology between the PRO1071 amino acid sequence and the following Dayhoff sequences 
AB0Q2364_,. D67076.1. BTPCINPGN.l. CET13H10J. CEF25H8_5. CEF53B6 2. CEC26C6 6 HSSEMG / 
CEniB6_4 and BTY08561_1. 

EXAMPLE 48 : Isolation of cDNA rinn^t Pfrmli ng Humnn pi^f> |^77 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 above 
wherein the consensus sequence obtained is herein designated DNA53125. Based on the DNA53I25 consensus 
sequence, oligomicleotides were syndiesized: 1) to identify by PCR a cDNA library that comained the sequence of 
mterest. and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO1072. 

PCR primers (forward and reverse) were synthesized: 
fgrwar<l PCR priny-r 5 -CCAGGAAATGCTCCAGGAAGAGCC-3- (SEQ id NO:305) 
I6mS£.E£Ej2imjSr5 -GCCCATGACACCAAATTGAAGAGTGG-3' (SEQ ID NO:306) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA53I25 
sequence which had the following nucleotide sequence 

5--AACGCAGGGATCTrCCAGTGCCCTrACATGAAGACTGAAGATGGG-3- (SEQ ID NO:307) 

In order to screen several libraries for a source of a lull-length clone. DNA from the libraries was screened 
by PCR amplification with the PCR primer pair identified above A positive library was then used to isoUte clones 
encoding die PROI072 gene using *e probe oligonucleotide and one of the PCR primers. RNA for construction of 
the cDNA libraries was isolated from human fetal lung tissue (UB26). 

DNA sequencing of the clones isolated as described above gave die full-lengd, DNA sequence for PROI072 
(herein designated as UNQ529 (DNA58747-1384)| (SEQ ID NO:302) and die derived protein sequence for PRO1072 
nt entire nucleotide sequence of UNQ529 (DNA58747-1384) is shown in Figure 121 (SEQ ID NO-302) 
Clone UNQ529 (DNA58747-1384) contains a single open reading frame with an apparent translational initiation site 
at nucleottde positions 65-67 and ending a. the stop codon a, nucleotide positions 1073-1075 (Figure 121) TTte 
predicted polypeptide precursor is 336 amino acids long (Figure 122). The full-length PROI072 protein shown in 
F«uie 122 has an estimated molecular weight of about 36.865 daltons and a pi of about 9. 15. Analysis of die full- 
length PROI072 sequence shown in Figure 122 (SEQ ID NO:303) evidences die presence of the following- a signal 
pepude fhm. about amino acid 1 to about amino acid 2 1 . shon-chau, alcohol dehydrogenase protein homology blocks 
torn about amino acid 134 to about amino acid 144. from about ammo acid 44 to about amino acid 56 ami from about 
ammo acid 239 to about ami«, acid 248 a«l potential N-glycosylation sites from about amino acid 2 12 to about amino 
acid 215 and from about amino acid 239 to about amino acid 242. Clone UNQ529 (DNA58747.I384) has been 
deposited with ATCC on May 14. 1998 and is assigned ATCC deposit no. 209868. 
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Analysis of Oie aruno acid sequence of ihe full-lenglh PRO1072 polypeptide suggests thai it possesses 
significant sequence similarity to the reductase family of proteins, thereby indtcattng that PRO1072 may be a novel 
reductase. More specificaUy. an analysts of the Dayhoff daubase (version 35 45 SwissProt 35) evidenced significant 
homology between the PR0I(n2 ammo acid sequence and the following Dayhoff sequences. P_W03198. P_W15759. 
P_R60800. MTV037.3. CEC15HU_6. ATAC002343I4. MTV022.13. SCU43704.1. OXIR.STRAT AND 
5 CELC01G8_3. 

py AMPLE 49 - Ir"""" '^PNA riones Kncodinf HM(nai 

Aconsensos sequence was obtained relative toavarietyofEST sequences as described in Example t above. 

wherein the consensus sequence obtamed is herein designated DNA34363. Based on the DNA34363 sequence. 
10 oygonucleoddes were synthesized: 1) to identify by PGR a cDNA library that contained the sequence of .merest, and 
2) for use as probes to isolate a clone of die full-length coding sequence for PRO1075. 
PGR primers (forward and reverse) were synthesized: 
prRnrimer 5 -TGAGAGGGCTCTCTGGAAGTrG-3- (SEQ ID NO:312) 
f.^.,HPrRnrimer S -GTCAGCGATCAGTGAAAGCO' (SEQ ID NO:313) 
15 f.^.rH PPR nrimer 5 -CCAGAATGAAGTAGCTCGGC.3- (SEQ ID NO:314) 
f,^,,HPrRnrimer 5'-CCGACTCAAAATGCATTGTC-3- (SEQ ID NO:315) 
^.^r. PrR nrimer 5 .CATITGGCAGGAATrGTCC-3- (SEQ ID NO:316) 
f,^».prR nrimer 5'-GGTGCTATAGGCCAAGGG-3- (SEQ ID NO:317) 
r,v>rc.PrR nrimer 5--CTCTATCTGTGGGGTATGTCAGAG-3- (SEQ ID NO:318) 
20 ,,,sIS.^CRJda.fiI5^C^AGATATAATGGCACA^^^^^^ (SEQ ID N0:319) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA34363 
sequence which had the following nucleotide sequence 
tiy]^ririi7^tion probe 

5 .CGTGTrCCTATGCTTACCCGACCTCAGATGCTCCCTrCTGCTCCTG-3- (SEQ ID NO:320) 
25 In order to screen several Ubraries for a source of a full-length clone. DNA from the librar.es was screened 

by PGR amplification witf. the PGR primer pair idemified above. A positive library was then used to isolate clones 
e«:oding the PRO107S gene using the probe oligonucleotide and one of the PGR primers. RNA for constructton of 
the cDNA Hbraries was isolated from human skin tumor tissue (UB324). 

DNA sequencing of the clones isolated as described above gave die full-length DNA sequence for PRO1075 
30 therein designated as UNQ532 (DNA57689-1385)] (SEQ ID NO:308) and die derived prote.n sequence for PRO1075. 

TlK entire nucleotide sequence of UNQ532 (DNA57689-1385) is shown in Figure 124 (SEQ ID NO:308). 
Clone UNQ532 (DNA57689-1385) conta.ns a single open reading frame witi, an apparem uanslauomd initiation site 
a, nucleotide positions 137-139 and ending at the stop codon at nucleotide positions 1355-1357 (Figure 124). The 
predicted polypeptide precursor is 406 amino acids long (Figure 125). The full-length PRO1075 protein shown in 
35 Fi8urel25hasanestimatedmolecularweightofabou.46.927daltonsandaplofabout5.21. Analysis of the full- 
length PRO1075 sequence shown in Figure 125 (SEQ ID NO:309) evidences the presence of the foUowmg: a s.gnal 
peptide from about amino acid 1 to about amino add 29. an endoplasmic reticulum targeting sequence from about 
aatino add 403 to about amino acid 406. a tyrosine kinase phosphorylation site from about ami«, acid 203 to about 
an^ add 21 1 and a sequence block having homology to tiie thioredoxin family of proteins from about amtm, ac.d 
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50u,aboutanunoacid66. Clo« UNQ532 (DNA57689-1385) has been depos.tcd wiU. ATCC on May .4. 1998 and 
IS assigned ATCC deposit no. 209869. 

Analysis of U« amino acid sequence of fuli-leng* PRO1075 polypeptide suggest d«. it possesses 
s^mfican. sequence sinularity .0 r>ro^ disulfide isomerase. .hereby indicating tha, PRO1075 au.y be a novel protein 
disulfide ison^rase. More specifically, an analysis of ti,e Dayhoff daubase (version 35.45 SwissPro. 35, evidenced 



10 A cDNA sequence isolated in U)e amylase screen described in Example 2 above was found, by BUVST a«l 

Fa«A sequence aHgnmen.. ,0 have sequence homology u, a nucleodde sequence encoding the comichon protein This 
CDNA sequence is herein designated DNA13242 (Figure 130; SEQ ID N0.323). Based on the sequence homolog,- 
ohgonudeotide probes were generated from dte sequence of the DNA13242 molecule and used .0 screen a human 
I.«nu(UB89)librarypreparedasdescr^^^ Tl« cloning vector was pRK5B 

(PRK5B .s a precursor of pRK5D that does not contain the Sfil site. see. Holmes ec al.. S^, 252.,278-,280 
(1991)). and the cDNA size cut was less than 2800 bp. 
The oligonucleotide probes employed included: 

fomard PCR primfj 5*-gtgcagcagagtggcttaca-3- (SEQ id no:326) 

rSvgKC PCR rrimqi 5 -ACTGGACCAATrCTTCTGTG-3' (SEQ ID NO:327) 
20 hvbridi7jtinn pf^frf 

5'-GATATrCTAGCATATTGTCAGAAGGAAGGATGGTGCAAATTAGCT-3' (SEQ ID NO-328) 

A full length clone was identified that contained a single open reading frame with an apparem translational 
mitaaon site a. nucleotide positions 14-16 and e«ling a. d,e stop codon found at nucleotide positions 446^8 (Figure 
128; SEQ ID NO:321). TTk. predicted polypeptide precursor is 144 amino acids long, has a calculated molecular 
w«gh. of appmximately 16.699 daltons and anesdmated pi of approximately 5.6. Analysis of ti,e Wl-lengti, PR0181 
sequenceshowninFigurc .29 (SEQ ID NO:322) evidences the presence of the following: a signal peptide from about 
annuo acid 1 «o about amino acid 20. a putative type II transmembrane domair. from about amino acid 11 .0 about 
ammo actd 31 and oti^er uansmembrane domains from about amino acid 57 .0 about amino acid 77 and from about 

^^12,^.^^;^^^,,,, ^'-"NQ,55(DNA23330..390) has been deposited witi, ATCC on April 
14, 1998 and is assigned ATCC deposit no. 209775. 

Analysis of die amino acid sequence of ti,e foll-lengti, PR0181 polypeptide suggests that it possesses 
significant sequence similarity to die comichon protein, thereby indicating that PR0181 may be a novel comichon 
bomolog. More specifically, an analysis of the Dayhoff database (vers.on 35.45 SwissProt 35) evidenced stgnifican. 
^mology between tite PR0.8. amino acid sequence and ti,e following Dayhoff sequences. ARC2811 I 
35 CEn)9E8_3. S64058. YGF4,YEAST. YB60_YEAST. EBU89455J. SIU36383_3 and PH1371. 

EXAMPIR U : feOlarion of rPNA rinn^, P^^^ ^ H..n.»n PPn|Q< 

A CDNA sequence was isolated in fte amylase screen described in Example 2 above ami is herem 
designated DNA13.99 (Figure 134; SEQ ID NO:332,. TT^ DNA13199 sequence was then compa«d .0 a variety 
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of expressed sequence cag (EST) databases which included public EST databases (e.g . GenBank) to identify exisnng 
homologies, m homology search was perfomed using the computer program BLAST or BLAST2 (Alishul et al.. 
K^MhnH ^ in Rnrvmoloev 2^:460480 (1996)). "niosc comparisons resulting in a BLAST score of 70 (or in some 
cases 90) or greater that did not encode known protems were clustered and assembled mto consensus DNA sequences 
with the program "phrap" (Phil Green. University of Washington. Seattle. Washington: 
5 http://bozeman.mbi.washington.edu/phrap.docs/phrap.html). The consensus sequence obtamed therefrom is herein 

designated as DNA22778. 

Based on the DNA22778 sequence, oligonucleotide probes were generated and used to screen a human 
placenui Ubrary (LIB89) prepared as described m paragraph 1 of Example 2 above. The clomng vector was pRK5B 
(pRK5B is a precursor of pRK5D thai does not contain the Sfil site; see. Holmes ei al.. SsiSDEE. 252:1278-1280 
10 (1991)). and the cDNA size cut was less than 2800 bp. 

PCR primers (forward and reverse) were synthesized: 
{aOjaid^CRjziimSI 5 -ACAAGCTGAGCTGCTGTGACAG-3- (SEQ ID NO:333) 
r ^v ^r.. PCR nrimer 5 -TGATrCTGGCAACCAAGATGGC-3' (SEQ ID NO:334) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA22778 sequence which 
15 had the following nudeoiidc sequence 
hlW""'"" nrohe 

5'-ATGGCCTrGGCCGGAGGTTCGGGGACCGCTTCGGCTGAAG-3" (SEQ ID NO:335) 

In order to screen several libraries for a source of a ftilHength clone. DNA from the libraries was screened 
by PCR amplilicaaon with the PCR primer pair identified above. A positive library was then used to isolate clones 
20 encoding the PR0195 gene using the probe oligonucleotide and one of the PCR primers. 

A fiill length clone was identified that contained a single open reading frame with an apparent translational 
initiation site at nucleotide positions 70-72 and ending at the stop codon found at nucleotide positions 1039-1041 
(Figure 132; SEQ ID NO:330). The predicted polypeptide precursor is 323 amino acids long, has a calculated 
molecular weight of approximately 36.223 daltons and an estimated pi of approximately 5 .06. AnalysB of the f\ill- 
25 length PROI95 sequence shown in Figure 132 (SEQ ID NO:330) evidences the presence of the following: a signal 
peptide from about amino acid 1 to about amino acid 31. a transmembrane domain from about amino acid 241 to 
about amino acid 260 and a potential N-glycosylat.on site from about amino acid 90 to about amino acid 93. Clone 
UNQ169 (DNA26847-1395) has been deposited with ATCC on April 14. 1998 and is assigned ATCC deposit no. 
209772. 

30 Analysis of the amino acid sequence of the full-length PR0195 polypeptide suggests that it possesses no 

significant sequence similarity to any known protein. However, an analysis of the Dayhoff database (version 35.45 
SwissProt 35) evidenced some degree of homology between the PROI95 amino acid sequence and the following 
Dayhoff sequences. P_P91380. AF035118_1. HUMTROPCS.l. NUOD.SALTY and E70002. 

35 T-yAMPLE S2 - Mmr\ ')f '•"NA Tl""*^ Fncndinp Human PR086S 

A CDNA sequence isolated in the amylase screen described in Example 2 above was herein designated 
DNA37642 (Figure 137. SEQ ID NO:338). The DNA37642 sequence was then compared to a variety of expressed 
sequence tag (EST) databases which included public EST databases (e.g.. GenBank) and a proprietary EST DNA 
database (UFESEQ™. Incyte Fharmaccuncals. Palo Alto. CA) to identify homologies therebetween. TTic homology 
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search was perfonned using *e computer program BLAST or BUST2 (AUshul « al.. Method, in Pn^ v ^ .,,^. 

M:4«M80 (1996)). Those compansons resulting m a BLAST score of 70 ,or .n some cases 90) or greater that did 

not encode known proteins were clustered and assembled into consensus DNA sequences with the program "phrap ' 

(Phil Green. University of Washington, Seattle. Washington; 

http://bo2eman.mbt.washington.edu/phrap.docs/phrap.hmd). The consensus sequence obtained is herein designated 
ONA486I5. 

Based on the DNA48615 consensus sequence, probes were generated and used to screen a human fetal 
kidney (LIB227) library prepared as described in paragraph 1 of Example 2 above. Tire cloning vector was pRK5B 
(PRK5B is a precursor of pRK5D that does no, contain the Sfil site: see. Holmes et al.. Ssism. 252.1278-1280 
(1991)). and the cDNA size cut was less than 2800 bp. 

PCR primers (forward and reverse) were synthesized: 

fonvard PCR pnmfr I 5--aagctgccggagctgcaatg-3- (Seq id no:339) 

fWwardPCRprilKr? 5'-TTGCTTCTTAATCCTGAGCGC-3- (SEQ id NO:340) 

forwani PCR prinrr , ! 5'-aaaggaggactttcgactgc-3' (seq id no:341) 

reverse PCR primer 1 5 -AGAGATTCATCCACTGCTCCAAGTCG-3' (SEQ id N0:342) 

reverse PCR rnmcr? 5'-tgtccagaaacaggcacatatcagc-3- (Seq id no:343) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA48615 
sequence which had the following nucleotide sequence 

hvbridi?atinn prohe. 

5--AGACAGCGGCACAGAGGTGC1TCTGCCAGGTTAGTGGTTACTTGGATGAT-3' (SEQ ID N0:344) 

In order to screen several Ubraries for a source of a ftill-lengd, clone. DNA from the libraries was screened 
by PCR amplification with d« PCR primer pairs identified above. A positive library was then used to isolate clones 
encoding the PR0865 gene using the probe oligonucleotide and one of the PCR primers. 

A full length clone was identified diat contained a single open reading frame with an apparent translational 
initiation site at mrcleotide positions 173-175 and ending at the stop codon found a. nucleotide positions 1577-1579 
(Figure 135: SEQ ID NO:336). The predicted polypeptide precursor is 468 amino acids long, has a calculated 
molecular weight of approximately 54.393 daltons and an estimated pi of approximately 5.63. Analysis of the full- 
tengih PR0865 sequence shown in Figure 136 (SEQ ID NO:337) evidences the presence of the following: a signal 
peptide from about amino acid I to about amino acid 23. potential N-glycosylation sites from about amino acid 280 
to about amino acid 283 and from about amino acid 384 to about amino acid 387. a potential amidauon site from 
about amino acid 94 to about amino acid 97. glycosaminoglycan attachmem sites from about amino acid 20 to about 
amino acid 23 and from about amino acid 223 to about amino acid 226. an aminotransferase class-V pyridoxyl- 
phosphaie amino acid sequence block from about amino acid 216 to about ammo acid 222 and an amino acid sequence 
block similar to Uiat found in the inierleukin-7 protein from about amino acid 338 to about amino acid 343. Clone 
UNQ434 (DNA53974-1401) has been deposited with ATCC on April 14. 1998 and is assigned ATCC deposit no 



35 209774 



Analysis of the amino acid sequence of the luIHength PR0865 polypeptide suggests that it possesses no 
significam sequence similarity to any known protein. However, an analysis of the Dayhoff database (version 35.45 
SwissProt 35) evidenced some degree of homology between the PR0865 amino acid sequence and the following 
Dayhoff sequences. YMNO_YEAST. ATFCA4_43. S44168. P_W14549 and RABTX:RG4_1. 
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F> ;aMPLE 53 . i.=ni=..i,^n of cDNA Clones Encodino Human PRQ827 

A cDNA secjuence isolated in ihe amj-lase screen described in Example 2 above was found, by BLAST and 

FastA sequence alignment, to have sequence homology to nucleotide sequences encodmg various inicgrm proteins 

nus cDNA sequence is herein designated DNA47751 (see Figure 140: SEQ ID NO:347>. Based on the sequence 

homology, probes v/ere generated from the sequence of the DNA47751 molecule and used to screen a human fetal 
5 pigment cpithcUum library (UBl 13) prepared as described in paragraph 1 of Example 2 above The cloning vector 

was pRK5B (pRK5B is a precursor of pRK5D that does not conuin the Sfil site; see. Holmes et al . SsiSDCfi. 

252:1278-1280 (1991)), and the cDNA size cut was less than 2800 bp. 
PCR primers (forward and reverse) were synthesized: 

f^ p.,ard PCR primer 5'-AGGGACAGAGGCCAGAGGACTTC-3- (SEQ ID NO:348) 
10 ££V£rSL££Ejmm£l5'-CAGGTGCATATrCACAGCAGGATG-3' (SEQ ID NO:349) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA47751 

sequence which had the following nucleotide sequence 

fiytrridization orobc 

5'-GGAACTCCCCTTCGTCACTCACCTGTrCTrGCCCCTGGTGTrCCT-3' (SEQ ID 
15 NO:350) 

In order to screen several libraries for a source of a full-length clone. DNA from the libraries was screened 
by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones 
encoding the PR0827 gene using the probe oligonucleotide and one of the PCR primers . 

A full length clone was identified that contained a single open reading frame with an apparem translational 
20 initiation site at nucleotide positions 134-136 and ending at the slop codon found at nucleotide positions 506-508 
(Figure 138; SEQ ID NO:345). The predicted polypeptide precursor is 124 amino acids long, has a calculated 
molecular weight of approximately 13.352 daltons and an estimated pi of approximately 5.99. Analysis of the fiill- 
Icngdi PR0827 sequence shown in Figure 139 (SEQ ID NO:346) evidences the presence of the following: a signal 
peptide from about amino acid 1 to about amino acid 22. a cell attachment sequence from about amino acid 70 to 
25 about amino acid 72. a potential N-glycosylation site from about amino acid 98 lo about amino acid 101 and an 
imegrin alpha cham protein homology sequence from about amino acid 67 to about amino acid 81 . Clone UNQ468 
(DNA57039-1402) has been deposited with ATCC on April 14, 1998 and is assigned ATCC deposit no. 209777. 

Analysis of the amino acid sequence of the lull-length PR0827 polypeptide suggests that it possesses 
significara sequence simDarity to the VLA-2 iniegrin protein and various other imegrin proiems. thereby indicaung 
30 that PR0827 may be a novel integrin or splice variant thereof. More specifically, an analysis of the Dayhoff database 
(version 35.45 SwissProt 35) evidenced significant homology between the PRO240 amino acid sequence and (he 
following Dayhoff seque«:es. S44142. ITA2_HUMAN. rTAl.RAT. ITAl.HUMAN. ITA4.HUMAN. 
ITA9_HUMAN. AF032108_1. ITAM.MOUSE. ITAS.CHICK and ITA6_CH1CK. 

35 FXAMPLE 54 : ^^oiMj nn of rPNA rioncs EncodiTiP H»iman PROllH 

A cDNA sequence isolated in the amylase screen described in Example 2 was found, by the WU-BLAST2 
sequence alignmem computer program, to have certain sequence idemity to other known interferon receptors. This 
CDNA sequence is herein designated DNA48466 (Figure 143: SEQ ID NO:352). Based on the sequence idennty. 
probes were generated from the sequence of the DNA48466 molecule and used to screen a human breast carconoma 
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libraiy (LIB135) prepared as described in paragraph 1 of Example 2 above. The cloiung vector was pRK5B (pRK5B 
IS a precursor of pRKSD (hat does iwt contaitJ the Sfil site; see. Holmes et al.. SsjsnSfi. 252: 1278-1280 (1991),. and 
the cDNA size cm was less than 2800 bp 

The oligonucleotide probes employed were as follows: 
forwarf PCR Primrr 5 -AGGCTTCGCTGCGACTAGACCTC-3' (SEQ id N0:354) 

revCBt PCR primfr 5"-CCAGGTCGGGTAAGGATGGTTGAC-3- fSEQ id N0:355) 
hybridization prntw 

5'-TTTCTACGCATTGATTCCATGTTrGCTCACAGATCAAGTCGCCATrCTGC-3' (SEQ ID NO:356) 

A fiill length done was .dentif.ed that contained a single open reading frame with an apparent translational 
initiation site at nucleotide positions 250-252. and a stop signal at nucleotide pos.ttons 1 183-1 185 (Figure 141 . SEQ 
ID NO:351). The predicted polypeptide precursor is 31! amino actds long, has a calculated molecular weight of 
approximately 35.076 daltons and an estimated pi of approximately 5.04. Analysis of the Ml-length PROll 14 
interferon receptor sequence shown in Figure 142 (SEQ ID NO:352) evidences the presence of the following: a signal 
peptide from about amino acid 1 to about amino acid 29. a transmembrane domam from about amino acid 230 to 
about amino acid 255. potential N-glycosylation sues from about amino acid 40 to about amino acid 43 and from 
about amino acid 134 to about amino acd 137. an amino acid sequence block having homology to tissue factor 
proteins from about amino acid 92 to about amino acid 119 and an amino acid sequence block having homology to 
iniegrin alpha chain proteins from about amino add 232 to about amino acid 262. Clone UNQ557 (DNA57033-1403) 
has been deposited with ATCC on May 27. 1998 and is assigned ATCC deposit no. 209905. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35). using a WU-BLAST2 sequence alignmem 
analysis of the fuU-length sequence shown in Figure 142 (SEQ ID NO:352). evidenced significant homology between 
the PROll 14 interferon receptor amino acid sequence and the following Dayhoff sequences: G0I418 
mRl_MOUSE. P_R71035. INGS_HUMAN, A26595_l. A26593_l. 156215 and TF_HUMAN. 

EXAMPLF 55 : Isolation nf rTi^fl , f Ipnes RnrnHm,, ^ lumnn PRn?17 

A consensus sequence was obtained reladve to a vaiiety of EST sequences as described in Example 1 above, 
wherein the consensus sequence obtained is herein designated DNA30905. Based on the DNA30905 consensus 
sequence. oligoraKJeotides were synthesized: I) to identify by PCR a cDNA library that contained the sequence of 
interest, and 2) for use as probes to isolate a clone of the fiill-length coding sequence for PR0237. 
PCR primers (forward and reverse) were synthesized: 
30 fcnvard PCR nriinfr 5'-TCTGCTGAGGTGCAGCTCA1TCAC-3' (SEQ ID N0:359) 
revcrsg PCR ffrimr r 5 -GAGGCTCTGGAAGATCTGAGATGG-3' (SEQ ID NO:360) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA30905 
sequence which had the following nucleotide sequence 

hvbridiTarinn prp^^f 

35 5'.GCCTCnTGTCAACGTTGCCAGTACCTCTAACCCATrCCTCAGTCGCCTC-3' (SEQ ID NO:36l) 

In order to screen several libraries for a source of a full-length clone. DNA from the libraries was screened 
by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones 
encoding the PR0237 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of 
the cDNA libraries was isoiaied from human fetal brain tissue (UB153) 
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DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR0237 
therein designated as UNQ21 1 (DNA34353-1428)1 (SEQ ID NO:357) and ihc derived protein sequence for PR0237. 

TTie entire nucleotide sequence of UNQ21 1 (DNA34353-1428) is shown in Figure 144 (SEQ ID NO:357) 
Clone UNQ21 1 (DNA34353-1428) contains a single open reading frame with an apparent translational initiation site 
at nucleotide positions 586-588 and ending at the stop codon at nucleotide positions 1570-1572 (Figure 144). TTie 
predicted polypeptide precursor is 328 amino acids long (Figure 145). The full-Iengih PR0237 protem shown in 
Figure 145 has an estimated molecular weight of about 36.238 daltons and a pi of about 9.90. Analysis of the full- 
icnglh PR0237 sequence shown in Figure 145 (SEQ ID NO:358) evidences the presence of the followmg: a signal 
peptide from about amino acid 1 to about amino acid 23. a transmembrane domain from about ammo acid 177 to 
about amino add 199. potential N-glycosylation sites from about amino acid 1 18 to about amino acid 121 , from about 
amino acid 170 to about ammo acid 173 and from about amino acid 260 to about amino acid 263 and eukaryotic-type 
carbonic anhydrase sequence homology blocks from about amino acid 222 to about amino acid 270. from about amino 
acid 128 to about amino acid 164 and from about amino acid 45 to about amino acid 92. Clone IJNQ211 
(DNA34353-1428) has been deposited with ATCC on May 12. 1998 and is assigned ATCC deposit no. 209855. 

Analysis of the amino acid sequence of the full-length PR0237 polypeptide suggests that it possesses 
significam sequence similarity to the carbonic anhydrase prouin. More specifically, an analysis of the Dayhoff 
database (version 35.45 SwissProt 35) evidenced significant homology between the PR0237 amino acid sequence and 
the following Dayhoff sequences. AF050I06.1. OACALP.l. CELD1022.8. CAH2_HUMAN. ICAC. 
CAH5_HUMAN. CAHP.HUMAN. CAH3_HUMAN. CAH1_HUMAN and 2CAB. 

20 BYAMPt.E 56 - iirlmim ^"na rmnrs Fnrnding H^iiTian PRQ?41 

Aconsensus sequence was obtainedrelativetoavarietyofEST sequences as described in Example 1 above. 

wherein the consensus sequence obtained is herein designated DNA42259. Based on the DNA42259 consensus 
sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that conu^ined the sequence of 
interest, and 2) for use as probes to isolate a clone of the ftill-length coding sequence for PR0541. 
25 PCR primers (forward and reverse) were synthesized: 

f.^.,. prR nrimer 5 -GGACAGAATTTGGGAGCACACTGG-3- (SEQ ID NO:364) 
fnn..rd PCR orimer 5 -CCAAGAGTATACTGTCCTCG-3- (SEQ ID NO:365) 

Pr« nrimer 5 -AGCACAGATnTCTCTACAGCCCCC-3- (SEQ ID NO:366) 

PCM nrimer 5--AACCACTCCAGCATGTACTGCTGC-3' (SEQ ID NO:367) 

30 Additionally, a synthetic oHgomKleotide hybridization probe was constructed from the consensus DNA42259 
sequence which had the following nucleotide sequence 
hylrrifli^aiignpretK 

5--CCATrCAGGTGTrCTGGCCCTGTATGTACACATTATACACAGGTCGTGTG-3- (SEQ ID NO:368) 

Inorder ,0 screen several Ubraries for a source of a full-length clone. DNA from the libraries was screened 
35 by PCR amplificadon with one of the PCR primer pairs identified above. A positive library was then used to isolate 
clones encoding the PR0541 gene using the probe oligonucleotide and one of the PCR primers. RNA for 
construction of the cDNA libraries was isolated from human fetal kidney tissue (L1B227). 

DNA sequencing of the clones isolated as described above gave the fiill-length DNA sequence for PR0541 
[herein designated as UNQ342 (DNA454I7-I432)1 (SEQ ID NG:362) and the derived protein sequence for PR0541 . 
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•nie entire nucleotide sequence of UNQ342 fDNA45417-1432) is shown in Fieure 146 (SEQ ID NO-362) 
Cioo. UNQ342 (DNA454,7-1432) con..ns a s.gle open readu.g an apparent translauo^U in.«a.ion site 

at nucleotide positions 469-471 and ending a. the stop codon a. nucleotide positions 1969-197) (Figure 146) •n,e 
predicted polypepUde precursor is 500 amino acids long (Figure 147). m ftill-lengm PR0541 pro.em shown in 
Figure 147 has an estimated tnolecular weight of about 56.888 daltons and a pi of about 8.53. Analysis of the ftall- 
length PR0541 sequence shown i„ Figure 147 (SEQ ID NO:363) evidences the presence of the following, a signal 
peptide from about amino acid 1 to about amino acid 20. amino acid sequence bloc.cs having homology to 
extracellular proteins SCP/rpx-l/Ag5/PR-l/Sc7 from about amino acid ,65 to about amino acid 186. from about 
ammo ac.d 196 ,o about ammo acid 218. from abou, ammo acid 134 to about ammo acid ,46. from about amino acid 
96 to about amino acid 108 and from about am.o acid 58 to abou, amino acid 77 and a potential N-glycosyla.ion site 
from about ammo acid 28 to about amino acid 3 1 Clone UNQ342 (DNA45417-1432) has been deposited with ATCC 
on May 27, 1998 and is assigned ATCC deposit no. 209910. 

Analysis of the amino acid sequence of the full-length PR0541 polypeptide suggests that it possesses 
significant sequence similarity to a trypsin inhibitor protein, thereby indicating that PR054 , may be a novel trypsin 
mhtbitor. More specifically, an analysis of the Dayhoff database (version 35.45 SwissPro. 35) evidenced significant 
homologybe,wecn.hePROS4,aminoacidsequenceandthefoIlowingDayhoff sequences. D45027 1 AB009609 I 
JC5308. CRS3_H0RSE. irXLHUMAN. HElX)_HELHO. GEN,435I. A28„2_l. CEmSAloJand P_Wll4i5.' 

EXAMPI F 57 . I solation of rDIMA ri«n^, ^irmi m Human PRffy^i 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example , above 
wherein the consensus sequence obtained is herein designated DNA36465. Based on the DNA36465 consensus' 
sequence, oligonucleotides were synthesized: i) to identify by PCR a cDNA library that contained the sequence of 
interest, and 2) for use as probes to isolate a clone of the liill-length coding sequence for PR0273. 

A pair of PGR primers (forward and reverse) were synthesized; 

foryart PCR prima 5'-cagcgccctccccatgtccctg-3' (seq id no:371) 

is yyrjjff PCR primi 5'-TCCCAACTGGTTTCGAGmTCCC-3' (SEQ ,D NO:372) 

Addi.io«illy. a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA36465 
sequence which had the foltowing nucleotide sequence 
hvbridizatinn py^f^ 

5 -CTCCCGTCAGCATGAGGCTCCTGGCGGCCGCTGCTCCTGCTGCTG.3- (SEQ ID NO:373) 

In order to screen several libraries for a source of a full-length clone. DNA fR,m Ac libraries was screened 
by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones 
encodmg the PR0273 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of 
the cDNA libraries was isolated from human fetal kidney tissue. 

DNA sequencing of the clones isolated as described above gave the full-lengti, DNA sequence for PR0273 
(herein designated as UNQ240 {DNA39523-1 192)] (SEQ ID NO:369) and the derived protem sequence for PR0273. 

•He emiie nucleotide sequence of UNQ240 (DNA39523.1 192) is shown in Figure 148 (SEQ ,D N0 369) 
Ckme UNQ240 (DNA39S23-1 192) contains a single open reading frame with an apparem translauonal initiation site 
at nucleotide positions 167-169 and ending at the stop codon at nucleotide positions 500-502 (Figure 148). The 
predicted polypeptide precursor is 1 1 1 amino acids long (Figure 149). Clone UNQ240 (DNA39523-1 192) has been 
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deposited with the ATCC. It is understood that the deposited clone conuins the actual sequence and that the 
sequences provided herein are merely representative based on current sequencing techniques. Moreover, given ihe 
sequences provided herein and knowledge of the universal genetic code, the corresponding nucleotides for any given 
amino acid can be routinely identified and vice versa. 

Analysis of the amino acid sequence of the full-lengih PR0273 polypeptide suggests that ponions of it 
5 possess sequence identity with human macrophage inflammatory protetn-2, cytokme-induced neutrophil 
chemoanractant 2. and neutrophil chemouctic factor 2-bcta. diereby indicating that PR0273 is a novel chemokine 
As discussed further below, the cDNA was subcloned into a baculovirus vector and expressed in insect cells 
as a C-ienninally tagged IgG fusion protem. N-terminal sequencmg of the resultant protein identified the signal 
sequence cleavage sue. yielding a mature polypeptide of 77 amino acids The mature sequence, showing 31^% 
10 identity to other human CXC chemokines. includes the four canomcal cysteine residues but lacks the ELR motif. 
Northern analysis demonstrates expression at least m the small intestine, colon, spleen, lymph node and bdney. By 
in situ hybridization, also described in detail below. mRNA is localized to the lamina propria of intestmal villi and 
to renal tubules. 



15 PVAMPl .F, 58 - Isolation nf cDNA Clones Enwtlinf HMir^n PR07Q1 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 abtwe, 
wherein the consensus sequence obtained is herein designated DNA39848. Based on the DNA39848 consensus 
sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of 
interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO701. 

20 A pair of PCR primers (forward and reverse) were synthesized: 

f^r^rHPTRnrimer S'-GGCAAGCTACGGAAACGTCATCGTO-S (SEQ ID NO:376) 

..v.r«.PrR nrimer S'-AACCCCCGAGCCAAAAGATGGTCACO' (SEQ ID NO:377) 

Additionally, a synthetic oligonucleotide hybridizauon probe was constmctcd from the consensus DNA39848 

sequence which had die following nucleotide sequence: 

25 tivhrir1i7atinn orobe 

5 -GTACCGGTGACCAGGCAGCAAAAGGCAACTATGGGCTCCTGGATCAG-3' (SEQ ID NO:378). 

In Older to screen several Bbraries for a source of a lull-length done. DNA firom the libraries was screened 
by PCR ampUficaoon with the PCR primer pair identified above. A positive library was then used to isolate clones 
encoding the PRO701 gene using die probe oligonucleotide and one of the PCR primers. RNA for construcnon of 
30 the cDNA libraries was isolated from human fetal kidney tissue (L1B227) 

DNA sequencing of the ctones isolated as described above gave the ftill-lengUi DNA sequence for PRO701 
iterein designated as UNQ365 (DNA44205-I285)) (SEQ ID NO:374) and die derived protein sequence for PRO701 . 

■n« entire nucleotide sequence of UNQ365 (DNA4420S-1285> is shown in Figure 150 (SEQ ID NO:374) 
Clone UNQ365 (DNA44205-I285) contains a single open reading frame witfi an apparem translational initiation site 
35 at nucleotide positions 50-52 and ending at the stop codon at nucleotide positions 2498-3000 (Figure 150). The 
predicted polypeptide precursor is 816 amino acids long (Figure 151). The full-length PRO701 protein shown in 
Figure 151 has an estimated molecular weight of about 91.794 dalions. a pi of about 5.88 and NX(S/D bcmg 4 
Oone UNQ365 pNA44205-128S) has been deposited widi the ATCC on March 31. 1998. It is understood that the 
clone was the conect and acnial sequence, wherein the sequences provided herein are representauve based on 
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sequencing lectuiiques. 

Soil regarding ihe amino acid sequence shown in Figure 151. there is a poteniial signal pepude cleavage site 
at about amino acid 25. There are poteniial N-glycosyladon sites at about ammo acid positions 83. 51 1. 716 and 803. 
The carboxylcstcrases lype-B signature 2 sequence is at about residues 125 to 135. Regions homologous with 
carboxylestcrase type-B are also at about residues 54-74. 197-212 and 221-261. A potential transmembrane region 
corresponds approximately to amino acids 671 through about 700. The corresponding nucleic acids can be routinely 
determined from the sequences provided herein. 

Analysis of the amino acid sequence of the full-length PRO701 polypeptide suggests that it possess 
significant homology to the neuroligins from raiius nor%'egicus indicating that PRO701 may be a novel human 
neuroligin. 

EXAMPLF, f^9 : tSQlation of cDNA Clones Rn c odin^ Hnn^ a p f f^mns 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 above, 
wherein the consensus sequence obtained is herem designated DNA43033. Based on the DNA43033 consensus 
sequence, oligonucleotides were syndiesized: 1) to identify by PCR a cDNA library that comamcd the sequence of 
interest, and 2) for use as probes to isolate a clone of die full-length coding sequence for PRO704. 

A pair of PCR primers (forward and reverse) were syndiesized: 

fprward PCR primer 5 •-ccttgggtcgtggcagcagtgg-3 ■ (SEQ id no:38 D; 

reverse PCR primer 5*-CACTCTCCAGGCTGCATGCTCAGG-3- (SEQ id N0:382). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA43033 consensus 

sequence which had the following nucleotide sequence; 

hybridization p^h^ 

5'-GTCAAACGTrCGAGTACTTGAAACGGGAGCACTCGCTGTCGAAGC-3' (SEQ ID NO:383). 

In order to screen several Ubrarics for a source of a full-lengdi clone. DNA from the libraries was screened 
by PCR ampUlfcarion with the PCR primer pair identified above. A positive library was then used to isolate clones 
encoding die PRO704 gene using the probe oligonucleonde and one of die PCR primers. RNA for constniction of 
the cDNA libraries was isolated from human fetal kidney tissue (UB227). 

DNA sequencing of the clones isolated as described above gave die fiilUcngih DNA sequence for PRO704 
Iherein designated as UNQ368 (DNA5091 1-1288)] (SEQ ID NO:379) and die derived protein sequence for PRO704. 

TTie cnrire nucleotide sequence of UNQ368 (DNA5091 1-1288) is shown in Figure 152 (SEQ ID NO:379). 
Clone UNQ368 (DNA5091 1-1288) contains a single open reading frame with an apparent translaiional initiation site 
at nucleodde positions 8-10 and ending at the stop codon at nucleotide positions I052-I054 (Figure 152). TTie 
predicted polypeptide precursor is 348 amino acids long (Figure 153). The full-length PRO704 protein shown in 
Figure 153 has an estimated molecular weight of about 39.71 1 and a pi of about 8.7. Clone UNQ368 (DNA5091 1- 
1288) has been deposited with the ATCC on March 31, 1998. Regarding the sequence, it is understood diat the 
deposited clone contains the correct sequence, and die sequences provided herein are based on known sequencing 
techniques. 

Analysis of the amino acid sequence of the fiill-lengdi PRO704 polypeptide suggests that poruons of h 
possess significani homology to the vesicular integral membrane protein 36. thereby indicating that PR0704 may be 
a novel vesicular integral membrane protein. 
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Still analyzing the amino acid sequence of SEQ ID NO:380. the putative signal peptide is at about anuno 
acids 1-39 of SEQ ID NO:380. The transmembrane domain is at amino acids 310-335 of SEQ ID NO:380. A 
potential N-glycosylation site is at about amino acids 180-183 of SEQ ID NO:380 The corresponding nucleotides 
can be routinely determined given the sequences provided herein. 

5 F XAMPI E 60 : Unlatinn of cDN A rinnes Encoding Human PRQ706 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 above, 
wherein the consensus sequence obtained is herein designated DNA40669. Based on the DNA40669 consensus 
sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of 
interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO706. 

10 A pair of PCR primers (forward and reverse) were synthesized: 

f^ ru;;.rdPrR Primer 5'-CCAAGCAGCTTAGAGCTCCAGACC-3' (SEQ ID NO:386) 
ri ^v^rsg PCR primer 5'.TTCCCTATGCTCTGTATTGGCATGG-3' (SEQ ID NO:387> 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA40669 
sequence which had the following nucleotide sequence 
15 ^Yt)pdi7■aIion probe 

5'.GCCACTTCTGCCACAATGTCAGCnTCCCTGTACCAGAAATGGCTGTGTT.3' (SEQ ID NO:388) 

In order to screen several libraries for a source of a full-length clone, DNA from die libraries was screened 
by PCR amplification with the PCR primer pair identified above. A positive library was dien used to isolate clones 
20 encodmg the PRO706 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of 
the cDNA libraries was isolated from human fetal brain tissue (LIB153). 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PRO706 
[herein designated as UNQ370 (DNA48329- 1290)1 {SEQ ID NO:384) and the derived protein sequence for PRO706. 
It IS understood diat the deposited clone contains the acnial sequence, and that the sequences provided herein arc 
25 representative based on current sequencing techniques. 

The entire nucleotide sequence of UNQ370 (DNA48329-1290) is shown in Figure 154 (SEQ ID NO:384). 
Gone UNQ370 03NA48329-1290) contains a single open reading frame witii an apparent translational initiation site 
at nucleotide positions 279-281 and ending at the stop codon at nucleotide positions 1719-1721 (Figure 154). Hie 
predicted polypeptide precursor is 480 amino acids long (Figure 155). The ftiU-lcngth PRO706 protein shown in 
30 Figure 155 has an estimated molecular weight of about 55,239 daltons and a pi of about 9.30. Clone UNQ370 
(DNA48329-1290) has been deposited widi the ATCC on April 21, 1998. 

StiU regarding the amino acid sequence shown in Figure 155, tiicre is a potential signal peptide cleavage site 
at about amino acid 19. There are potential N-glycosylation sites at about amino acid positions 305 and 354. There 
is a potential tyrosine kinase phosphorylation site at about amino acid position 333, A region homologous with 
35 histidinc acid phosphatase is at about residues 87-102. The corresponding nucleic acid regions can be routinely 
determined given the provided sequences, i.e., the codons can be determined from the specifically named amino acids 
given. 

Analysis of die amino acid sequence of the full-length PRO706 polypeptide suggests tiiat portions of it 
possess significam homology to the human prostatic acid phosphatase precursor thereby indicating that PRO706 may 

233 



<W0 9946281 A2J_> 



10 



IS 



20 



25 



30 



35 



WO 99/4628 1 PCT/US9V;05028 
be a novel human prosuiic acid phosphatase. 

EXAMPLE (i l: Isolation of rPNA rionP. Fnr ^ ing Human Ppft 7 fi 7 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example I above, 
wherein the consensus sequence obtained is herein designated DNA42775. Based on DNA42775. oligonucleoUdes 
were synthesized: 1) ,o identify by PCR a cDNA library that contained the sequence of interest. and~2) for use as 
probes to isolate a clone of the full-length coding sequence for PRO707. 

A pair of PCR primers (forward and reverse) were syrahesized: 
forward PCR primfr 5'-TCCGTCTCTGTGAACCGCCCCAC-3' (SEQ id N0:391); 
rgvcrsg PCR primrr 5 -CTCGGGCGCA1TGTCGTTCTGGTC-3' (SEQ ID NO:392). 
Additionally, a synthetic oligonucleotide hybridizauon probe was constructed from the DNA42775 sequence which 
had the following nucleotide sequence: 
hvbridization nmhe. 

5'-CCGACTGTGAAAGAGAACGCCCCAGATCCACTTATTCCCC-3' (SEQ ID NO:393). 

In order to screen several Ubraries for a source of a ftill-length clone. DNA from the libraries was screened 
by PCR ampliiicadon with the PCR primer pair identifted above. A positive library was then used to isolate clones 
encoding die PRO707 gene usirig the probe oligonucleotide and one of the PCR primers. RNA for constniction of 
the cDNA Hbraries was isolated from human fetal kidney tissue (L1B227). 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PRO707 
(hciein designated as UNQ371 (DNA48306-1291)I (SEQ ID NO:389) and the derived protein sequence for PRO707. 

■nie entire nucleotide sequence of UNQ371 (DNA48306-1291) is shown in Figure 156 (SEQ ID NO:389). 
Clone UNQ371 (DNA48306-I291) contains a single open reading frame with an apparem translational iniuauon site 
at nucleotide positions 371-373 and ending at the stop codon at nucleotide positions 3119-3121 of SEQ ID NO:389. 
The predicted polypeptide precursor is 916 amino acids long (Figure 157). The ftill-length PRO707 protein shown 
in Figure 157 has an estimated molecular weight of about 100.204 daltons and a pl of about 4.92. Clone UNQ371 
(DNA48306-129I) has been deposited widi ATCC on May 27. 1998. It is understood that the clone UNQ37 1 which 
is deposited is that which encodes PRO707. and that die sequences herein are merely representations based on known 
sequencing techniques which may be subject to minor errors. 

Regarding analysis of the amino acid sequence, the signal sequence appears to be at about 1 through 30 of 
SEQ ID NO:390. Cadherins exinceUular repeated domain signaure sequence is at about amino acids 121-131. 230- 
240. 335-345. 44O450. and 550-560 of SEQ ID NO:390. Tyrosine kinase phosphorylation sites are at about imino 
acids 124-132 and 580-586 of SEQ ID NO:390. A potential transmembrane domain is at about amino acids 682-715 
± 5. TTic nucleic acid positions can be derived by referring to die corresponding codon for the named amino acid. 

Analysis of (he amino acid sequence of the full-length PRO707 polypeptide suggests diat portions of it 
possess significant homology to the cadherin FIB3 protein, expressed in human fibroblasts, thereby indicating that 
PRO707 may be a novel cadherin. 
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F. ^AMPLE 62 : Unlaiion nf cDNA Clones Rncoding HMman PRQ322 

A consensus sequence was obtained Tclaave lo a variety of EST sequences as described in Example I above, 
wherein the consensus sequence obtained is herein designated DNA48336 Based on the DNA48336 consensus 
sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained die sequence of 
interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR0322. 
5 A pair of PCR primers (forward and reverse) were synthesized: 

f o rward PrR primer 5'.CAGCCTACAGAATAAAGATGGCCC-3' (SEQ ID NO:396) 
r ^v^r... pen nrimer 5*-GGTGCAATGATCTGCCAGGCTGAT-3' (SEQ ID NO:397) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA48336 consensus 
sequence which had the following nucleotide sequence: 
10 ^iyhn(;!i7aiion probe 

5'.AGAAATACCTGTGGTTCAGTCCATCCCAAACCCCTGCTACAACAGCAG-3' (SEQ ID NO:398). 

In order to screen several libraries for a source of a full-length clone, DN A from the libraries was screened 
by PCR amplificadon with the PCR primer pair idemified above. A positive library was then used to isolate clones 
encoding the PR0322 gene using the probe oligonucleotide and one of the PCR pruncrs RNA for construction of 

15 the cDNA libraries was isolated from human fetal kidney tissue (LIB227). 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR0322 
[herein designated as UNQ283 (DNA48336-1309)J (SEQ ID NO:394) and the derived protein sequence for PR0322 
It is understood that UNQ283 (DNA48336-1309) in fact encodes PR0322, and that SEQ ID NQ:394 is a 
representation of the sequence based on sequencing techniques known in die art. 

20 The entire nucleotide sequence of UNQ283 (DNA48336-1309) is shown in Figure 158 (SEQ ID NO:394). 

Clone UNQ283 (DNA48336-1309) contains a single open reading frame with an apparent translauonal initiation site 
at nucleotide positions 166-168 and endmg at the stop codon at nucleotide positions 946-948 (Figure 158). TTic 
predicted polypeptide precursor is 260 amino acids long (Figure 159). The full-length PR0322 proiem shown in 
Figure 159 has an estimated molecular weight of about 28,028 daltons and a pi of about 7.87 Clone UNQ283 

25 (DNA48336-1309) has been deposited with ATCC and is assigned ATCC deposit no. 209669. 

Regarding the amino acid sequence of Figure 159, a potential N-glycosylation site is at amino acid 1 10 of 
SEQ ID NO:395. The serine proteases, trypsin family and histidine active site is identified at amino acids 69 through 
74 of SEQ ID NO:395 and the consensus sequence is idemified at amino acids 207 through 217 of SEQ ID NO: 395 
The kringle domain proteins motif is identified at amino acids 205 through 217 of SEQ ID NO:395. The putative 

30 signal peptide is encoded at about amino acids 1-23. 

Analysis of the amino acid sequence of the full-length PR0322 polypeptide suggests that portions of it 
possess significant homology to neuropsm and other serine proteases, thereby indicaung that PR0322 is a novel 
serine protease related to neuropsin. 

35 FXAMPLE 63 : Isolation of c DNA Clones Fncodinp Human PR0526 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 above, 
wherein the consensus sequence obtained is herein designated DNA39626. Based on the DNA39626 consensus 
sequence, oligonucleoddes were synthesized: 1) to identify by PCR a cDNA library that contained die sequence of 
interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR0526. 
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A pair of PGR primers (forward and reverse) were synthesized. 
fPfwart PCR primer 5 -TGGCTGCCCTGCAGTACCTCTACC-3' (SEQ id NO:401); 
reverse PCR primer 5'-CCCTGCAGGTCATTGGCAGCTAGG-3- (SEQ id NO:402). 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA39626 consensus 
setjuence which had the following nucleotide sequence: 
hybridization prnhe 

5 -AGGCACTGCCTGATGACACCTTCCGCGACCTGGGCAACCTCACAC-3' (SEQ ID NO:403) 

In order to screen several Ubraries for a source of a fiill-lcngdi clone. DNA from the libraries was screened 
by PCR ampUfication with the PCR primer pair identified above. A positive library was then used to isolate clones 
encoding the PR0526 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of 
the cDNA libraries was isolated from human fetal liver tissue (LIB228). 

DNA sequencmg of the clones isolated as described above gave the full-length DNA sequence for PR0526 
(herein designated as UNQ330 (DNA44184-1319)1 (SEQ ID NO:399) and the derived protein sequence for PR0526. 

The entire nucleodde sequence of UNQ330 (DNA44I84-1319) is shown in Figure 160 (SEQ ID NO:399). 
Clone UNQ330 (DNA44184-1319) contains a single open reading frame wim an apparent translational miuat.on site 
at nucleotide positions 514-516 and ending at the stop codon at nucleotide positions 1933-1935 (Figure 160) The 
predicted polypeptide precursor is 473 amino acids long (Figure 161). The ftill-length PR0526 protein shown in 
Figure 161 has an estimated molecular weight of about 50.708 daltons and a pi of about 9.28. Clone UNQ330 
(DNA44184-1319) has been deposited with the ATCC on March 26. 1998. It is understood that the clone comains 
the acmal sequence, whereas the sequences presented herein are representative based on current sequencing 
20 techniques. 

Analysis of the amino acid sequence of the ftiU-length PR0526 polypeptide suggests that portions of it 
possess significant homology to the leucine repeat rich proteins including ALS. SLIT, carboxypeptidase and platelet 
glycoprotein V thereby indicating that PR0526 is a novel protein which is involved in protein-protein interactions. 
Still analyzing SEQ ID NO:400. the signal peptide sequence is at about ammo acids 1-26. A leucine zipper 
25 pattern is at about amino acids 135-156. A glycosaminoglycan attachmem is at about amino acids 436^39. N- 
glycosylarion sites are at about amino acids 82-85. 179-182. 237-240 and 423^26. A von Willebrand factor (VWF) 
type C domainCs) is found at about amino acids 41 1-425. The skilled anisan can understand which nucleotides 
correspond to these amino acids based on the sequences provided herein. 

30 EXAMPI R 64 : Isglation of cDNA Clones Pnrnrfinp H uman PPnSti 

An ECD database was searched and an expressed sequence tag (EST) from UFESEQ^. Incyte 
Phamiaceuncals. Palo Alto. CA was identified which showed homology to protocadherin 3. Based on this sequence, 
a search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Method-i in FnTym fflffgY 
2fifi:460-480 (1996)) as a comparison of the ECD protein sequences to a 6 frame translation of the EST sequence. 
Those comparisons resultmg in a BLAST score of 70 (or in some cases 90) or greater that did not encode known 
proteins were clustered and assembled into consensus DNA sequences with the program "phrap" (Phil Green. 
University of Washington. Seattle. Washington; http://bozeman.mbt.washington.edu/phrap.docs/phrap.hmil). 

A consensus DNA sequence was assembled relative to other EST sequences using phrap. Based on the 
consensus sequence obtained, oligonicleotides were synthesized: I ) to .demify by PCR a cDNA library that contained 
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ihe sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR0531 

A pair of PCR primers (forward and reverse) were synthesized: 
f„rn,^ rH PrR nrimer 5'-CTGAGAACGCGCCTGAAACTCTG-3- (SEQ ID NO:406): 
Tp^ r"- Pf^R """'g'- 5 -AGCGTTGTCATTCACATCGGCG-3- (SEQ ID NO-407). 

Addiiionany. a syniheuc oligonucleotide hybridization probe was constructed from the consensus DNA sccjuence 
5 which had the following nucleotide sequence: 
hYtfrifliyaii"" P'""'^ 

5'.TTAGTrGCTCCATTCAGGAGGATCTACCCTrCCTCCTGAAATCCGCGGAA-3' (SEQ ID NO:408). 

In order to screen several Ubrarics for a source of a full-length clone, DNA from the libraries was screened 
by PCR amplilicaiion with the PCR primer pair identified above. A positive library was then used to isolate clones 
10 encoding the PR0531 gene using the probe oligonucleotide and one of the PCR primers. RNA for construcuon of 
the cDNA libraries was isolated from human fetal brain tissue (LIB153). The cDNA libraries used to isolate the 
cDNA clones were constnicted by standard methods using commercially available reagents such as those from 
Invitrogen, San Diego. CA. The cDNA was primed with oligo dT containing a NotI sue. linked with blunt to SaU 
hemikinased adaptors, cleaved with Notl. sized appropriately by gel electrophoresis, and cloned in a defined 
15 orientation into a suitable cloning vector (such as pRKB or pRKD; pRK5B is a precursor of pRK5D that does not 
contain the Sfil site; see. Holmes ei al.. ScififlSfi. 252:1278-1280 (1991)) in the umque Xhol and Notl sites. 

DNA sequencing of die clones isolated as described above gave the full-length DNA sequence for PR0531 
(herein designated as UNQ332 (DNA48314.1320)] (SEQ ID NO:404) and the derived protein sequence for PR0531. 
TTie entire represemarive nucleotide sequence of UNQ332 (DNA48314-1320) is shown in Figure 162 (SEQ 
20 ID N0:404). It is understood that the acmal sequence is that within the clone deposited with the ATCC as 
DNA48314-1320. Clone UNQ332 (DNA48314-1320) contains a single open reading frame with an apparera 
translational inination site at nucleotide positions 171-173 and ending at the stop codon at nucleotide positions 2565- 
2567 (Figure 162). The predicted polypeptide precursor is 789 amino acids long (Figure 163). THe fiill-tength 
PR0531 protein shown in Figure 163 has an estimated molecular weight of about 87.552 daltons and a pi of about 
25 4.84. Clone UNQ332 (DNA48314-1320) has been deposited with the ATCC on March 26. 1998. 

Analysis of the amino acid sequence of the full-length PR0531 polypeptide suggests that portions of it 
possess significant homology to protocadherin 3. Moreover. PR053I is found in the brain, like other protocadherins. 
thereby indicating that PR0531 is a novel member of the cadherin superfamily. 

StiU analyzing the amino acid sequence of SEQ ID NO:405. the cadherin extracellular repeated domain 
30 signamre is found at about amino acids 122-132. 231-241 . 336-346, 439^9 and 549-559 of SEQ ID NO:405. An 
ATT/GTP-binding site motif A (P-loop) is found at about amino acids 285-292 of SEQ ID NO:405. N-glycosylation 
sites are found at least at about amino acids 567-570. 786-790. 418-421 and 336-339 of SEQ ID NO:405. The signal 
peptide is at about amino acids 1-26. and the transmembrane domain is at about amino acids 685-712 of SEQ ID 
NO:40S. 



35 



FXAMPLE 65 : f<nlation of cP N A F-nrndinP Humatl PRQ534 

Aconsensus sequence was obtained lelanve to a variety of EST sequences as described in Example 1 above. 
v*crein the consensus sequence obtained b herein designated DNA43038. Based on the 43048 consensus sequence, 
oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that comamed the sequence of interest, and 
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2) for use as probes to isolate a clone of the ftill-length coding sequence for PR0534. 

A pair of PCR primers (forward and reverse) were synthesized: 
fprvarf PCP Primgr 5 -CACAGAGCCAGAAGTGGCGGAATC-3- (SEQ id N0:41 «; 
reverse? pCR prirwr 5-CCACATGTTCCTGCTCTTGTCCTGG-3- (SEQ id N0:412). 
Addiiionally. a synthetic oligonucleotide hybridization probe was constnicted from the consensus DNA43038 
sequence which had the following nucleotide sequence: 
hvbridiT^tion probe 

5'-CGGTAGTGACTGTACTCTAGTCCTGTnTACACCCCGTGGTGCCG-3' (SEQ ID NO:413). 

In order to screen several libraries for a source of a ftill-Iength clone, DNA from the libraries was screened 
by PCR amplification with the PCR primer pair idenufied above. A positive library was then used to isolate clones 
encoding the PR0534 gene using the probe oligonucleotide and one of the PCR primers. RNA for construcuon of 
the cDNA libraries was isolated from human fetal lung tissue (LIB26). 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR0534 
[herein designated as UNQ335 fDNA48333-132i)J (SEQ ID NO:409) and the derived protein sequence for PR0534. 

The ennre nucleotide sequence of UNQ335 (DNA48333-1321) is shown in Figure 164 (SEQ ID NO:409) 
Ckme UNQ335 (DNA48333-132I) contains a single open reading frame w,th an apparent .ranslational initiation site 
at nucleotide positions 87-89 and ending at the stop codon at nucleotide positions 1167-1169 (Figure 164). Tbc 
predicted polypeptide precursor is 360 amino acids long (Figure 165). TTie ftill-length PR0534 protein shown in 
Figure 165 has an estimated molecular weight of about 39.885 dakons and a pi of about 4.79. Clone UNQ335 
(DNA48333-1321) has been deposited with ATCC on March 26. 1998. It is understood that the deposited clone 
coniains the actual sequence, and that the sequences provided herein are representative based on current sequencing 
techniques. 

Analysis of the amino acid sequence of the fulMenglh PR0534 polypeptide suggests that ponions of it 
possess significant sequence identity with the protein disulfide isomerase, thereby indicating that PR0534 may be a 
novel disulfide isomerase. 

SdU analyzing the amino acid sequence of PR0534. die signal peptides is at about amino acids 1-25 of SEQ 
ID NO:410. The transmembrane domain is at about amino acids 321-340 of SEQ ID NO:410. The disulfide 
isomerase corresponding region is at amino acids 212-302 of SEQ ID NO:410. The thioredoxin domain is at amino 
acids 211-227 of SEQ ID NO:410. N-glycosylation sites are at: 165-168. 181-184. 187-190. 194-197.206-209.278- 
281. and 293-296 of SEQ ID NO:410. The corresponding nucleotides can routinely be determined from the 
sequences provided herein. PR0534 has a transmembrane domain rather dum an ER retention peptide like other 
protein disulfide isomerases. Additionally. PR0534 may have an imron ai the 5 prime end. 

EXAMPI F, 6(i : Isolation of rPNA rinn^. Fn c odini. Human pp ftfioy 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 above, 
wherein tiie consensus sequence obuined is herein designated DNA43052. Based on tiiis consensus sequence, 
oligomicleotides were synthesized: 1) to idemiiy by PCR a cDNA library that conuined the sequence of interest, and 
2) for use as probes to isolate a clone of the full-length cocUng sequence for PROe97. 

A pair of PCR primers (forward and reverse) were synthesized: 

fonvard PTR nrimf r 5 -cctggctcgctgctgctgctc-3' (seq id no:416): 
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T^vpr.P prR primer 5'-CCTCACAGGTGCACTGCAAGCTGTC-3' (SEQ ID NO:417). 
Additionally, a syniheiic oligonucleotide hybridizauon probe was constructed from the DNA43052 consensus 
sequence which had the following nucleotide sequence: 

5'-CTCTTCCTCTTTGGCCAGCCCGACTTCTCCTACAAGCGCAGAATrGC-3* (SEQ ID N0:418). 
5 In order to screen several libraries for a source of a full-length clone, DNA from the libraries was screened 

by PGR ampUficarion with the PGR primer pair identified above. A positive library was then used to isolate clones 
encoding the PR0697 gene using the probe oligonucleondc and one of the PGR primers. RNA for consu\iction of 
the cDNA libraries was isolated from human fetal kidney tissue (LIB227). 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR0697 
10 therein designated as UNQ361 {DNA50920- 1325)1 (SEQ ID NO:414) and the derived protein sequence for PR0697, 
The entire nucleodde sequence of UNQ361 (DNA50920-1325) is shown in Figure 166 (SEQ ID NO:414). 
Clom UNQ361 (DNA50920-1325) contains a single open reading frame with an apparent iranslational initiation site 
at nucleotide posiuons 44-46 and ending at the stop codon at nucleotide positions 929-931 (Figure 166). The 
predicted polypeptide precursor is 295 amino acids long (Figure 167). The full-length PR0697 protein shown in 
15 Figure 167 has an estimated molecular weight of about 33.518 daltons and a pi of about 7.74. Clone UNQ361 
(DNA50920-1325) was deposited with the ATCC on March 26. 1998. It is understood that the deposited clone 
contains the actual sequence, and that the sequences provided herein are representative based on currem sequencing 
techniques. 

Analysis of the amino acid sequence of the full-length PR0697 polypeptide suggests that portions of it 
20 possess significant sequence identity with sFRPs, thereby indicating that PR0697 may be a novel sFRP family 
member. 

Still analyzing the amino acid sequence of PR0697. the signal peptides is at about amino acids 1-20 of SEQ 
ID N0:415. The cystein rich domain, having identity with die frizzled N-terminus. is at about amino acids 6-153 
of SEQ ID N0:415. Hie corresponding nucleotides can routinely be determined from the sequences provided herein. 

25 

jgyAMPLE 67 - Ifir'^M^n *>f ^DNA Cfnnes Encoding Human PRQ7I7 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 above. 

wherein the consensus sequence obtained is herein designated DNA42829. Based on the DNA42829 consensus 

sequence, oligonucleotides were synthesized: 1) to identify by PGR a cDNA library that contained die sequence of 
30 interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR0717. 
A pair of PGR primers (forward and reverse) were syndiesized: 

forward PCR primer 5'-AGCTTCTGAGGGCTGGTGGAGCAG-3' (SEQ ID NO:42l); 

r^ V^rse PGR primer 5 -CGGGTCAATAAACCTGGACGCTrGG-3' (SEQ ID NO:422). 

Additionally, a synthetic oUgonucleotide hybridization probe was constructed from die DNA42829 consensus 
35 sequence which had the following nucleotide sequence: 

hY^ridi7flrion nrobe 

5'-TATGTGGACCGGACCAAGCACTrCACTGAGGCCACGAAGATrG-3* (SEQ ID NO:423). 

in order to screen several libraries for a source of a full-length clone. DNA from the libraries was screened 
by PCR amplification with die PCR primer pair identified above. A positive library was then used to isolate clones 
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encoding the PR0717 gene using the probe oligonudeoride and one of the PCR primers. RNA for construction of 
the cDNA libraries was isolated from human fetal liver tissue (UB229) 

DNA sequencing of the clones isolated as described above gave the ftill-lengih DNA sequence for PR0717 
nierein designated as UNQ385 (DNA50988-1326)| (SEQ ID NO:419) and the derived protein sequence for PR0717. 

The entire nucleodde sequence of UNQ385 (DNA50988-1326) is shown in Figure 168 (SEQ ID NO-4I9) 
Clone UNQ385 (DNA50988-1326)comams a single open reading frame with an apparent transladonal initiation stte 
at nucleotide positions 17-19 and ending at the stop codon at nucleotide positions 1697-1699 (Figure 168). The 
predicted polypeptide precursor is 560 amino acids long (Figure 169). The fiill-length PR0717 protein shown in 
Figure 169 has an estimated molecular weight of about 58.427 daltons and a pi of about 6.86. Clone UNQ385 
(DNA50988.1326) has been deposited with the ATCC on April 28. 1998. Regarding the sequence, it is understood 
that the deposited clone contains the correct sequence, and the sequences provided herein are based on known 
sequencing techniques. 

Analysis of the ammo acid sequence of the full-length PR0717 polypeptide suggests that PR07I7 may be 
a novel 12 transmembrane receptor. The reverse complemeni strand of DNA50988 has a stretch that matches 
identically with human regulatory myosin light strand. 

Still analyzing the amino acid sequence of SEQ ID NO:420. transmembrane domains are at about amino 
acids 30-50. 61-79. 98-112. 126-146. 169-182. 201-215. 248-268. 280-300, 318-337. 341-357. 375-387. and 420441 
of SEQ ID NO:420. N-glycosylaiion sites arc at about amino acids 40-43 and 43-46 of SEQ ID NO:420. A 
glycosaminoglycan attachmem site is at about amino acids 468471 of SEQ ID NO:420. n>e corresponding 
nucleotides can be routinely determined given the sequences provided herein. 

EXAMPLE 68: isolation nf r PNA Clonp^ Encoding Human PR07:^) 

A database was used to search expressed sequence tag (EST) databases. The EST database used herein was 
the proprietary EST DNA database LIFESEQ™. of Incyte Pharmaceuticals. Palo Alto. CA. Incyte clone 2581326 
was herein ideniiikd and termed DNA4280I . Based on dte DNA42801 sequence, oligonucleoddcs were syndiesized: 
1) to Identify by PCR a cDNA library diat conuiined the sequence of interest, and 2) for use as probes to isolate a 
clone of the full-length coding sequence for PR073I . 

A pair of PCR primers (forward and reverse) were synthesized: 

fpryani PCR primffr 5'-gtaagcacatgcctccagaggtgc-3' (seq id no:426): 

revcrw PCR priinrr 5 -GTGACGTGGATGCTTGGGATGTTG-3- (SEQ id N0:427). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA42801 sequence which 

had the following nucleotide sequence: 

hvbridi/arinn jtroht- 

5 -TGGACACCTrCAGTATTGATGCCAAGACAGGCCAGGTCATTCTGCGTCGA-3" (SEQ ID NO:428). 

In order 10 screen several Ubraries for a source of a full-length clone. DNA from the libraries was screened 
by PCR ampliiicadon with the PCR primer pair idemificd above. A posiuve library was dien used to isolate clones 
encoding the PR0731 gene using die probe oligonucleotide and one of the PCR pnmers. RNA for construction of 
die cDNA libraries was isolated from human bone manow tissue (UB255). The cDNA libraries used to isolate the 
CDNA clones were constructed by standard methods using commercially available reagenu such as diose from 
hvitiogen. San Diego. CA. The cDNA was primed with oligo dT containing a Noti site, linked with blum to Sail 
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hemikinased adaptors, cleaved with Noll, sized appropriately by gel electrophoresis, and c!oncd in a defined 
orientation into a suitable cloning vector (such as pRKB or pRKD; pRK5B is a precursor of pRK5D thai does not 
contain the Sfil site; see. Holmes ei ai.. ScifflQEfi. 252:1278-1280 (1991)) in the unique Xhol and Noil sues. 

DNA sequencing of the clones isolated as described above gave the ftilMength DNA sequence for PR0731 
Ihcnrin designated as UNQ395 (DNA48331-1329)1 (SEQ ID NO:424) and die derived protein sequence for PR0731 
5 The entire nucleotide sequence of UNQ395 (DNA48331-1329) is shown in Figures 170A-B (SEQ ID 

NO:424). Clone UNQ395 (DNA4833 1-1329) contains a single open reading frame with an apparent iranslaiional 
initiation site ai nucleotide positions 329-331 and ending at the stop codon at nucleotide positions 3881-3883 (Figures 
170A-B). The predicted polypeptide precursor is 1184 amino acids long (Figure 171). The full-length PR0731 
protein shown in Figure 171 has an estimated molecular weight of about 129.022 daltons and a pi of about 5.2. Clone 
10 UNQ395 (DNA48331-1329) was deposited with the ATCC on March 31. 1998. Regarding die sequence, it is 
understood that the deposited clone contains the correct sequence, and the sequences provided herein are based on 
known sequencing techniques. 

Analysis of the amino acid sequence of die hill-lengih PR0731 polypeptide suggests dwt portions of ii 
possess significant identity and similarity to members of ihe protocadherin family, thereby indicating diat PR0731 
15 may be a novel protocadherin. 

Still analyzing the amino acid sequence of SEQ ID NO:425. die putative signal peptide is at about amino 
acids 1-13 of SEQ ID NO:425. The transmembrane domain is at ammo acids 719-739 of SEQ ID NO:425. The N- 
glycosylation of SEQ ID NO:425 are as follows: 415^18, 582-586, 659-662, 662-665, and 857-860. The cadhcrin 
cxiraccUular repeated domain signatures are at about amino acids (of SEQ ID NO:425): 123-133, 232-242. 340-350. 
20 448-458, and 553-563. The corresponding nucleotides can be routinely determined given die sequences provided 
herein. 



pXf MPLE 69 : i^niannn of cPNA Clones Encoding Human PR0218 

25 A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 above, 

wherein the consensus sequence obtained is herein designated DNA17411. Two proprietary Gcncmcch EST 
sequences were employed in die consensus assembly and are shown in Figure 174 and 175. Based on die DNA1741 1 
consensus sequence, oligonucleotides were syndiesized: 1) to idennfy by PCR a cDNA library that contained Uw 
sequence of interest, and 2) for use as probes to isolate a clone of die full-lengdi coding sequence for PR0218. 

30 A pair of PCR primers (forward and reverse) were syndiesized: 

forward PCR primer S'-AAGTCGAGCCGGAGCCTTCC-S' (SEQ ID NO:433); 
ri^vi^fse PCR Primer 5'-TCGTTGTTTATGCAGTAGTCGG.3* (SEQ ID NO:434). 

AdditionaUy. a syndietic oligonucleotide hybridization probe was constructed from die consensus DNA 174 1 1 
sequence which had die following nucleotide sequence: 
35 hybridization probe 

5'.ATTGnTAAAGACTATGAGATACGTCAGTATGrrGTACAGG-3' (SEQ ID NO:435). 

In order to screen several Ubraries for a source of a ftilMengdi clone. DNA from die libraries was screened 
1^ PCR araptification widi die PCR primer pair idenufied above. A positive Ubrary was dicn used to isolate clones 
encoding die PR02I8 gene using die probe oligonucleotide and one of die PCR primers. RNA for construcuon of 
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the cDNA libraries was isolated from human fetal icidney tissue (UB28). 

DNA sequencing of the clones isolated as described above gave the fiill-lengih DNA sequence for PR0218 
(herein designated as UNQ192 (DNA30867-1335)I (SEQ ID NO:429) and the derived protein sequence for PR0218. 

TTw entire nucleotide sequence of UNQI92 (DNA30867-1335) is shown in Figure 172 (SEQ ID NO:429) 
Clone UNQ192 (DNA30867-1335) contains a single open reading frame wiU. an apparem translational initiation site 
5 at nucleotide positions 150-152 and ending at the stop codon at nucleotide positions 1515-1517 (Figure 172). TTk: 
predicted polypeptide precursor is 455 amino acids long (Figure 173). The full-length PR0218 protein shown in 
Figure 173 has an estimated molecular weighi of about 52.917 daltons and a pi of about 9.5. Clone UNQ192 
{DNA30867-1335) has been deposited w.th the ATCC on April 28. 1998. Regarding the sequence, it is understood 
that the deposited clone contains the correct sequence, and the sequences provided herem are based on known 
10 sequencing techniques. 

Analysis of the amino acid sequence of the fuIMength PR0218 polypeptide suggests that PR0218 may be 
a novel transmembrane protein. 

Still analyzing the amino acid sequence of SEQ ID NO:430. the putative signal peptide is at about amino 
acids 1 dirough 23 of SEQ ID NO:430. Transmembrane domains are potentially at about amino acids 37-55. 81-102. 
150-168. 288-311. 338-356. 375-398. and 425-444 of SEQ ID NO:430. N-glycosylauon sites are at about amino 
acids 67. 180. and 243 of SEQ ID NO:430. Eukaryoric cobalamin-binding protein is at about amino acids 151-160 
of SEQ ID -NO:430. Tbt corresponding nucleotides can be routinely daennined given the sequences provided herein. 
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EXAMPLE 70 : [so i a tion of rONA rion es Fncodinp Human PROTfiH 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 above, 
wherein the consensus sequence obtained is herein designated DNA43448. Based on die DNA43448 consensus 
sequence, oligonucleotides were syndiesized: 1) to identify by PCR a cDNA library that contained the sequence of 
interest, and 2) for use as probes to isolate a clone of die ftjil-length coding sequence for PR0768. 
A pair of PCR primers (forward and reverse) were syndiesized: 

25 fgpvard PCR priincr s'-ggctgacaccgcagtgctcttcag-S' (seq id no:438): 

revCW PCR priniBr 5 -GCTGCTGGGGACTGCAATGTAGCTG-3' (SEQ id N0:439). 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from die DNA43448 consensus 
sequence which had the following nucleotide sequence: 
hvbridiMtion prnhf 

5'-CATCCTCCATGTCTCCCATGAGGTCTCTATrGCTCCACGAAGCATC-3' (SEQ ID NO:440). 

In order to screen several Ubraries for a source of a ftill-length clone. DNA from the libraries was screened 
by PCR amplificaaon with die PCR primer pair idemified above. A positive library was tiien used to isolate clones 
encoding the PR0768 gene using the probe oligonucleotide and one of die PCR primers. RNA for construction of 
the cDNA libraries was isolated from human bone marrow ussue (LIB255). 

DNA sequencing of die clones isolated as described above gave die full-length DNA sequence for PR0768 
(herein designated as UNQ406(DNA55737-1345)1 (SEQ ID NO:436) and the derived protem sequence for PR0768. 

The entire nucleotide sequence of UNQ406 (DNA55737.1345) is shown in Figures 176A-B (SEQ ID 
NO:436). Clone UNQ406 (DNA55737-1345) contains a single open reading frame witfi an apparem translational 
initiation site at micleotide positions 20-22 and ending at the stop codon at nucleotide positions 3443-3445 (Figures 

242 



30 



35 



BNSOOCIO- <W0_««4«M1A2J,> 



PCT/US')9/05028 

WO 99/46281 

176A-B). The predicted polypeptide precursor is 1141 amino acids long (Figure 177). The fulHengih PR0768 
protein shown in Figure 177 has an estimated molecular weight of about 124.671 daltons and a pi of about 5.82. 
Clone UNQ406 (DNA55737-1345) has been deposited with the ATCC on April 6. 1998. Regarding the sequence, 
it b understood that the deposited clone contains dje correct sequence, and die sequences provided herein are based 
on known sequencing techiuques. 
5 Analysis of the amino acid sequence of the full-length PR0768 polypeptide suggests that ponions of it 

possess significant sequence identity and similarity with integrin 7. 

Still analyzing the amino acid sequence of SEQ ID NO:437. the putative signal peptide is at about ammo 
acids 1-33 of SEQ ID NO:437. The transmembrane domain is at amino acids 1039-1064 of SEQ ID NO:437. N- 
glycosylation sites are at amino acids: 86-89. 746-749. 949-952. 985-988 and 1005-1008 of SEQ ID NO:437. 
10 Integrinalphachainproteindomainsareidentifiedataboutanunoacids: 1064-1071,384^9. 1041-1071.317-346. 
443^65. 385^07. 215-224. 634-647 . 85-99. 322-346. 470-179, 442-466. 379-M)8 and 1031-1047 of SEQ ID 
NO:437. The corresponding nucleotides can be routinely determined given the sequences provided herein. 

F, ^AMPt.E71 : [■iniaiion of cDM A Tinngs Encndinp Htiman PRQ771 
15 A consensus sequence was obtained relative to a variety of EST sequences as described in Example I above. 

wherein the consensus sequence obtained is herein designated DNA43330. Based on the DNA43330 sequence. 

oligonucleoiides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 

2) for use as probes to isolate a clone of the fiiU-Iength coding sequence for PR0771. 
A pair of PCR primers (forward and reverse) were synthesized: 
20 fnrw«rH PCR primer 5 -CAGCAATATrCAGAAGCGGCAAGGG-3' (SEQ ID NO:443): 

r.v.r..PrR nrimer 5 -CATCATGGTCATCACCACCATCATCATC-3' (SEQ ID NO:444). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA43330 consensus 

sequence which had the following nucleoudc sequence: 

hYhrirfiTaiinn nfobe 

25 5--GGTTACTACAAGCCAACACAATGTCATGGCACTGTrGGACACTGCTCG-3' (SEQ ID NO:445). 

In Older to screen several hTjtaries for a source of a fiiU-length clone . DNA from the libraries was screened 
by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones 
encoding the PR0771 gene using the probe oligomicleoUde and one of the PCR primers. RNA for construcoon of 
the cDNA Ubraries was isolated from human fetal kidney tissue (UB28). 
30 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR0771 

(herein designated as UNQ409 (DNA49829-1346)] (SEQ ID NO:441) and the derived protein sequence for PR0771. 

m entire nucleotide sequence of UNQ409 (DNA49829-1346) is shown in Figure 178 (SEQ ID NO:441). 
Clone UNQ409 (DNA49829-1346) contains a single open reading frame with an apparent translational initiation site 
at nucleotide positions 134-136 and ending at the stop codon at nucleotide positions 1442-1444 (Figure 178). lite 
35 predicted polypeptide precursor is 436 amino acids long (Figure 179). n>e (uU-length PR0771 protein shown in 
Figure 179 has an estimated molecular weight of about 49.429 daltons and a pi of about 4.8. Clone UNQ409 
(DNA49829-1346) has been deposited with the ATCC on April 7. 1998. Regarding the sequence, it is understood 
that the deposited clone comains the correct sequence, and the sequences provided herein arc based on known 
sequencing techniques. 
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Analysis of the anuno acid sequence of the full-length PR0771 polypeptide suggests that poraons of it 
possess significant homology to the tcstican proiem. thereby indicating that PR0771 may be a novel lestican 
homologue. 

Still analyzing the amino acid sequence of SEQ ID NO:442. the puutive signal peptide, leucine zipper 
panem. N-myristoylation sites, and thyroglobulin type-l repeats are also shown in Figure 179. The corresponding 
nucleotides can be routinely determined given the sequences provided herein. 

EXAMPLE 72 : [solation of cDNA Cf^^ Ftl podinf pR ft??"* 

A consensus sequence was obtained relative to a variety of EST sequences as described in Example 1 above, 
wherein the consensus sequence obtained is herein designated DNA45600. Based on the DNA45600 consensus 
sequence, oligonucleotides were synthesized: I) to identify by PCR a cDNA library that contained the sequence of 
interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR0733. 

A pair of PCR primers (forward and reverse) were synthesized: 

forward pcR nrimgr 5 •cccagcagggatgggcgacaaga-3- (Seq id no:448); 

Kvmc PCR Primer 5"-GTCTTCCAGnTCATATCCAATA-3' (SEQ ID NO:449). 

Additionally, a synthetic oligonucleotide hybridizauon probe was constructed from the DNA45600 consensus 
sequence which had the following nucleotide sequence: 

hybridization prnhg 

5'-CCAGAAGGAGCACCGGGAAGGGCAGCCAGATCTTGTCGCCCAT.3* (SEQ ID NO:450). 

In order to screen several libiaries for a source of a full-length clone. DNA from the libraries was screened 
by PCR ampMcarion with die PCR primer pair identified above. A positive library was then used to isolate clones 
encoding the PR0733 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of 
the cDNA libraries was isolated from human bone marrow tissue (UB255). 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PR0733 
(herein designated as UNQ411 (DNA52196-1348)J (SEQ ID NO:446) and the derived protein sequence for PR0733. 

The entire nucleotide sequence of UNQ41I (DNA52196-1348) is shown in Figures 180A-B (SEQ ID 
NO:446). Clone UNQ411 (DNA52196-1348) contains a single open reading frame with an apparent translational 
initiation site ai nucleotide positions 106-108 and ending at die stop codon at nucleotide positions 793-795 (Figures 
18(JA-B). Tlie predicted polypeptide precursor is 229 amino acids loi« (Figure 181). Hie full-length PR0733 protein 
shown in Figure 181 has an estimated molecular weight of about 26.017 daltons and a pi of about 4.73. Clone 
UNQ41 1 (DNA52196-1348) has been deposited with the ATCC on April 7. 1998. Regarding ibt sequence, it is 
understood that the deposited clone contains die correct sequence, and die sequences provided herein are based on 
known sequencing techniques. 

Analysis of the amino acid sequence of the full-length PR0733 polypeptide suggests diat portions of it 
possess significant sequence identity and similarity to the T1/ST2 receptor binding protein precursor and dierefore 
may have a similar Amction in cell signaling. If it is a cytolcine, it may be useful in the treamient of inflammation 
anticancer. 

Still analyzing die amino acid sequence of SEQ ID NO:447. the putative signal peptide, transmembrane 
domain. N-myristoylation site, and tyrosine kinase site are also shown in Figure 181. TTie corresponding nucleotides 
can be routinely determined given die sequences provided herein. 
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FXAMPLE 73 : Isolation of c HNA riones Encoding Human PR0162 

An expressed sequence lag (EST) DNA database (Merck/Washingion University) was searched and an EST 
AA397543 was identified which showed homology to human pancreamis-associaied protein. The EST AA397543 
cole was purchased and its tnsen obtained and sequenced and the sequence obtained is shown in Figure 182 (SEQ 
IDNO:45I). 

5 The entire nucleotide sequence of PR0162 is shown in Figure 182 (SEQ ID NO:451). DNA sequencing 

of the clone gave the full-length DNA sequence for PR0162 [herein designated as UNQ429 (DNA56965-1356)I (SEQ 
ID NO:451) and the derived protein sequence for PR0162. Clone UNQ429 (DNA56965-1356) contains a single open 
reading frame with an apparent translational initiation site at nucleotide positions 86-88 and ending at the stop codon 
at nucleodde positions 611-613 (Figure 182). The predicted poiypepude precursor is 175 amino acids long (Figure 

10 183). The full-length PR0162 protein shown in Figure 183 has an estimated molecular weight of about 19.330 
daltons and a pi of about 7.25. Clone UNQ429 (DNA56965-I356) has been deposited with the ATCC. Regarding 
the sequence, it is understood that the deposited clone contains the correct sequence, and the sequences provided 
herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of the full-length PRO 162 polypeptide suggests that ponions of it 

15 possess significant homology to the human pancreatitis-associated protein, thereby mdicating that PRO 162 may be 
a novel pancreatitis*associaied protein. 

Sdll analyzing the amino acid sequence of SEQ ID NO:452. the putanve signal peptide is at about amino 
acids 1-26 of SEQ ID NO:452. A C-type lectin domain signamrc is at about amino acids 146-171 of SEQ ID 
NO:452. The corresponding nucleotides can be routinely determined given the sequences provided herein. 

20 

f;VAMPLE 74 : Tsolattnn of cDNA C lones Encoding Human PR0788 

A consensus DNA sequence (designated herein as DNA49308) was assembled relative to other EST 
sequences using phrap as described in Example 1 above. Based upon an observed homology between the DNA49308 
consensus sequence and the Incyte EST cloOne no. 2777282, the Incyte EST clone no. 2777282 was purchased and 
25 its insert obtained and sequenced, which gave the fiill-lengih DNA sequence for PR0788 [herein designated as 
UNQ430 (DNA56405-1357)] (SEQ ID NO:453) and the derived protein sequence for PR0788. 

Clone UNQ430 (DNA56405-1357) contains a single open reading frame with an apparent uranslational 
initiation siie at nucleotide positions 84-86 and ending at the stop codon at nucleotide positions 459-461 (Figure 184). 
TTic predicted polypeptide precursor is 125 amino acids long (Figure 185). The full-length PR0788 protein shown 
30 in Figure 185 has an estimated molecular weight of about 13,115 daltons and a pl of about 5.90. Clone UN(}430 
(DNA56405-1357) has been deposited with the ATCC. Regarding the sequence, it is understood tiiat die deposited 
clone contains the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Still analyzing Figure 185. a signal peptide is shown at about amino acids 1-17 of SEQ ID NO:454. An N- 
glycosylation site is at about amino acids 46-49 of SEQ ID NO:454. 

35 

PXAMPLE 75 ; T^nUiinn of cDNA Clones Encoding Human PRO1Q08 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in 
Example 1 above. This consensus sequence is herein designated as DNA49804. An EST proprietary to Genentech 
was employed in die consensus assembly and is herein designated as DNAI6508 (Figure 188; SEQ ID NO:457). 
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Based upon an observed homology between the DNA49804 sequence and Merck EST clone no. AA 143670. the 
Merck EST clone no. AA143670 was purchased and its insert obtained and sequenced. Thai sequence is shown 
herein in Figure 186 (SEQ ID NO:455). 

Sequencing gave ihc ftil! length sequence for PRO1008 [herein designated as UNQ492 (DNA57530-1375)] 
(SEQ ID NO:455) and the derived protein sequence for PRO1008 were identified. 

TTie entire nucleotide sequence of UNQ492 (DNA57530^1375) is shown in Figure 186 (SEQ ID NO:455). 
Clone UNQ492 (DNA57530-1375) contains a single open reading frame with an apparem iranslauonal initiation site 
at nucleotide positions 138-140 and ending at the stop codon at nucleotide positions 936-938 (Figure 186). The 
predicted polypeptide precursor is 266 amino acids long (Figure 187) The full-length PRO1008 protein shown in 
Figure 187 has an estimated molecular weight of about 28,672 dalions and a pi of about 8.85. Clone UN(3492 
(DNA57530-I375) has been deposited with the ATCC on May 20, 1998. Regarding the sequence, it is understood 
that the deposited clone contains the correct sequence, and the sequences provided herein are based on known 
sequencing techniques. 

Analysis of the amino acid sequence of the full-length PRO 1 008 polypeptide suggests that portions of it 
possess signiiicant sequence identity and/or similarity with mdkk-1. thereby indicating that PROI008 may be a novel 
15 member of this family and have head inducing activity. 

Still analyzing the amino acid sequence of SEQ ID NO:456, die putative signal peptide is at about amino 
acids 1-23 of SEQ ID NO:456. The N-glycosylation site is at about amino acids 256-259 of SEQ ID NO:456, and 
the fungal zn-(2)-cys(6) binuclear cluster domain is at about amino acids 110-126 of SEQ ID NO:456. The 
corresponding nucleotides can of aU the amino acids can be routinely determined given the sequences provided herein. 

20 

EXAMPLE 76: Isolation of cnN/\ Clones Encoding Human PROinn 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in 

Example 1 above, wherein the consensus sequence is herein designated DNA49313. Based on the DNA49313 

consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the 
25 sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR01012. 
A pair of PCR primers (forward and reverse) were synthesized: 

fPPVart pep primer 5*.ACTCCCCAGGCTGTTCACACTGCC-3' (SEQ id NO:460); 

reverse PCM primgr 5 -GATCAGCCAGCCAATACCAGCAGC-3' (SEQ ID N0:461), 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA49313 consensus 
30 sequence which had the following nucleotide sequence: 

hybridization 

5'-GTGGTGATGATAGAATGCnTGCCGAATGAAAGGAGTCAACAGCTATCCC-3* (SEQ ID NO:462). 

In order to screen several Ubraries for a source of a full-length clone, DNA from the libraries was screened 
by PCR an^lification with the PCR primer pair identified above. A positive library was then used to isolate clones 
35 encoding the PRO1012 gene using the probe oligonucleotide and one of the PCR primers. RNA for construction of 
the cDNA libraries was isolated from human fetal kidney tissue (LIB227). 

DNA sequencing of the clones isolated as described above gave the fall-length DNA sequence for PRO1012 
(herein designated as UNQ495 (DNA56439- 1376)1 (SEQ ID NO:458) and the derived protein sequence for PRO1012. 
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The entire nucJeoiidc sequence of L'NQ495 (DNA56439.1376) is shown in Figures 189A-B (SEQ ID 
NO:458). Clone UNQ495 (DNA56439-I376) coniains a single open reading frame w»h an apparent iranslaiionai 
initiacion site at nucleotide positions 4OM06 and ending at the stop codon at nucleotide positions 2645-2647 (Figures 
189A-B), The predicted polypeptide precursor is 747 amino acids long (Figure 190). The full-length PRO10I2 
prolan shown m Figure 190 has an estimated molecular weight of about 86. 127 daltons and a pi of about 7.46. Clone 
5 UNQ495 (DNA56439-1376) has been deposited with ATCC on May 14. 1998. Regarding the sequence, it is 
understood that the deposited clone contains the correct sequence, and the sequences provided herein are based on 
known sequencing techniques. 

Analysis of the amino acid sequence of the full-length PRO1012 polypeptide suggests that ponions of it 
possess sequence identity with disulfide isomerasc thereby indicating that PR01012 may be a novel disulfide 
10 isomerase related protein. 

Still analyzing the amino acid sequence of SEQ ID NO:459. the cyiochromc C family heme-binding site 
signanirc is at about amino acids 158-163 of SEQ ID NO:459. The Nt-DNAJ domam signature is at about amino 
acids 77-96 of SEQ ID NO:459. An N-glycosylaiion site is at about amino acids 484-487 of SEQ ID NO:459. The 
ER targetii^ sequence is at about amino acids 744-747 of SEQ ID NO:459. It is understood dial the polypeptide and 
15 nucleic acids disclosed can be routinely formed with or without, these portions as desired, in alternative embodiments. 
For example, it may be desirable to produce PROI012 without the ER targeting sequence. The corresponding 
nucleotides can be routinely determined given the sequences provided herein. 

EXAMPLE 77: Isolation of cDNA Clong^ Encoding Human PROinu 
20 A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in 

Example 1 abobc. wherein the consensus sequence obtained is herein designated DNA49811. Based upon an 
observed homology between the DNA49811 sequence and Incyie EST clone no. 2612207, Incyte EST clone no. 
2612207 was purdiascd and its insert was obtained and sequenced, wherein the sequence obtained is shown in Figure 
191 (SEQ OD NO:463). 

25 DNA sequencing gave the full-length DNA sequence for PRO10I4 (herein designated as UN(3497 

(DNA56409-1377)] (SEQ ID NO:463) and the derived protein sequence for PRO1014. 

The entire nucleotide sequence of UN<3497 (DNA56409-1377) is shown in Figure 191 (SEQ ID NO:463). 
Clone UN(J497 (DNA56409-1377) coniains a single open reading frame with an apparcm iranslational initiation site 
at nucleotide positions 66-68 and ending at the stop codon at nucleotide positions 966-968 (Figure 191). The 

30 predicted polypeptide precursor is 300 amino acids long (Figure 192). The full-length PRO1014 protein shown in 
Figure 192 has an estimated molecular weight of about 33,655 daltons and a pl of about 9.31. Clone UN(}497 
(DNA56409-1377) has been deposited with the ATCC on May 20. 1998. Regarding the sequence, it is understood 
that the deposited clone contains die correct sequence, and the sequences provided herein are based on known 
sequencing techniques. 

35 Analysis of the amino acid sequence of the full-length PRO1014 polypeptide suggests that portions of it 

possess sequence identity with reductase, thereby indicating that PR01014 may be a novel member of the reductase 
fttmily. 

Sdll analyzing the amino acid sequence of SEQ ID NO:464. die putative signal peptide is at about amino 
adds 1-19 of SEQ ID NO:464. The cAMP and cGMP dependent protein kinase phosphorylation sites arc at about 
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amino acids 30-33 and 58-61 of SEQ ID NO:464. Short chain alcohol dehydrogenase family proteins are ai about 
amino acids 165-202. 37-49, 1 12-122 and 210-219 of SEQ ID NO-464. The corresponding nucleotides of these 
domains and any other amino acids provided herein can be routinely deienmned given the sequences provided herem. 

EXAMPLE 78: Isolation of cPNA Clones Encoding Human PR01Q17 

5 A consensus DNA sequence was assembled relative to other EST sequences using phrap as descnbed m 

£xan:q}le 1 above, wherein that consensus DNA sequence is herein designated DNA5323S. Based upon an observed 
homology between the DNA53235 consensus sequence and the Merck EST clone no. AA243086, the Merck EST 
clone no. AA243086 was purchased and its insert obtained and sequenced, wherein the sequence obtained is shown 
in Figure 193 (SEQ ID NO:465), DNA sequencing gave the full-length DNA sequence for PRO1017 (herein 

10 designated as UNQ500 (DNA561 12-1379)] (SEQ ID NO:465) and the derived protein sequence for PRO1017. 

The entire nucleotide sequence of UNQ500 (DNA561 12-1379) is shown in Figure 193 (SEQ ID NO:465) 
Clone UNQ500 (DNA561 12-1379) contains a single open reading frame with an apparent translational initiation sue 
at nucleotide positions 128-130 and ending at the stopcodon at nucleotide positions 1370-1372 (Figure 193). The 
predicted polypeptide precursor is 414 amino acids long (Figure 194). The ftill-length PRO1017 protein shown m 

15 Figure 194 has an estimated molecular weight of about 48.414 daltons and a pi of about 9.54. Clone UNQ500 
(DNA561 12-1379) has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited 
clone contains die correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of the full-length PRO 10 17 polypeptide suggests that portions of it 
20 possess sequence identity with HNK-1 sulfotransferase, thereby indicating diat PRO1017 may be a novel 
sulfotransferase. 

Still analyzing the amino acid sequence of SEQ ID NO:466. the putative signal peptide is at about ammo 
acids 1-31 of SEQ ID NO:466. N-glycosylauon sites are at about amino acids 134-137. 209-212, 280-283 and 370- 
273 of SEQ ID NO:466. The TNFR/NGFR family cystein-rich region protein is at about amino acids 329-332 of 
25 SEQ ID NO:466. The corresponding nucleotides can be routinely determined given the sequences provided herein. 
The protein can be secreted. 

EXAMPLE 79 : Unlation nf cDNA riones Encoding Human PRQ474 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in 

30 Example 1 above, wherein die consensus sequence obtained is herein designated DNA49818. Based upon an 
observed homotogy between the DNA49818 consensus sequence and die Merck EST clone no. H77889. the Merck 
EST clone no. H77889 was purchased and its insert obtained and sequenced, wherein die sequence obtained is herein 
shown in Figure 195 (SEQ ID NO:467). DNA sequencing gave the full-lengdi DNA sequence for PR0474 [herein 
designated as UNQ502 (DNA56045-1380)] (SEQ ID NO:467) and die derived protein sequence for PR(M74. 

35 The entire nucleonde sequence of UNQ502 (DNA56045-1380) is shown in Figure 195 (SEQ ID NO:467). 

Clone UNQ502 (DNA56045-1380) contains a single open reading frame wiUi an apparent translational initiation site 
at nucleotide positions 106-108 and ending at die stop codon at nucleotide positions 916-918 (Figure 195). The 
predicted polypeptide precursor is 270 amino acids long (Figure 196). The ftill-length PR0474 protein shown in 
Figure 196 has an estimated molecular weight of about 28.317 daltons and a pi of about 6.0. Clone UNQ502 
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(DNA56045-1380) has been deposited with the ATCC. Regarding the sequence, ii is understood that the deposited 
cloi« contains the conect sequence, and the sequences provided herein are based on known sequencing techniques. 

Still analyzing the amino acid sequence of SEQ ID NO:468, an N-glycosylation site is at about amino acids 
138-141 of SEQ ID NO:468, Shon-chain alcohol dehydrogenase family proteins are at about amino acids 10-22. 81- 
91. 134-171 and 176-185 of SEQ ID NO:468. The corresponding nucleotides can be routinely determined given 
5 the sequences provided herein. 

pyA i MPiE 80 - i«»i«»inn of cDNA rinncs Fncoriing Human PRQ1031 

An initial consensus DNA sequence was assembled relative to other EST sequences usmg phrap as described 
in Example 1 above, wherein the consensus sequence obtained is herein designated as DNA47332. Based upon an 
observed homology between the DNA47332 sequence and the Merck EST clone no. W74558. Merck EST clone no. 
W74558 was purchased and its insert obtained and sequenced, wherein the sequence obtamed is shown in Figure 197 
(SEQ ID NO:469). DNA sequencing gave the foU-length DNA sequence for PRO1031 [herein designated as 
UNQ516 (DNA59294.1381)] (SEQ ID NO:469) and the derived protein sequence for PRO1031. 

The entire nucleotide sequence of UNQ516 (DNA59294-1381) is shown in Figure 197 (SEQ ID NO:469). 
Clone UNQ516 (DNA59294-1381) contains a single open reading frame with an appaitm translauonal initiation site 
at nucleotide positions 42^ and ending at the stop codon at nucleoudc positions 582-584 (Figure 197). 
predicted polypeptide precursor is 180 amino acids long (Figure 198). The full-length PRO1031 protein shown in 
Figure 198 has an estimated molecular weight of about 20.437 dallons and a pi of about 9.58. Clone UNQ516 
(DNA59294-1381) has been deposited widi the ATCC. Regarding the sequence, it is understood that the deposited 
clone contains the correct sequence, and die sequences provided herein are based on known sequencing techniques. 
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Analysis of the amino acid sequence of the full-length PRO1031 polypeptide suggests thai it is a novel 

cytokine. 

Still analyzing the amino acid sequence of SEQ ID NO:470. the putative signal peptide is at about amino 
acids l-20ofSEQ ID NO:470. An N-glycosyUtion s.te is at about amino acids 75-78 of SEQ ID NO:470. A region 
having sequence identity with IH7 is at about amino acids 96-180. The corresponding micleotides can be routinely 
determined given dte sequences provided herein. 

pvAMPi F. 81 - 1-iffliiiifTii fff rP^** Fnrodinp Hiiman PR0938 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in 
Exanvle 1 above, wherein .hat consensus sequence is herein designated DNA49798. Based on the DNA49798 DNA 
consensus sequence, oligonucleotides were synthesized. 1) to identify by PCR a cDNA library that contained the 
sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR0938. 
A pair of PCR primers (forward and reverse) were synthesized: 
35 forward PCR Primer 5'.GTCCACX:CCATGACCGCCTCCAAC-3' (SEQ ID NO:473) 
r.v.,.. PCR nrimer 5 -CTCTCCTCATCCACACCAGCAGCC-3' (SEQ ID NO:474) 

Additionally, a synthetic oligonucleotide hybridizadon probe was constructed from the consensus DNA49798 
sequence which had the following nucleotide sequence: 



30 
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hvbndiratinn nrnh,. 

5--GTGGATGCTGAAATnTACCCCCCATGGTGTCCATCCTGCCAGC-3- (SEQ ID NO:475) 

In order lo screen several libranes for a source of a Ml-lcn|th clone. DNA from ihc libraries was screened 
by PCR amplificanon w,tf, d,c PCR primer pa.r .dcm.ned above A posinvc library was then used ,o isolate clones 
encoding (he PR0938 gene using (he probe oligonucleoude and one of (he PCR pnmcrs. RNA for conscrucuon of 
(he cDNA libranes was isolated from human fe(al kidney tissue (LIB227). 

DNA sequcncmg of the clones isolated as descnbed above gave the full-length DNA sequence for PR0938 
fhercm designated as UNQ475 (DNA5643M406)) (SEQ ID NO:471) and the derived ptptem sequence for PR0938. 

nie enorc nucIcoDdc sequence of UNQ475 fDNA56433-l406) is shown in Figure 199 (SEQ ID NO-47I) 
Clone UN0475 (DNA56433-1406) comains a smgle open reading frame with an apparent translat.onal imuation site 
at nucleotide positions 134-136 and ending at the stop codon at nucleoude positions 1181-1183 (Figure 199). n»e- 
predicted polypeptide precursor is 349 amino acids long (Figure 200). TTie full-length PR0938 protem shown in 
Rgure 200 has an estimated molecular weight of about 38.952 daltons and a pi of about 4.34. Analysis of the full- 
length PR0938 sequence shown in Figure 200 (SEQ ID NO:472) evidences the presence of the following feamres: 
a signal pepude from ammo I to about amino acid 22. a .ransmembrane domain from about ammo acd 1 91 to about 
amino acd 211. a potential N-glycosylation sue from about ammo acd 46 to about ammo acid 49. a region 
homologous to disulfide isomerase from about amino acd 56 to about amino acid 72. and a region having sequence 
identity with flavodoxin proteins from about amino acid 173 to about amino acid 187. 

Clone UNQ475 (DNA56433.1406) has been deposited with ATCC on May 12. 1998. and is assigned ATCC 
Accession No. 209857, 

Analysis of the amino acid sequence of the full-length PR0938 polypepudc suggests that it possesses 
significant sequence similano^ ,o protein disulfide isomerase. (hereby indicating that PR0938 may be a novel protein 
disulfide isomerase. An analysis of <he Dayhoff database (version 35.45 SwissProt 35) evidenced significant 
homology between the PR0938 amino acid sequence and the following Dayhoff sequences. P W03626. P_W03627 
P.R70491. GARP_PLAFF. XLU85970J. ACADISPROA.I. IE68.HSVSA. KSU52064 «. U93872 83 
P R97866. 



EXAMPI F ff? : hOlaiion of rPNA Clone, Fnrnrtjnf H uman PRniOOT 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as descnbed in 
Example 1 above, whecin the consensus sequence is herein designated DNA38097. Based on this consensus 
sequence, oligonucleotides were synthesized: 1) to .deniily by PCR a cDNA libraiy that contained the sequence of 
interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PROI082. 

A set of PCR pnmen (two forward and one reverse I were synthesized: 

foryarOnmcrl 5 -gtccacagacagtcatctcaggagcag-3' (SEQ id no:478): 

t Pfward primrr 2 S -ACAAGTGTCTTCCCAACCTG-3' (SEQ id N0:479); 
KvcrKpnnurl 5 -ATCCTCCCAGACCCATGGTACCTC-3- (SEQ id NO:480). 

Additionally, a synthetic oligonucleotide hybridization probe was constnicted from the DNA38097 consensus 
sequence which had the following nucleotide sequence: 

hvbridi7aiinn nmh^ 

5'-CCAAGGATAGCTGTTGTITCAGAGAAAGGATCGTGTGCTCCATCTCCTCCT-3' (SEQ ID NO:481) 
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In order lo saeen several Lbranes for a source of a full-length clone. DNA from ihe libranes was screened 
by PCR arnplificaaon with the PCR primers idenuficd above. A positive library was then used to isolate clones 
encoding the PRO1082 gene using the probe oligonuclcoudc and one of the PCR primers. RNA for constnicoon of 
the cDNA libraries was isolated from human tctal kidney tissue iLIB227). 

DNA sequencmg of the clones isolated as described above gave the ftilMcngih DNA sequence for PRO1082 
5 [herem designated as UNQ539 a>NA539l2-1457)] (SEQ ID NO:476) and the derived protem sequence for PRO1082, 

The ennre nucleotide sequence of UNQ539 (DNA53912-1457) is shown in Figure 201 (SEQ ID NO:476). 
Clone UNQ539 (DNA53912-I457) contains a smgle open reading frame with an apparent u-anslational initiation site 
ai nucleotide positions 160-162 and ending at the stop codon at nucleotide positions 763-765 (Figure 201). The 
predicted polypcpude precursor is 201 ammo acids long (Figure 202). The fiilMengih PRO 1082 protein shown in 
10 Figure 202 has an estimated molecular weight of about 22.563 daltons and a pi of about 4.87. Clone UNQ539 
(DNA539I2-1457) has been deposited with the ATCC. Regardmg the sequence, it is understood that the deposited 
clone coniains the concci sequence, and the sequences provided herein are based on known sequencmg techniques. 

Soil analyzing the ammo acid sequence of SEQ ID NO:477. the transmembrane domain is at about amino 
15 acids 45-65 of SEQ ID NO:477. A cAMP- and cGMP-dependent protein kinase phosphorylation sue is at about 
amino acids 197-200 of SEQ ID NO:477. N-mynstoylauon sues arc at about ammo acids 35-40 and 151-156 of SEQ 
ID N0 477. The regions which share sequence identity with the LDL receptor are at about anuno acids 34-67 and 
70-200 of SEQ D NO:477. The correspondmg nucleotides of Uiese ammo acid regions and others can be routinely 
detenmned given the sequences provided herein. 

20 

EXAMPLE 83 : Isolation of cDNA Clones Encoding Human PRO1 083 

A cDNA sequence was identified using the amylase screemng lechmque described in Example 2 above, 
wherein that cDNA sequence is designated herem as DNA24256 (Figure 205: SEQ ID NO:484). That cDNA 
sequence was then compared and aligned with other known EST sequencees as described in Example 1 above to 
25 obtam a consensus DNA sequence which is designated herem as DN A43422. Based on the DNA 43422 consensus 
sequence, oligonucleotides were synthesized: I) to identify by PCR a cDNA library that contained the sequence of 
mtcrest. and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO1083. 

A pair of PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5*.GGCATTGGAGCAGTGCTGGGTG-3' (SEQ ID NO:485): 
30 reverse PCR primer 5*-TGGAGGCCTAGATGCGGCTGGACG-3' (SEQ ID NO:486). 

in order to screen several libraries for a source of a full-length clone, DNA from the libranes was screened 
by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones 
encoding the PRO1083 gene using the reverse PCR pnmer. RNA for construction of the cDNA libraries was isolated 
from human fetal kidney tissue (LIB227). 
35 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for PRO 1083 

[herein designated as UNQ540 fDNA5092 1-1458)) (SEQ ID NO:482) and the derived protein sequence for PRO1083. 

The cndre nucleotide sequence of UNQ540 (DNA5092 1-1458) is shown in Figure 203 (SEQ ID NO:482) 
Clone UNQ540 (DNA5092 1-1458) contains a single open reading frame with an apparent iranslational initiauon site 
at nucleonde posidons 214-216 and ending at the stop codon at nucleoude positions 2293-2295 (Figure 203). The 
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predicted polypeptide precursor is 693 amino acids long iFigure 204). The ftiU-lengtPpRO1083 protein shown 



in 



Figure 204 has an esumated molecular weight of about 77.738 daltons and a pi of about 8.87. Clone UNQ540 
(DNA50921-1458) has been deposited with the ATCC. Regardmg the sequence, it is understood that the deposited 
clone contains the correct sequence, and the sequences provided herem are based on known sequencing techniques. 

StiU analyzing the anuno acid sequence of SEQ ID NO:483. the puianvc signal peptide is at about amino 
acids 1-25 of SEQ ID NO:483 The transmembrane domains arc at about amino acids 382-398. 402-420. 445-468 
47^491. 519-537. 568-590 and 634-657 of SEQ ID NO:483. A microbodics C-tcrminal targeting signal is at about 
amino acids 691^93 of SEQ ID NO:483. cAMP- and cGMP-dependem protein kinase phosphorylation sites are at 
about amino acids 198-201 and 370-373 of SEQ ID NO:483. N-glycosylation sites are at about amino acids 39-42. 
148-151, 171-174. 234-237, 303-306. 324-227 and 341-344 of SEQ ID NO:483. A G-pro.cm coupled receptor family 
domain is at about amino acids 475-504 of SEQ ID NO:483. The correspondmg nucleotides can be routinely 
determined given the sequences provided herein. 

EXAMPLE 84 : Isolaiinn nf rpfJ/^ r|^ne!i Fm-t^mg ^ uman PR02nn 

Probes based on an expressed sequence ug (EST) idenniied from the Incyte Pharmaceuucals database due 
to homology with VEGF were used to screen a cDNA library derived from the human glioma cell line G61. In 
particular. Incyte Clone ''INC1302516" was used to generate die following four probes: 
(SEQ ID NO:489) ACTTCTCAGTGTCCATAAGCG; 

(SEQ ID NO:490) GAACTAAAGAGAACCGATACCATmCTGGCCAGGTTGTC: 
20 (SEQ ID NO:491) CACCACAGCGTTTAACCAGG; and 
(SEQ ID NO:492) ACAACAGGCACAG1TCCCAC. 

Nine positives were idenufied and characterized. Three clones contained die ftill coding region and were 
identical in sequence. Paruai clones were also identified from a feial lung library and were identical with the 
glioma-derived sequence with the exception of one nucleotide change which did not alter the encoded amino acid. 



EXAMPLE 85: Ewression romtnici^ for PP(?7 ffl 

For mammalian proiem expression, die entire open reading frame (ORF) was cloned into a CMV-bascd 
expression vector. An epitope-tag (FLAG. Kodak) and HIsddine-tag (His8) were insened between the ORF and stop 
codon. VEGF-E-His8 and VEGF-E-FLAG were transfected into human embryonic kidney 293 cells by SuperFcct 
(Qiagcn) and pulse-labeled for 3 hours with [«S)methionine and P Ocysteine, Both epitope-tagged proteins 
co-migrate when 20 microliters of 15-fold concentrated serum-frec conditioned medium were eiectrophoresed on a 
polyacrylamide gel (Novex) in sodium dodecyl sulfate sample buffer (SDS-PAGE). The VEGF-E-IgG expression 
plasmid was constructed by cloning the ORF in from of die human Fc flgG) sequence. 

The VEGF-E-IgC plasmid was co-transfected with Baculogold Baculovirus DNA (Pharmingen) usmg 
UpofectinfGibcoBRL) into 10* Sf9 cells grown in Hink's TNM-FH medium (JRH Biosciences) supplemented with 
10% fetal bovine scrum. Cells were mcubated for 5 days at 28°C. The supernatant was harvested and subsequenUy 
used for die first viral amplification by infeamg Sf9 cells at an approximate multiplicity of infection (MOT) of 10 
Cells were incubated for 3 days, dien supernatant harvested, and expression of die recombinant plasmid determined 
by binding of 1 ml of supematani to 30 ul of Protein-A Sepharose CL-4B beads (Pharmacia) followed by subsequem 
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SDS-PAGE analysis. The first ampUficaaon supernatant was used to infect a 500 ml spinner culture of Sf9 cells 
grown in ESF-921 niedium (Expression Systems LLC) at an approximate MOl of 0. 1 . Cells were treated as above, 
except harvested supernatant was sterUe filtered. Specific protein was punfied by binding to Proiein-A Sepharose 
4 Fast Flow (Pharmacial column. 

PMftmle «6: Nnrihem Blot AtlfllYWif PRO^OO 

Blois of human poly(A)+ RN A from multiple adult and fetal tissues and tumor cell lines were obtained from 
Clomcch (Palo Alto. CA). Hybiidizaoon was carried out using '=P-labeled probes containing the enure coding region 
and washed in 0. 1 x SSC, 0. 1 % SDS at trC. 

VEGF-E mRNA was detectable in fetal lung, kidney, bram. liver and adult hean. placenta, liver, skeletal 
muscle, kidney, and pancreas. VEGF-E mRNA was also found in A549 lung adenocarcinoma and HeLa cervical 
adenocarcinoma cell lines. 



Fiiairrl f Hv b ririiraiinn of Human Fetal Tissuf ^r,n ' \m tftr PR02W 

Formalin-fixed, paraffin-embedded human fetal bram. liver, lower limb. smaU imcstme. thyroid, lymph 
15 node, thymus, stomach, trachea, skin, spleen, spinal cord, adrenal, placenta, cord, and adult liver, pancreas, lung, 
spleen, lymph node, adrenal, hean. aorta, and skin were sectioned, deparaffinized, deproteinated in proteinase K (20 
xig/ml) for 15 minutes at 3TC, and fimher processed for in sim hybridization as described by U LH and GUleo NA 
(CcM Vision 1: 169-176, 1994). A (c-^-PlUTP-labeled antisense riboprobc was generated from a PGR product of 
980 bp (primers GGCGGAATCCAACCTGAGTAG and GCGGCTATCCTCCTGTGCTC. SEQ ID NOS: 493 and 
20 494. respectively). The slides were dipped in Kodak NTB2 nuclear track emulsion and exposed for 4 weeks. 

VEGF-E mRNA expression included localization at die growth plate region and embracing fetal myocytes. 

^ifa jnpte ««r Mvocv ic Hvpenrophy A«av for PRO20Q 

Myocytes from neonatal Harlan Sprague Dawley rat hean vemncle (23 days gcsiation» were plated in 
25 duplicate at 75000 cells/ml in a 96-well plate. Cells were treated for 48h with 2000. 200. 20. or 2 ng/ml 
VEGF-E-IgG. Myocytes were stained widt crystal violet to visualize morphology and scored on a scale of 3 to 7. 
3 being nonstimulaied and 7 being fiill-blown hypertrophy. 

2000 ng/ ml and 200 ng/ ml VEGF-E caused hypenrophy. scored as a 5. 

30 FTamnte 89; Cell Pmliferatinn Assay for PR02QQ 

Mouse embryonic fibroblast C3H10T1/2 cells (AJCC) were grown in 50:50 Ham s F-12: low glucose 

DMEM medium containing 10% fetal calf scmm (FCS). Cells were plated in duplicate m a 24.well plate at 1000. 

2000. and 4000 cells/well. After 48 hours, cells were switched to medium containing 2% FCS and were incubated 

for 72 houn with 200. 800. or 2000 ng/ml VEGF-E or no growth fector added. 
35 Approximately 1 .5 fold greater number of cells were measured in the presence of 200 ng/ml VEGF-E as 

in its absence, at all three cell densities. 

Fifiinir'' Pnrtn'he""' ^"r^^^i Assay for PRQ200 
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Human umbiUcal veuj endothelial cells (HUVEC, Cell Systems) were mainiamcd in Complete Media (Cell 
Systems) and plated in tripUcaie in seium-tree medium <Basic Media from Cell Systems contauung 0.1% BSA) at 
:M.OOO cells/weU of a 48-weU plate. Cells were incubated for 5 days with 200 or 400 ng/ml VEGF-E-IgG. 100 ng/ml 
VEGF. 20 ng/ml basic FGF. or no addition. 

Survival was 2-3 times greater with VEGF-E as compared to lack of growth factor addition. VEGF and 
basic FGF were included as positive controls. 

EXAMPLE 91 : Isolation of rUf^f ^ ^l^p^^ Encndino Hnman Ppf^7f|^ 

A proprietaiy expressed sequence tag (EST) DNA database (UFESEQ'**. Incyte Pharmaceuticals. Palo Alto. 
CA) was searched and an EST (#2243209) was identified which showed homology to the Drvsophila ToU protein. 

Based on the EST, a pair of PCR pnmcrs (forward and reverse): 
TAAAGACCCAGCTGTGACCG (SEQ ID NO:499) 
ATCCATGAGCCTCTGATGGG (SEQ ID NO: 500). and 
a probe: 

ATTTATGTCTCGAGGAAAGGGACTGGTTACCAGGGCAGCCAGTTC (SEQ ID NO: 501) 
15 were synthesized. 

mRNA for constnicrion of the cDNA libraries was isolated from human placenta tissue. The cDNA libraries 
used ro isolate the cDNA clones were constructed by standard methods using commercially available reagents such 
as those from Invitrogen. San Diego. CA (Fast Track 2). The cDNA was pnmed with oligo dT containing a NoU 
site, linked with blum to Sail hemikinased adaptors, cleaved with NoU. sized appropriately by gel electrophoresis, 
and cloned in a defined orientation into the cloning vector pCR2.1 (Invitrogen. Inc.) usmg reagents and prowMls 
from life Technologies, Gaidiersburg, MD (Super Script Plasmid System). The double stranded cDNA was sized 
to greater than 1000 bp and the cDNA was cloned into BamHI/NotI cleaved vector. pCR2.l is a commeicially 
available plasmid. designed for easy cloning of PCR fragments, that carries AmpR and KanR genes for selection, 
and LacZ gene for blue-white selection. 

In order to screen several libranes for a source of a fiill-lengdi clone. DNA from the libraries was screened 
by PCR amplification with the PCR pruner pair identified above. A positive library was then used to isolate clones 
encoding the PR0285 gene using the probe oligonucleotide and one of the PCR primers. 

A cDNA ckine was sequenced in enurety. The emire nucleoude sequence of DNA40021-1 154 (encoding 
PR0285) is shown in Figure 208 (SEQ ID NO:495). Clone DNA40021.1 154 contains a single open reading frame 
with an apparem iranslaiional initiauon site at nucleotide positions 61-63 (Figure 208) The predicted polypeptide 
precursor is 1049 amino acids long, including a putative signal peptide at ammo acid positions 1-29, a putative 
transmembrane domain between amino acid positions 837-860. and a leucine zipper panem at amino acid positions 
132-153 and 704-725, respectively. It is noted that die indicated boundaries are approximate, and the actual limits 
of the indicated regions might differ by a few amino acids. Clone DNA40021-1 154 has been deposited with ATCC 
35 (designation: DNA40021-1 154) and is assigned ATCC deposit no.209389. 

Based on a BLAST and FasiA sequence alignment analysis (using the ALIGN computer program) of the fiiU- 
length sequence is a human analogue of the Drosophila Toll protem. and is homologous to die following human Toll 
proteins: ToUl (DNAX# HSU88540-1. which is identical with the random sequenced full-length cDNA 
«nJMRSC786-l); ToIC (DNAX# HSU88878-1): Toll3 (DNAX# HSU88879-1): and Toll4 (DNAX# HSU88880-1). 
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FVAMPLE 92 : >solanon of cP ^A rinncs EnrodmP Human PRQ286 

A propnetan* expressed sequence tag (EST) DNA database (UFESEQ™. Incytc Phannaceuiicals. Palo Alto, 
CA) was searched and an EST m9AAQ\) was idcnurled which showed homology lo the Drosophila Toll proiein. 



Based on ihe EST, a pair of PGR primers (forward and reverse): 
5 GCCGAGACAAAAACGTTCTCC (SEQ ID NO:502) 

CATCCATGTTCTCATCCATTAGCC (SEQ ID NO: 503), and 
a probe: 

TCGACAACCTCATGCAGAGCATCAACCAAAGCAAGAAAACAGTATT (SEQ ID NO: 504) 
were synthesized. 

10 mRNA for consmicrion of the cDNA libraries was isolated from human placenta tissue. This RNA was used 

to generate an oligo dT primed cDNA library in the vector pRK5D using reagents and protocols from Ufe 
Technologies, Gaithersburg. MD (Super Script Plasmid System). pRKSD is a cloning vector that has an $p6 
tianscnpdon initiaDon sue followed by an Sfil restriction enzyme site preceding the XhoI/NotI cDNA cloning sites. 
The cDNA was pruned wnh oligo dT containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved 
15 with Nod. sized to greater Own 1000 bp appropriately by gel electrophoresis, and cloned in a defined oneniation into 
XhoI/Notl-cleaved pRK5D. 

In order to screen several Ubraries for a source of a full-length clone, DNA from the libraries was screened 
by PGR amplification with the PGR primer pair idemificd above. A positive library was then used to isolate clones 
encoding die PR0286 gene using the probe oligonucleotide identified above and one of the PGR primers. 
20 A cDNA clone was sequenced in cnurety. The entire nucleotide sequence of DNA42663-1 154 (encoding 

PR0286) is shown in Figures 210A-B (SEQ ID NO:497). Clone DNA42663.1 154 contains a single open reading 
frame with an apparent translational iniuarion site at nucleotide positions 57-59 (Figure 211). The predicted 
polypeptide precursor is 1041 amino acids long, mcluding a putative signal peptide at amino acid positions 1-26, a 
potential transmembrane domain at amino acid posinons 826-848. and leucine zipper patterns at amino acids 130-151. 
25 206-227. 662-684. 669-690 and 693-614, respectively. It is noted that die indicated boimdaries are approximate, 
and the acniaJ limits of die indicated regions might differ by a few amino acids. Clone DNA42663-1 154 has been 
deposited widi ATGC (designation: DNA42663-1 154) and is assigned ATGG deposit no. 209386. 

Based on a BLAST and FastA sequence alignmcm analysis (using the ALIGN computer program) of die ftiU- 
leqgth sequence of PR0286. it is a human analogue of the Drosophila Toll proiein, and is homologous to die following 
30 human ToU proteins: Toll 1 (DN AXJ^ HSU88540. 1 . which is idenucal widi die random sequenced full-lengdi cDN A 
fflUMRSC786.i); ToII2 (DNAX# HSU88878-I); ToU3 (DNAX/f HSU88879.1): and Toll4 (DNAX# HSU88880-1). 



Example 93 : NF-kB Assay for PR0285 and PR0286 

As die Toll proteins signal dirough die NF-kB padiway, dieir biological activity can be tested in an NF-kB 
35 assay. In diis assay Jurkai ceils are transiently transfected using Lipofectamine reagent (Gibco BRL) according to 
dK manufacturer's instructions. 1/ig pB2XLuc plasmid. containing NF-KB-driven luciferase gene, is contransfccted 
with l^g pSRaN expression vector widi or without die insert encoding PR0285 or PR0286. For a positive control, 
cells are treated widi PMA (phorbol myrisiyl aceute: 20 ng/ml) and PHA (phytohaemaglunnin. 2Mg/ml) for three 
to four hours. Cells are lysed 2 or 3 days later for measurement of luciferase acuvity using reagents from Promcga. 
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E XAMP t. F. 94 : I so l ation Ot rDNA rionrs Encnrimp H.im.n PR02n.| PRQnsO and PROI449 

A consensus DNA sequence was assembled relative lo other EST sequences using phiap as described m 
Eumple 1 above. This consensus sequence is herein designated DNA28735. Based on the DNA28735 consensus 
sequence, oligonucleotides were synthesized. 1) to identify by PGR a cDNA library that contained the sequence of 
imeresi. and 2) for use as probes to isolate a clone of the fiill-length coding sequence for PR0213-1 . PRO1330 and/or 
5 PR01449. A pair of PGR pruners (forward and reverse) were synthesized: 

fOfwgrd PCR Primer 5'-TGGAGCAGCAATATGCCACCC-3' (SEQIDN0:5I1) 
reverse PCR nrimcr 5 -TnTCCACTCCTGTCGGGTTGG-3* (SEQID N0.512) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA28735 
sequence which had the following nucleotide sequence: 
10 hvbridi7aiion prnhe 

5 -GGTGACACTrGCCAGTCAGATGTGGATGAATGCAGTGCTAGGAGGG-3- (SEQ ID NO:513) 

In order to screen several Ubranes for a source of a full-length clone. DNA from the libraries was screened 
by PCR amplificarion with the PCR primer pair identified above. A positive library was then used to isolate clones 
encoding the PR0213.1. PRO1330 and/or PR01449 gene using the probe oligonucleotide and one of the PCR 
15 primers. RNA for consuucuon of the cDNA libraries was isolated from human fetal lung ussue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence encoding 
PR0213-1, PRG1330 and/or PR01449 (DNA30943-1-1 163-1 (SEQ ID NO:505). DNA64907-1 163-1 (SEQ ID 
NO:507) and DNA64908-1 163-1 (SEQ ID NO:509). respectively!. 

The entire imcleotide sequences corresponding to DNA30943-1-1163-! (SEQ ID NO:505), DNA64907- 
1163-1 (SEQIDNO:507)and DNA64908.1163-1 (SEQ ID NO:509). respectively. DNA30943.1 163. DNA64907. 
1 163-1 and DNA64908-1 163-1 contain a single open reading frame with an apparent transladonal initiation site at 
nucleotide positions 336-338. 488-190 and 32«28. respectively, and ending at the stop codon at nucleotide posiuons 
1221-1223. 1307-1309 and 1145-1147. respecDvely (Pigures212, 214 and 216). The predicted polypeptide precursor 
U 295. 273 and 273 amino acids long, respectively (Figures 213. 215 and 217) DNA30943-I-1 163-1. DNA64907- 
1163-1 and DNA64908-1163-1 have been deposited with ATCC and are assigned ATCC deposit no. 209791, 203242 
and 203243. respectively. 

Analysis of the amino acid sequence of the full-length PR0213-1 polypeptide suggests thai a ponion of it 
possess significant homology to the human growth arrest-specific gene 6 protein. More specifically, an analysis 
of the Dayhoff database (version 35.45 SwissProt 35) evidenced significant homology between the PR0213 amino 
30 add sequence and the following Dayhoff sequences, HSMHC3W5A_6 and B48089. 

Additional analysis of the amino acid sequence of the fiill-length PROI330 and PR01449 polypeptide 
indicates significant idenUty with notch4. l^ore specifically, an analysis of the Dayhoff database (version 35.130 
SwissProt 35) evidenced significant identity between PRO1330 and die following Dayhoff sequences. D86566_l and 



NEL HUMAN. 



EXAMPLF, 95 : Isolation of rPNA rinne< Pn c cdmo H.iman P^(;}'>Q^ 

A cDNA isolated in the amylase screen described in Example 2 above is herein designated DNA26832 
(Figure 220: SEQ ID NO:516). The sequence of DNA26832 was then used to search expressed sequence tag (EST) 
databases. TTie EST databases included public EST daobases (e.g.. GenBankt and a propnetary EST database 
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(LIFESEQ'". Incyte Pharmaccuucals. Palo Alto. CA) The search was performed using the computtr program 
BLAST or BLAST2 (Altshul et aJ.. |^Mhfvi< in FnTvmologv M- 469-480 (1996)). Those comparisons resuloog m 
a BIj\ST score of 70 (or m some cases 90) or greater that did noi encode proieins were clustered and assembled mio 
consensus DNA sequences with the program -phrap' (Phil Green. University of Washington. Seattle. Washington: 
http://bozeinan.mbt.washington.edu/phrap.docs/phrap.hiral). 
5 A consensus DNA sequence was assembled relauve to other EST sequences usmg phrap. A consensus 

sequence was determined, which was then extended usmg repeated cycles of BLAST and phrap to extend the 
consensus sequence as far as possible using the sources of EST sequences discussed above. The extended assembly 
sequence was designated DNA35861.Based on the DNA35861 consensus sequence, oligonucleotides were 
sywhesiied: 1) to idenufy by PCR a cDNA library that coniamed the sequence of interest, and 2) for use as probes 
10 to isoUte a clone of the full-length coding sequence of PR0298. Forward and reverse primers generally range from 
20 to 30 nucleotides and are often designed to give a PCR product of about 100-1000 bp in length. The probe 
sequence is typically 40-55 bp in length. In some cases, additional oligonucleoudes are synthesized when the 
consensus sequence is greater than about 1-1 .5kbp. In order to screen several libraries for a fulMength clone. DNA 
from the libraries was screened by PCR amplification, as per Ausubel ei al., CHrrcm PfWWClS in MolCCWiar BlQlOgY . 
15 with the PCR primer pau. A positive Ubraiy vras used to isolate clones encoding the gene of interest usmg the probe 
oligonucleotide and one of the primer pairs. 

PCR pnmers (forward and reverse) and a hybndizanon probe were synthesized: 
fnp^rA PTR Primer I CAACGTGATnTCAAAGCTGGGCTC (SEQ ID NO:517) 
fnrwarri PTR pnmer 2 GCCTCGTATCAAGAATTTCC (SEQ ID N0:5 18) 
20 f prwafrt PCR nrimer 3 AGTGGAAGTCGACCTCCC (SEQ ID N0:5 19) 

r-v^r^e PPR Primer 1 CTCACCTGAAATCTCTCATAGCCC (SEQ ID NO:520) 

hvhririi7.-,..nn probe 1 CGCAAAACCCATnTGGGAGCAGGAATTCCAATCATGTCTCTGATGGTGGCSEQ 
ID NO:521) 

In order to screen several libraries for a source of a full-length clone. DNA from the libraries was screened 
25 by PCR amplification with the PCR primer pair identified above. A positive library was then used to isolate clones 
encoding the PR0298 gene using the probe oligonucleotide and one of the PCR primers. 

RNA for construction of the cDNA Ubrancs was isolated from human fetal lung ussue (LIB25). The cDNA 
libraries used to isolated the cDNA clones were constructed by standard methods using commercially available 
reagents such as those from tovitrogen. San Diego. CA. The cDNA was primed with oligo dT containing a NotI site. 
30 linked with blunt to Sail henukinased adaptors, cleaved with Noil, sized appropriately by gel electrophoresis, and 
cloned in a defined orientation into a suitable clonmg vector (such as pRKB or pRKD: pRK5B is a precursor of 
pRK5D that does not contam the Sfil site: see. Holmes et al.. Sosnsfi. 252 1278-1280 (1991)) in the unique Xhol 
and NotI sites. 

DNA sequencing of the clones isolated as described above gave the ftill-length DNA sequence for PR0298 
35 (herem designated UNQ261 (DNA39975-12101) (SEQ ID NO:514). and the derived proiera sequence for PR0298 
(SEQ ID N0:51S). 

The ennre nucleonde sequence of UNQ261 (DNA39975-1210) is shown m Figure 218 (SEQ ID N0:514) 
Clone DNA39975-1210 contains a single open reading frame with an apparent translanonal initiauon sue at nucleotide 
positions 375-377. The predicted polypeptide precursor is 364 ammo acids long. The protem contains four putauve 
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transmembrane domains between anuno acid posiuons 36-55 (type II TM). 65-84. 188-208, and 229-245. 
respectively. A putauve N-l,nked glycosylauon site starts at amino acid pos.uon 253. In addiuon. the following 
features have been idcnufied .n the protem sequence: cAMP- and cGMP-dependent protein kinase phosphorylation 
site, starting at position 8. N-mynstoylation sites startmg a position 173 and 262. respectively: and a ZP domain 
between ammo acid positions 45-60. Clone DNA39975.1210 has been deposited with ATCC (April 21 . 1998) and 
is assigned ATCC deposit no.209783. 

EXAMPLE % : i SPlaUOn of cDNA none. Pr^ , ^\, ^ Human PRnH7 

A cDNA sequence identified m the amylase screen described in Example 2 above is herein designated 
DNA4230I (Figure 223. SEQ ID NO:524). TT,e DNA42301 sequence was then compared to other EST sequences 
using phrap as described in Example 1 above and a consensus sequence designated herem as DNA28761 was 
idemified. Based on this consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA 
library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding 
sequence. In order to screen several libraries for a source of a full-length clone. DNA from the libraries was 
screened by PCR amplificanon with the PCR pnmer pair idennfied above A positive library was then used to isolate 
clones encoding the PR0337 gene using the probe oligonucleotide and one of the PCR primers. RNA for 
construction of the cDNA libraries was isolated from human fetal brain. 

A cDNA clone was sequenced in its entirety. The full length nucleotide sequence of DNA433 16-1237 is 
shown in Figure 221 (SEQ ID NO:522). Clone DNA43316-1237 contains a single open reading frame with an 
apparent translational initiauon site at nucleotide posiuons 134-136 (Figure 221; SEQ ID NO:522). The predicted 
polypeptide precursor is 344 amino acids long. Clone DNA43316.1237 has been deposited with ATCC and is 
assigned ATCC deposit no. 209487 

Based on a BLAST-2 and FastA sequence alignment analysis of die ftill-length sequence. PR0337 shows 
amino acid sequence identity to rat neuroirimin (97%). 

25 EXAMPLE 97 : ISPlatiOn of cDNA Clone«; Fnrn d ino Hnm;.n PRfM(>t 
Introdiicfinn - 

Human thrombopoienn (THPO) is a glycosylated hormone of 352 amino acids consisting of two domains. 
The N-iciminal domain, sharing 50% similarity to erythropoietin, is responsible for the biological acuvity. The C- 
tenninal region is required for secreuon. TTie gene for tiuombopoierin (THPO) maps to human chromosome 3q27- 
q28 where the six exons of this gene span 7 kilobase base pairs of genomic DNA (Chang et al.. Genomics 26: 636-7 
(1995): Foster eial.. Proc. Natl. Acad. Sci. USA 91: 13023-7 (1994): Gumey ei al.. Blood 8S: 981-988 (1995). In 
order to determine whether there were any genes encodmg THPO homoiogues located in close proximity to THPO. 
genomic DNA fragments from this region were identified and sequenced. Three PI clones and one PAC clones 
(Genome Systems Inc., St. Uuis. MO: cat. Nos, Pl-2535 and PAC-6539) encompassing the THPO locus were 
isolated and a 140 kb region was sequenced usmg die ordered shotgun strategy (Chen et al.. Genomics 17: 651-656 
(1993)). coupled with a PCR-based gap filling approach Analysis reveals that the region is gene-rich witii four 
additional genes located very close to THPO: tumor necrosis factor-receptor type 1 associated protem 2 (TRAP2) 
and ekHiganon initiation bear gamma felF40. chloride channel 2 (CLCN2) and RNA polymerase II subunit hRPBI7. 
While no TWO homolog was found in die region, four novel genes have been predicted by computer-assisted gene 
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detccDon (GRAIL)(Xu ct al.. Gen. Encin. 16 241-253 (1994). the presence of CpG islands (Cross. S. and Bird. A.. 
Curr. Opin. Genci, & DcveL 5 109-314 (1995). and homology to known genes las detected by WU- 
BLAST2.0)(Alischul and Gish, Methods Enzymol. 266: 460^80 (1996) (http://'biasi.wustl.edu;'blast/README.hiinl). 
Procedures: 

PI M PAC Clones; 

5 The initial fauinan PI clone was isolated from a genomic PI library (Genome Systems Inc., St. Louis, MO: 

cat. no.: Pl-2535) screened with PCR primers designed from the THPO genomic sequence (A.L. Gumcy, cl al.. 
Blood 85: 981-88 (1995). PCR pnmers were designed from the end sequences derived from this PI clone were then 
used to screen PI and PAC libraries (Genome Systems. Cat. Nos.: Pl-2535 & PAC-6539) to identify overlapping 
clones (PACl. pl.t, and Pl.u). The 3'-end sequence from PAC.z was used to define the primers used for the 

10 screening of a human BAC library (Genome Systems Inc., St. Louis, MO; Cat. No.: BDTW-4533A). 
^T^'-rr^ 5;hoigun Strategy: 

The Ordered Shotgun Stiaicgy (OSS) (Chen ei al.. Genomics 17: 651-656 (1993)) involves the mapping and 
sequencing of large genomic DNA clones with a hierarchical approach. The PI or PAC clone was sonicated and the 
fragments subcloned mto lambda vector (ABluestar) (Novagcn, inc.. Madison, WI: cat. no. 69242-3). The lambda 

15 subclone mserts were isolated by long-range PCR (Barnes. W. Proc. Natl. Acad. Sci. USA 91: 2216-2220 (1994) 
and die ends sequenced. The lambda-end sequences were overlapped to create a partial map of the original clone. 
Those lambda clones with overlapping end-sequences were idendfied. the insets subcloned into a plasmid vector 
(pUC18 or pUCl9. Hoefcr Pharmacia Biotech. Inc., San Francisco. CA, Cat. Nos. 27-4949-01 and 27^951-01) and 
the ends of the plasmid subclones were sequenced and assembled to generate a conuguous sequence. This directed 

20 sequencing strategy minimizes the redundancy required while allowing one to scan for and concentrate on interesting 
regions. 

In order to define better the THPO locus and to search for other genes related to the hemaiopoieun family, 
five genomic clones were isolated from this region by PCR screening of human PI and PAC libraries (Genome 
System, Inc., Cat. Nos.: Pl-2535 and PAC-6539). 

25 The sizes of the genomic fragments are as follows: Pi .t is 40 kb: PI .g is 70 kb; PI .u is 70 kb: PAC.z is 200 kb; and 
BAC.l is 80 kb. Approximately 75% (140 kb) of die 190 kb genomic DNA region was sequenced by the Ordered 
Shotgun Strategy (OSS) (Chen et al.. Genomics 17: 651-56 (1993). and . assembled into comigs using 
AuioAssemblerTM (Applied Biosysiems. Perkin Elmer, Foster City. CA, cat. no, 903227). The preliminary order 
of these comigs was determined by manual analysis. There were 47 comigs the 140 kb region. A PCR-based 

30 approach to ordering the comigs and filling in the gaps was employed. The following summarizes die number and 
sizes of the gaps. The 50 kb of sequence unique to BAC.l was sequenced by a total shotgun approach with a ten-fold 



redundancy. 






number 


<50 bp 


13 


50-150 bp 


7 


150-300 bp 


7 


300-1000 bp 


10 


1000-5000 bp 


7 


> 5000bp 


2 ((15.000 bp) 
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DNA seauencinp- 

ABI DYE-pnincrTM chemistry (PE Applied Biosyscems. Foster City. CA; Cat. No.: 4021 12) was used to 
end-scquctice the lambda and plasmid subclones. ABI DYE-ientunaterTM chemistry (PE AppUed Biosystcms. Foster 
City. CA. Cat. No 403044) was used to sequence the PCR produce with their respecove PCR pnmers. TTic 
sequences were coUected with an ABD77 insoumeiu. For PCR products larger than Ikb. walking pnmers were used. 
5 The sequences of condgs generated by the OSS strategy m AutoAssemblerTM (PE Applied Biosystems. Foster City. 
CA: Cat. No: 903227) and the gap-filling sequencing trace files were imponed into SequencherTM (Gene Codes 
Corp., Ann Arbor. MD for overlappmg and editing. The sequences generated by the total shotgun strategy were 
assembled using Phred and Phrap and edited using Consed (http://chimeta.biotech.washington.eduAiwgc/project$.htm) 
and GFP (Genome Reconstruction Manager for Phrap). version 1.2 (http://$tork.cellb.bcm.in»c.cdu/gfp/). 
10 PCR-Based pa p filling ^tntt-fv - 

Primers were designed based on the 5' - and 3 '-end sequenced of each conrig. avoiding repetitive and low 
quality sequence regions. All primers were designed to be l9-24-tncrs with 50-70% G/C contem. Oligos were 
synthesized and gel-purifled by standard methods. 

Since the oneniaoon and order of the conugs were unknown, permutations of the pnmers were used in the 
15 amplification rcacuons. Two PCR kits were used: first. XL PCR kit (Perkin Elmer. Norwalk. CT. Cat. No.: 
N8080205). with extension tunes of approximately 10 minutes: and second, the Taq polymerase PCR kit (QUgen 
Inc.. Valencia. CA: Cat. No.: 201223) was used under high stringency conditions if smeared or multiple products 
were observed with the XL PCR kit. The mam PCR product from each successful reaction was extracted from a 
0.9% low melting agarose gel and purified with the Geneclean DNA Purification kit prior to sequencing. 

20 Ana|Y !}j|f 

The identification and charactcrizauon of coding regions was carried out as follows: First, repetitive 
sequences were masked using RepeatMasker (A.F.A. Smit & P Green, 
http://ftp.genome.washington.edu/RM/RM_details.html) which screens DNA sequences in FastA format against a 
libraiy of rcpeoiive elements and renmis a masked query sequence. Repeats not masked were identified by comparing 
25 the sequence to the GenBank database using WUBLAST2.0 f Altschul. S & Gish. W.. Methods Enzymol. 266: 460- 
480 (1996); http://blasi.wusd.edu/blast/README.huni] and were masked manually. 

Next, knowm genes vnn revealed by comparuig the genomic regions against Genenttch's protem dacibase 
using the WUBLAST2.0 algorithm and then annotated by aligning the genomic and cDNA sequences for each gene, 
respectively, usuig a Needleman-Wunch (Needleman and Wunsch. J. Mol. Biol. 48: 443-453 (1970) algoriihm to find 
regions of local identity between sequences. The strategy results m detection of all exons of the five known genes 
in the region. THPO. TRAP2. elF4g. CLCN2 and hRPBI? (see below). 
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eukaryotic translation initiation factor 4 gamma 3q27-qter 

35 thrombopoietm 3q26-q27 

chloride channel 2 3q26-<jter 

TNF receptor associated protein 2 „« previously mapped 

RNA polymerase II subunit hRPBl 7 „o, previously mapped 
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Finally, novel iranscnpiion units were prcdicicd using a number oi approaches. CpG islands (S. Cross & 
Bird, A,, Cun. Opin. Genci. Dev. 5: 109-314 (1995) islands were used lo define promoicr regions and were 
idcnrificd as clusters of sites cleaved by enzymes recognizing GC-rich. 6 or 8-mer palmdromic sequences (Nod. Narl, 
BssHn. XhoL CpG islands are usually associated with promoter regions of genes WUBLAST2.0 analysis of shon 
genomic regions (10-20 kb) versus GenBank revealed matches to ESTs. Tlie individual EST sequences (or where 
5 possible, their sequence chrotnaiogram files) were retrieved and assembled with Sequencer to provide a theoretical 
cDNA sequence (DNA36443). GRAIL2 (ApoCom Inc., KnoxviUc, TN. command line version for the DEC alpha) 
was used lo predict a novel exon. The five known genes in die region served as internal controls for the success of 
the GRAIL algorithm. 
Uatarion : 

10 A partial cndoihclin convcnmg enzyme-2 (ECE-2) cDNA clone was isolated by first spUcing in silico the 

ECE-2 cxons predicted in the genomic sequence to generate a putative sequence (DNA36443). An oligonucleotide 
probe: GAAGCAGTGCAGCCAGCA(JrAGAGAGCjCACCTGCTAAGA) fSEQ ID NO:530) was designed and used 
to screen a human fetal small intestine library (LIBllO) and internal PCR primers (3d443fl) 
(ECE2.f:ACGCAGCTGGAGCTGGTCTTAGCA) (SEQ ID NO:53l) and (36443rl) (ECE2.r) 

15 (GGTACTGGACCCCrrAGGGCCACAA) (SEQ ID NO:532) were used to confirm clones hybridizing to the probe 
prior to sequencing. One positive clone was obtained, however this cDNA (DNA49830) represented a partially 
spliced transcript containing appropriately spliced exons 1 through 6, followed by intron 6 sequence. The oligo dT 
primer annealed to a polyA-stretch within an Alu element present in iniron 6. An additional ECE-2 cDNA fragment 
{DNA49831) was obtained by PCR from a human fetal kidney library (LIB227) with primers designed from the 

20 presumed cDNA sequence [364430: CCTCCCAGCCGAGACCAGTGG (SEQ ID NO:533) and 36443r2: 
GGTCCTATAAGGGCCAAGACC (SEQ ID NO:534)]. This PCR product extended from exon 13 into the 3' 
untranslated region in exon 18. 

A full length endoihelin convcnmg enzyme 2 (ECE-2) cDNA clone (DNA55800-1263> was isolated from 
an oligo-dT-primcd human fetal brain library. RNA from human letal brain tissue (20 weeks gesuaon. 

25 /^283005)(SRC175) was isolated by guanidine ihiocyanaie and 5 Mg used lo generate double stranded cDNA which 
was cloned into the vector pRKSE. The 3* -primer 

(pGACTAGTTCTAGATCGCGAGCGGCCGCCCrn i I'l 1 1 U 1 1 1 11) (SEQ ID NO:535) and the 5 -linker 
(pCGGACGCGTGGGTCCA) (SEQ ID NO:536) were designed to introduce Xhol and Noll restriction sites. The 
library was screened with PCR primers f36443pcrfl: CGGCCGTGATGGCTGGTGACG (SEQ ID NO:537) and 

30 36443r3: GGCAGACTCCTTCCTATGGG (SEQ ID NO:538)] designed from the partial human ECE-2 cDNA 
sequences (DNA49830 and DNA49831). PCR products were cloned into the vector pCR2.1-T0P0 {Invitrogen 
Corp., Carlsbad, CA, Cat. No. K4500-01) and sequenced with DYE-terminator chemistry as described above. 

FXAMPLE 98 ! fin^h'-m ^\c^i and m situ RNA Hybridization AnaWsis for PRQ403 
35 Expression of PRO403 mRNA in human tissues was examined by Nordiem blot analysis. Human polyA + 

RNA blots derived from human fetal and adult nssues(aontcch, Palo Alto, CA, Cat. Nos. 7760-1,7756-1 and 7755- 
1) were hybridized to a f32P-a)dATP-labcllcd cDNA fragments from probe based on the full length PRO403 cDNA. 
Blots were incubated with the probes m hybridization buffer (5X SSPE: 2X Dcnhardi s solution; 100 mg/mL 
denatured stored sahnon sperm DNA: 50% formamidc: 2% SDS) for 18 hours at 42X. washed to high stringency 
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(O.IXSSC. 0 l?o SDS. SOX) and autoradiographed. The biois were developed after overnight exposure by 
phosphonmager analysis (Fuji). 

PRO403 mRNA iranscnpts were detected. Analysis of the expression pattern showed ihe strongest signal 
of the expected 3.3 kb transcnpi in adult brain (highest in the cerebellum, puiamen. medulla, and temporal lobe, and 
lower m the cerebral concx. occipital lobe and frontal lobe), spinal cord, lung and pancreas and higher levels of a 
4.5 kb transcript in fetal bram and kidney. 

EXAMPI.F 99 : Usg of PRO PolVPfntidf -Encoding Nucleic Acid a. Hvhn dization Prnhg^ 

The foUowuig method describes use of a nucleondc sequence encoding a PRO polypepdde as a hybridizanon 

probe. 

DNA comprismg the coding sequence of of a PRO polypeptide of interest as disclosed herein, may be 
employed as a probe or used as a basis from which to prepare probes to screen for homologous DNAs (such as those 
encoding namrally-occurring variants of ihc PRO polypeptide) in human tissue cDNA libraries or human tissue 
genomic libraries. 

Hybndizaaon and washing of filters coniammg either library DNAs is performed under the followmg high 
15 stringency conditions. Hybridizanon of radiolabeled PRO polypeptide-encoding nucleic acid-derived probe to the 
filters IS performed m a solution of 50% formamide, 5x SSC. 0.1% SDS, 0.1% sodium pyrophosphate, 50 mM 
sodium phosphate. pH 6.8. 2x Denhardt's solution, and 10% dexiran sulfate at 42-C for 20 hours. Washing of the 
filters is performed in an aqueous solution of 0. 1 x SSC and 0. 1 % SDS at 42"'C. 

DNAs having a desired sequence identity with die DNA encoding fiilWengih native sequence PRO 
20 polypeptide can then be identified using standard techniques known in the an. 

EXAMPLE |0Q: Ej^prcssion of PRO Pnlvpepii des in E. colt 

This example illustrates preparanon of an unglycosylaied form of a desired PRO polypeptide by recombinant 
expression m £. coii. 

25 The DNA sequence encoding the desired PRO polypepnde is miually amplified using selected PCR pruners. 

The primers should contain restriction enzyme sites which correspond to the restriction enzyme sues on the selected 
expression vector. A variety of expression vectors may be employed. An example of a suitable vector is pBR322 
(derived from £. coin see Bolivar et al.. QaK, 2:95 (1977)) which contains genes for ampicillm and tetracycline 
resistance. The vector is digested with restricuon enzyme and dephosphorylatcd. The PCR amplified sequences are 
then ligated into the vector. The vector will preferably include sequences which encode for an anubiotic resistance 
gene, a irp promoter, a polyhis leader (including the first six STII codons, polyhis sequence, and cmcrokinase 
cleavage sue), the specific PRO polypeptide coding region, lambda transcriptional terminator, and an argU gene. 

The ligation mixture is then used to transform a selected £. coU strain usmg the methods described in 
Sambrook et al., siipQ. Transformants are identified by their ability to grow on LB plates and anubiotic resistant 
35 colonies are dien selected. Plasmid DNA can be isolated and confirmed by resinction analysis and DNA sequencing. 

Selected clones can be grown overnight in liquid culnire medium such as LB broth supplemented with 
amibioiics. The overmght culmre may subscquemly be used to moculate a larger scale culwre. The cells are then 
grown to a desired optical density, durmg which the expression promoter is turned on. 
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After culninnfi ihc cells for several more hours, the cells can be harvested by ceninfugauon. The cell pellet 
obiaincd by ihc ceninfugauon can be solubilized usmg vanous agcnis known m ihc an, and the solubilized PRO 
polypeptide can then be purified using a metal chclaung column under conditions that allow tight binding of the 
protein. 

PR018K PR0195. PRO200. PR0237. PR0273, PRO540. PR0322. PRO1017. PR0938, PR0162, 
PR01114. PR0827 and PRO1008 were expressed in £. coli in a poly-His tagged form, using the following 
procedure. The DNA encoding the PRO polypeptide was initially amplified using selected PGR pmncrs. The 
primers contained resmcdon enzyme sites which correspond to the resinction enzyme sues on the selected expression 
vector, and other useful sequences providing for efficient and reliable translation iniiiauon, rapid purification on a 
metal chelation cohunn, and proieolync removal with enicrokinase. T\\t PCR-amplified, poly-His tagged sequences 
were then ligaied into an expression vector, which was used to transform an E. coli host based on strain 52 (W3 1 10 
fuhA(tonA) Ion galE rpoHisflirpRts) clpPOaclq). Transformants were first grown in LB coniaming 50 mg/ml 
carbcmciUin ai 30*^0 with shaking until an O.D.600 of 3-5 was reached. Cuiwres were then diluted 50-100 fold into 
CRAP media (prepared by mixing 3,57 g (NHO^SO,. 0.71 g sodium curaie-2H20. 1 .07 g KCl. 5.36 g Difco yeast 
extract. 5 36 g Sheffield hycase SF in 500 mL water, as well as 1 10 mM MPOS, pH 7.3. 0.55% (w/v) glucose and 
7 mM MgSO^) and grown for approximately 20O0 hours at 30X wiih shakmg. Samples were removed lo verify 
expression by SDS-PAGE analysis, and the bulk culwre is cemrihiged to pellet the cells. Cell pellets were frozen 

until purification and refolding. 

E. coli paste from 0.5 to 1 L fermentations (6-10 g pellets) was resuspcndcd in 10 volumes (w/v) in 7 M 
guanidine. 20 mM Tris. pH 8 buffer. Solid sodium sulfite and sodium ictrathionate is added to make final 
concentrations of O.IM and 0.02 M, respectively, and the solution was stirred overnight at 4*C. This step results 
in a denatured protein with all cysteine residues blocked by sulfiiolizauon. The solution was cenirifiiged at 40,000 
rpm in a Beckman Ultracenafuge for 30 min. The supernatant was diluted with 3-5 volumes of metal chelate column 
buffer (6 M guanidinc. 20 mM Tris, pH 7,4^ and fihercd through 0.22 micron filters to clarify. Depending the 
clarified extract was loaded onto a 5 ml Qiagcn Ni-NTA metal chelate column equilibrated in the metal chelate 
column buffer. The column was washed with additional buffer containing 50 mM imidazole (Calbiochem, Utrol 
grade), pH 7.4. The protcm was cluted with buffer containing 250 mM imidazole. Fractions containing the desired 
proicm were pooled and stored at 4*C. Protem concentration was estimated by its absorbancc at 280 nm using the 
calculated extinction coefficient based on its amino acid sequence. 

The proteins were refolded by diluting sample slowly into freshly prepared refolding buffer consisting of: 
20 mM Tris, pH 8.6, 0.3 M NaCl, 2.5 M urea, 5 mM cysteine, 20 mM glycine and I mM EDTA. Refolding 
volumes were chosen so that the final protein concentration was between 50 to 100 micrograms/ml. The refolding 
solution was srirred gently at 4*C for 12-36 hours. The refolding reaction was quenched by the addition of TFA to 
a final concentration of 0.4% fpH of approximately 3). Before ftinher purification of the protcm. the solution was 
filtered through a 0.22 micron filter and aceiomirilc was added lo 2-10% fuval concentration. The refolded proicm 
was chromatographed on a Poros Rl/H reversed phase column using a mobile buffer of 0. 1 % TFA with cluiion with 
a gradiem of acetonitnle from 10 to 80%. Aliquots of fractions with A280 absorbance were analyzed on SDS 
polyacrylamide gels and fractions containing homogeneous refolded protein were pooled. Generally, the properly 
refolded species of most proteins arc cluted at the lowest concentrations of acctomirilc since those species are the 
most compact with their hydrophobic inicnors shielded from interaction with the reversed phase resm. Aggregated 
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speoes are usuaUy eluted at higher acetommle concemranons. in addition to resolving misfolded forms of proteins 
from the desired form, the reversed phase step also removes endoiojun from the samples. 

Fractions containing the desired folded PRO proteins were pooled and the acetonitnle removed using a 
gentle stream of nitrogen directed ai the solunon. Proteins were formulated into 20 mM Hepes. pH 6.8 with 0. 14 
M sodium chloride and 4% maanitol by dialysis or by gel filiraoon usmg G25 Superfine (Pharmacia) resins 
equilibrated in the formulanon buffer and stenle filtered. 

EXAMPLE 101 : Expression nf PRO Pnlvn,>p..rtes in Mammaha^ rj.\ \ ^ 

This example illustrates preparation of a glycosylated form of a desired PRO polypeptide by recombinant 
expression in mammalian cells. 

The vector, pRK5 (see EP 307.247. published March 15. 19891. is employed as the expression vector. 
Optionally, the PRO polypcpnde-encoding DNA is ligated into pRK5 with selected restriction enzymes to aUow 
insertion of the PRO polypeptide DNA usmg ligation methods such as described in Sambrook « al.. wm- The 
resulting vector is called pRK5-PRO polypeptide. 

In one embodiment, the selected host cells may be 293 cells. Human 293 cells (ATCC CCL 1573) are 
grown to confluence m tissue culture plates m medium such as DMEM supplemented with fetal calf serum and 
optionally, nutriem components and/or antibioucs. About 10 m pRKS-PRO polypeptide DNA is mixed with about 
I Mg DNA encoding the VA RNA gene frhimmappaya et al.. £dl. 21:543 (I982)| and dissolved in 500 mI of 1 mM 
. Tris-HQ, 0.1 mM EDTA, 0.227 M CaCL. To this mixnire is added, dropwise. 500 ^1 of 50 mM HEPES (pH 7.35). 
280 mM NaCl. 1.5 mM NaPO,. and a precipitate is allowed to form for 10 mmutes at 25"C. The precipitate is 
suspended and added to the 293 cells and allowed to settle for about four hours at ST-C. The culmre medium is 
aspirated off and 2 ml of 20% glycerol in PBS is added for 30 seconds. The 293 cells are Uien washed wid» serum 
free medium, fresh medium is added and the cells are incubated for about 5 days. 

Approximately 24 hours after the transfections. the culmre medium is removed and replaced with culmre 
medium (alone) or culture medium containing 200 ^Ci/ml "S-cysteme and 200 /iCi/ml "S-methionine. After a 12 
hour incubation, the condidoned medium is collected, concentrated on a spin filter, and loaded orao a 15% SDS gel. 
The processed gel may be dried and exposed to film for a selected period of time to reveal the presence of PRO 
polypeptide. The cultures containing transfected cells may undergo further incubation (in scnim free medium) and 
the medium is tested in selected bioassays. 

In analieniaiive technique. PRO polypeptide may be introduced into 293 cells transiently using the dextran 
sulfate method described by Somparyrac et al.. Proc. Natl Ac^a 12:7575 (1981). 293 cells are grown to 
maximal density in a spinner flask and 700 ^g pRKS-PRO polypeptide DNA is added. The cells are first eoncemiated 
from die spinner flask by centrifiigarion and washed with PBS. TTie DNA^extran precipitate is incubated on the cell 
pellet for four hours. The cells are treated widi 20% glycerol for 90 seconds, washed with tissue culnire medium, 
and re-imroduced into the spinner flask comaming tissue culmre medium. 5 /xg/ml bovine insulin and 0.1 fig/ml 
bovine transferrin. After about four days, the conditioned media is centnfuged and filtered to remove cells and 
debris. The sample containing expressed PRO polypeptide can then be concentrated and purified by any selected 
mediod. such as dialysis and/or column chromatography. 

In another embodiment. PRO polypeptides can be expressed in CHO cells. The pRK5-PR0 polypeptide 
can be transfected mio CHO cells using known reagents such as CaPO, or DEAE-dexiran. As described above, the 
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ceil culrures can be incubated, and the mcdHun replaced with culture medium (alone) or medium containing a 
radiolabcl such as ''S-mtthionine. After determining the presence of PRO polypeptide, the culture medium may be 
replaced with serum free medium. Preferably, the culnires are incubated for about 6 days, and then the conditioned 
medium is harvested. The medium containing the expressed PRO polypeptide can then be conccrarated and purified 
by any selected method. 

5 Epitope-tagged PRO polypeptide may also be expressed in host CHO cells. Tlie PRO polypeptide may be 

subcloned out of the pRK5 vector. The subclone insert can undergo PCR to fiise in frame with a selected epitope 
tag such as a poly-his tag into a Baculovirus expression vector. The poly-his tagged PRO polypeptide insen can then 
be subcloned into a SV40 driven vector containing a selection marker such as DHFR for selecnon of stable clones. 
Finally, the CHO cells can be iransfected (as described above) with the SV40 dnven vector. Ubeling may be 

10 performed, as described above, to verify expression. The culture medium containing the expressed poly-His tagged 
PRO polypcpude can then be concentrated and purified by any selected method, such as by Ni=*-chelate affinity 
chromatography. 

Stable expression in CHO cells was performed usuig the following procedure. The proteins were expressed 
as an IgG construct (immunoadhesin). in which the coding sequences for the soluble forms (e.g. cxtracelltilar 
15 domains) of the respective proteins were fiised to an IgGl constant region sequence containing the hinge. CH2 and 
CH2 domains and/or is a poly-His tagged form. 

Following PCR amplification, the respective DNAs were subcloned in a CHO expression vector using 
standard techniques as described in Ausubel et al.. Current Proiocots of Molecular Biology. Unit 3.16. J<*n Wiley 
anl Sons (1997). CHO expression vectors are constnicted to have compatible restriction sites 5" and 3' of the DNA 
20 of interest to allow the convenient shuttling of cDNA's. TTic vector used expression in CHO cells is as described 
in Ucas et al.. Nucl. Acids Res. 24: 9 (1774-1779 (1996). and uses the SV40 eariy promoter/enhancer to drive 
expression of the cDNA of interest and dihydrcfolate reductase (DHFR). DHFR expression permits selection for 
stable maintenance of the plasnud following iransfcction. 

Twelve micrograms of the desired plasmid DNA were introduced into approximately 10 million CHO cells 
25 using commercially available tnuisfection reagents Superfect' (CJuiagen). Dosper* or Fugcne' (Boehringer Mannheim). 
The cells were grown and described in Lucas « al. , supra. Approximately 3 x Iff' ceUs are frozen m an ampule for 
ftirther growth and production as described below. 

The ampules containing the plasmid DNA were thawed by placement into water bath and mixed by 
vortexing. The contents were pipetted into a cenirifiige mbe contaming 10 mLs of media and centrifiiged at 1000 ipm 
30 fiw 5 minutes. The supernatant was aspirated and the cells were resuspended in 10 mL of selective media (0.2 Mm 
fihered PS20 with 5% 0.2 ^m diafUtered fetal bovine serum). The cells were then aliquoted into a 100 mL spinner 
containing 90 mL of selective media. After 1-2 days, the cells were transferred into a 250 mL spinner filled widi 
150 mL selective growth medium and incubated at 37«C. After another 2-3 days, a 250 mL. SCO mL and 2000 mL 
spinnen were seeded with 3 x 10' cclls/mL. The cell media was exchanged with fresh media by centnfugauon and 
35 resuspension in production medium. Although any suitable CHO media may be employed, a production medium 
described in US Patent No. 5.122.469. issued June 16. 1992 was acnially used. 3L production spinner is seeded at 
1.2 X 10' cells/mL. On day 0. the cell number pH were determined. On day 1. the spinner was sampled and 
spargii« with filtered air vras commenced. On day 2. the spinner was sampled, die lemperanire shifted to 33 *€. and 
30 mL of 500 g/L glucose and 0.6 mL of 10% anufoam (e.g.. 35 % polydimethylsiloxane emuUion. Dow Coming 
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365 Medical Grade Emulsion), -niroughoui the producuon. pH was adjusted as necessary lo keep ai around 7.2. 
After 10 days, or until viability dropped below 70%. the cell culture was harvested by centriftiguon and filtcnng 
through a 0.22 filter. TTie filtrate was either stored at 4'C or immediately loaded onto columns for purification. 

For (he poly-His tagged constructs, die protems were punfied using a Ni-NTA column (Qiagen). Before 
purification, imidazole was added to the conditioned media to a concentration of 5 mM. The conditioned media was 
5 pumped onto a 6 ml Ni-NTA column equilibrated m 20 mM Hepes. pH 7.4. buffer containmg 0.3 M NaCl and 5 mM 
imidazole at a How late of 4-5 ml/mm. at 4-'C. After loading, the column was washed with additional equilibration 
buffer and the protein eluted with equilibration buffer containing 0.25 M imidazole. The highly purified protein was 
subsequently desalted into a storage buffer containing 10 mM Hcpcs. 0. 14 M NaCl and 4% mannitol. pH 6.8. with 
a 25 ml G25 Superfine (Pharmacia) column and stored at -80*C. 

10 Immunoadhesin (Fc containing) constructs of were purified from die conditioned media as follows. The 

conditioned medium was pumped onto a 5 ml Protein A column (Pharmacia) which had been equilibrated in 20 mM 
Na phosphate buffer. pH 6.8. After loading, the column was washed extensively with equilibration buffer before 
eiurion widi 100 mM citric acid. pH 3.5. The eluted protein was immediately neutralized by collecting 1 ml fractions 
mto tubes contaimng 275 of I M Ins buffer. pH 9. The highly purified protein was subsequently desalted into 

15 storage buffer as described above for the poly-His tagged protems. The homogeneity was assessed by SDS 
polyacrylamide gels and by N-tcrminal amino acid sequencing by Edman degradation. 

The foUovring PRO polypepndes were succcssfiilly transienUy expressed in CHO cells: PRO200. PRO320. 
PR0237. PR0273. PR0337. PR0846. PR0363. PR0322. PRO1083. PR0938. PRO1012. PROllH. PROlOOSand 
PRO1075. 

The foUowing PRO polypepndes were successfully transiemly expressed m COS cells: PR0181. PROI95, 
PRO200. PRO320. PR0237. PR0273. PR0285. PR0337. PR0526. PRO540. PR0846. PR0362. 
PRO700. PRO707. PR0617. PR0322. PR0719. PRO1083. PR0868. PR0866. PR0768. PR0938. PRO1012. 
PR0162. PROllH. PR0827. PRO1008 and PRO1075. 

The following PRO polypeptides were successfully stably expressed in CHO cells: PR0181, PR0195. 
PR62OO. PRO320. PR0285. PR0337. PR0846. PR0362. PR0363, PRO707. PR0617. PR0322. PROI083. 
PR0868. PR0866. PRO1017. PR0792. PR0788. PR0938. PRO1012. PR0162. PROllH. PR0827. PRO1008. 
PRO1075andPROI03I. 



20 



30 



35 



EXAMPLE 107: Expression of PRO Pnlvp«.p iides in Ymm 

The following mediod describes recombinant expression of a desired PRO polypeptide in yeast. 

Fust, yeast expression vectors are consmicted for imracellular producuon or secreuon of PRO polypeptides 
from the ADH2/GAPDH promoter. DNA encoding a desired PRO polypeptide, a selected signal peptide and the 
promoter is inserted into suitable restnciion enzyme sites in the selected plasmid to direct miracellular expression of 
die PRO polypeptide. For secretion. DNA encoding the PRO polypeptide can be cloned into the selected plasmid. 
logeOier with DNA encoding die ADH2/GAPDH promoter, die yeast alpha-factor secretory signal/leader sequence, 
and linker sequences (if needed) for expression of die PRO polypeptide. 

Yeast ccUs. such as yeasi strain ABIIO, can dien be transformed widi the expression plasmids described 
above and culnircd in selected fermentation media. The tfansformed yeast supematants can be analyzed by 
precipitation with 10% trichloroacetic acid and separanon by SDS-PAGE. followed by staining of die gels widi 
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Coomassie Blue stain. 

Recombinani PRO polypeptide can subsequently be isolated and punficd by removing the yeast cells from 
the fenneniauon medium by cenirifugauon and then concemraung ihe medium using selected canndge filters. Ttt 
coiKcniraie contaimng the PRO polypeptide may further be punned usuig selected column chromatography rcsms. 



5 F^AMPLE 103 : pxpr^^^mn of P RO PolvnePiides in Baculovinis-lnffCted inSCCt CcllS 

The foUowmg method describes recombinant expression of PRO polypcpudes in Baculovirus-infccted insect 

cells. 

The desired PRO polypeptide is fused upstream of an epitope tag contained with a baculovirus expression 
vector. Such epitope tags include poly-his ugs and immunoglobulin tags (like Fc regions of IgG), A variety of 
10 plasmids may be employed, including plasmids derived from commercially available plasmids such as pVL1393 
(Novagen), Briefly, the PRO polypeptide or the desired portion of the PRO polypeptide (such as the sequence 
encoding the extiaceUular domain of a uansmembrane protem^ is amplified by PCR with pruners complementary to 
the 5* and 3" regions. The 5' primer may incorporate flanking (selected) restriction enzyme sues. The product is 
then digested with those selected restriction enzymes and subcloned into the expression vector. 
15 Recombinant baculovirus is generated by co-transfeciing the above plasmid and BaculoGold^" virus DNA 

(Phaimmgen) mto Spodopiera fmgiperda cells (ATCC CRL 1711) using lipofcctin (commercially available 

from GIBCO-BRU. After 4-5 days of incubation at 28"C, the released viruses arc harvested and used for further 
amplifications. Viral infection and protein expression is performed as described by O'Reilley ci al.. Baculovirus 
expression vectors: A laboratory Manual. Oxford: Oxford University Press (1994). 
20 Expressed poly-his tagged PRO polypeptide can then be purified, for example, by Ni-*-chelaie affinity 

chromatography as foUows. Extracts are prepared from recombinant virus-infected Sf9 cells as described by Rupen 
etal., Nature. 362:175-179 (1993). Brieny, Sf9 cells are washed, resuspended in sonication buffer (25 mL Hcpcs» 
pH 7.9; 12.5 mM MgCK. O.l mM EDTA: 10% Glycerol: 0.1% NP-40; 0.4 M KCl), and sonicated twice for 20 
seconds on icc. The somcaies are cleared by centrifugation. and the supernatant is diluted 50-fold in loading buffer 
25 (50 mM phosphate. 300 mM NaCI. 10% Glycerol. pH 7.8) and filtered through a 0.45 ^ni filter. A Ni'^-NTA 
agarose column tcommercially available from Qiagen) is prepared with a bed volume of 5 mL. washed with 25 mL 
of water and equiUbraicd widi 25 mL of loading buffer. The filtered cell extract is loaded onto the column at 0.5 mL 
per minute. The column is washed to baseline A,„ with loading buffer, at which point fraction collection is started. 
Next, the column is washed with a secondary wash buffer (50 mM phosphate; 300 mM NaCl. 10% Glycerol. pH 
30 6.0). which clutes nonspecifically bound proicm. After reaching A,,o baseline agam, the column is developed widi 
a 0 to 500 mM Imidazole gradient in the secondary wash buffer. One mL fractions are collected and analyzed by 
SDS-PAGE and silver staining or western blot with Ni-*-NTA-conjugated to alkaline phosphatase (Qiagen). 
Fractions coniaiiung the eiuied His.a-wgged PRO polypepndc are pooled and dialyzed against loading buffer. 

Altcmaovely, purification of the IgG tagged (or Fc tagged) PRO polypeptide can be performed using known 
35 chromatography techmques. including for instance. Protein A or protein G column chromatography 

PR0195, PR0526. PRO540, PR0846. PR0362. PR0363. PRO700. PRO707. PR0322. PR0719, 
PRO1083, PR0868. PR0866. PR0768, PR0788. PR0938. PR0827 and PROI031 were successfully expressed in 
baculovinis infected Sf9 insect cells. While the expression was actually performed in a 0.5-2 L scale, it can be readily 
scaled up for larger tc.g. 8 L) preparations. The proteins were expressed as an IgG construct timmunoadhesm), in 
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which the protein cxtraceUular region was tused to an IgGl constant region seouence containmg the hinge. CH2 and 
CH3 domains and/or in poly-His ueged forms. 

For expression m baculovirus infected Sf9 cells, following PCR amphficauon. the respective coding 
sequences were subcioned into a baculovirus expression vecior (pb.PH.IgG for IgG fusions and pb.PH.His.c for poly. 
His lagged proteins), and the vector and Baculogold* baculovirus DNA (Pharmingen) were co-transfected into 105 
SpodopierafiugiperdaCSt9') cells fATCC CRL 1711). using Lipofecun (Gibco BRL). pb.PH.IgG and pb.PH.His 
are modifications of the commerciaUy available baculovirus expression vector pVL1393 fPhanningen). widi modified 
polylinker regions to include the His or Fc tag sequences. TTie cells were grown m Hink's TNM-FH medium 
supplemented with 10% FBS (Hydonei. Cells were mcubated for 5 days at 28'C. The supernatant was harvested 
and subsequently used for the first viral ampUficaoon by infecting Sf9 cells in Hink's TNM-FH medium supplemented 
widi 10% FBS at an approximate multiplicity of infection (MOO of 10. Cells were incubated for 3 days at 28«C. 
The supernatant was harvested and the expression of the constructs in die baculovirus expression vector was 
determined by batch binding of I ml of supernatant to 25 mL of Ni-NTA beads (QIAGEN) for histidine tagged 
proteins or Protein-A Sepharose CL-4B beads (Pharmacia) for IgG tagged proteins followed by SDS-PAGE analysis 
comparmg to a known concemration of protein standard by Coomassie blue staining. 

The firet viral amplification supernatant was used to infect a spinner culnire (500 ml) of Sf9 cells grown m 
ESF-921 medium (Expression Systems LLC) at an approximate MCI of 0. 1 . Cells were incubated for 3 days at 
28'C. Tlie supeniatam was harvested and filtered. Batch binding and SDS-PAGE analysis was repeated, as 
necessary, until expression of the spinner culture was confirmed. 

TTie conditioned medium from the transfected ceils (0.5 to 3 L) was harvested by cerarifiigation to remove 
the cells and fiUeied through 0.22 micron filters. For the poly-His tagged constructs, the protein construct were 
purified using a Ni-NTA column (Qiagen). Before purification, imidazole was added to the conditioned media to a 
concemration of 5 mM. The conditioned media were pumped onto a 6 ml Ni-NTA column equilibrated in 20 mM 
Hepes. pH 7.4. buffer containing 0.3 M NaCI and 5 mM imidazole ai a flow rate of 4-5 ml/min. at 4-C. After 
loading, the column was washed with additional equilibration buffer and the protein eluted widi equilibration buffer 
containing 0.25 M imidazole. The highly purified protein was subsequently desalted into a storage buffer containing 
10 mM Hepes. 0. 14 M NaCl and 4% mannitol, pH 6.8. with a 25 ml G25 Superfine (Pharmacia) column and stored 
at -80'C. 

tauainoadhesin (Fc containing) consmicts of proteins were purified from the conditioned media as follows. 
•n» condiuoned media were pumped onto a 5 ml Protein A column (Pharmacia) which had been equilibrated in 20 
mM Na phosphate buffer. pH 6.8. After loading, the column was washed extensively with equilibraUon buffer before 
ehjiion with 100 mM citric acid. pH 3.5. The eluted protein was immediately neutralized by collecting 1 ml fractions 
into tubes comainiog 275 mL of 1 M Tris buffer. pH 9. The highly purified protein was subsequently desalted into 
storage buffer as described above for the poly-His tagged proteins. TTie homogeneity of the proteins was verified by 
SDS polyacrylamide gel (PEG) electrophoresis and N-tetminal amino acid sequencing by Edman degradation. 
35 PR0181 . PR0195. PRO200. PRO320. PR0237. PR0273, PR0285. PR0337. PR0526. PRO540. PR0846. 

PR0362. PR0363. PR0617. PR0322. PRO1083. PR0868. 768. PR0792. PR0788. PR0162. PR0U14. PR0827. 
PRO1075 and PRO1031 were successfully expressed in baculovirus infected Hi5 insect cells. While the expression 
was acmally performed in a 0.5-2 L scale, it can be readily scaled up for larger (e.g. 8 L) preparations. 

For expression m baculovirus-infccted Hi5 insect cells, the PRO polypcpnde-encoding DNA may be 
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amplified with suitable systems, such as Pfu (Sirataeenei. or fused upstream (5'-oO of an epuope tag coniamed with 
a baculovirus expression vector. Such epitope tags include poly-his tags and iramunoglobulm tags Hike Fc regions 
of IgG). A vanety of plasmids may be employed, including plasnuds derived from commercially available plasmids 
such as pVL1393 (Novagenj. Briefly, the PRO polypepndc or the desired poruon of the PRO polypcpnde fsuch as 
the sequence encoding the extracellular domain of a transmembrane proiem) is amplified by PCR with primers 
5 complementary to the 5' and 3' regions. The 5* primer may incorporate Hanking (selected) restriction enzyme sites. 
The product is then digested with those selected restriction enzymes and subcloncd imo the expression vector. For 
example, derivadves of pVLl393 can include the Fc region of human IgG (pb.PH.IgG) or an 8 histidine (pb.PH.His) 
tag downstream (3'-of) the NAME sequence. Preferably, the vector construct is setjuenced for confirmation. 

Hi5 cells arc grown to a confluency of 50% under the conditions of. 27'C. no C02. NO pen/strep. For each 
10 150 mm plate. 30 ug of pIE based vector containing PRO polypepude is mixed with I ml Ex-Cell medium (Media: 
Ex-Cell 401 + 1/100 L-Glu JRH Biosciences (!^14401-78P (note: this media is light sensiuve)). and in a separate 
tube. 100 ul of CellFecim (CellFECTlN (GibcoBRL #10362-010) (vonexed to mix)) is mixed with I ml of Ex-Cell 
medium. The two solunons are combined and allowed to incubate at room temperamre for 15 minutes. 8 ml of Ex- 
Cell media is added to the 2ral of DNA/CellFECTTN mU and this is layered on Hi5 cells that have been washed once 
1 5 with Ex-CeU media. The plate is then incubated in darkness for 1 hour at room temperamre. The DN A/CellFECTIN 
mix is then aspirated, and the cells are washed once with Ex-Cell to remove excess CellFECTlN . 30 nU of fresh 
Ex-Cell media is added and the cells arc incubated for 3 days at 28"C. Tlie supematam is harvested and the 
exi»ession of fte PRO polypeptide in the baculovirus expression vector can be determined by batch binding of 1 ml 
of supemaiem to 25 mL of Ni-NTA beads (QIAGEN) for histidine tagged proteins or Protein-A Sepharose CL-tB 
20 beads (Pharmacia) for IgG tagged proteins followed by SDS-PAGE analysis comparing to a known concentranon of 
protein standard by Coomassie blue staining. 

■me conditioned media from the iransfected cells (0.5 to 3 L) is harvested by cemrifugation lo remove the 
cells and filtered through 0.22 micron filters. For the poly-His tagged constructs, the protem compnsing the PRO 
polypeptide is purified using a Ni-NTA column (Qiagen). Before purification, imidazole is added to the conditioned 
25 media to a concentration of 5 mM. TTic condiuoned media is pumped onto a 6 ml Ni-NTA column equilibrated in 
20 mM Hepes, pH 7.4. buffer containing 0.3 M NaCI and 5 mM imidazole at a flow rate of 4-5 ml/min. at 4-C. 
Alier loading, the column is washed with additional equiUbration buffer and the protein eluted with equilibration 
buffer containing 0.25 M imidazole. TTie highly purified protein is subsequently deslaied imo a storage buffer 
containing 10 mM Hepes. 0. 14 M NaCl and 4% manmtol. pH 6 8. with a 25 ml G25 Superfine (Pharmacia) column 
30 and stored at -WTC. 

Immunoadhesin (Fc containing) constructs of proteins are purified from the conditioned media as follows. 
Jht conditioned media is pumped onto a 5 ml Protein A column (Pharmacia) which had been equilibrated in 20 mM 
Na phosphate buffer. pH 6.8. After loading, the column is washed extensively with equilibration buffer before elution 
wifli 100 mM citric acid. pH 3.5. TTie eluted protein is immediately neutralized by collecting 1 ml fractions into tubes 
35 containing 275 mL of I M Tris buffer. pH 9. The highly purified protein is subscquenUy desalted into storage buffer 
as described above for die poly-His tagged proteins. TTie homogeneity of PRO polypeptide can be assessed by SDS 
polyacrylamide geU and by N-terminal amino acid sequencing by Edman degradation and other analytical procedures 
as desired or necessary. 
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EXAMPLE 104 : PrtParatinn nf Annhfw i ieji ihar Rin<j fo PRO Polvni>pt,ri^.; 

This example Hlusttatcs preparauon of monoclonal anubodies which can specificaUy bind co a PRO 
polypeptide. 

Techniques for producing the monoclonal anubodies are known in the art and are described, for instance, 
in Coding, aima. bnmmiogens that may be employed include purified PRO polypeptide, fusion proteins containing 
the PRO polypeptide, and cells expressing recombmant PRO polypeptide on (he cell surface. Selection of the 
immunogen can be made by the skilled anisan without undue experimentanon. 

Mice, such as Balb/c. are immunized with the PRO polypeptide immunogen emulsified m cotnplete Freund's 
adjuvant and injected subcutaneously or intraperitoncally in an amount fiom 1-100 microgiams. Alteraatively. die 
immunogen is emulsified in MPL-TDM adjuvant (Ribi Immunochemical Research. Hamilton. MT) and injected into 
the animal s hind foot pads. The immunized mice are then boosted 10 to 12 days later with additional immunogen 
emulsified in the selected adjuvant. Thereafter, for several weeks, the mice may also be boosted with additional 
immunization injections. Serum samples may be periodically obuuned from die mice by retro-orbital bleeding for 
testing in ELISA assays to detect anti-PRO polypeptide antibodies. 

After a suitable amibody titer has been detected, die animals "positive" for antibodies can be injected widi 
a final intravenous injection of PRO polypepdde. TTirec to four days later, the mice are sacrificed and die spleen cells 
arc harvested. The spleen cells are dien fiised (using 35% polyediylene glycol) to a selected murine myeloma cell 
line such as R3X63AgU.l. available from ATCC. No. CRL 1597. TTie fiisions generate hybridoma cells which can 
dJcn be plated in 96 well Ussue culnirc plates containing HAT (hypoxanthine. aminopterin. and thymidine) medhun 
to inhibit proliferation of non-fiised cells, myeloma hybrids, and spleen cell hybrids. 

The hybridoma cells will be screened in an EUSA for reactivity against the PRO polypeptide. 
Determinarion of "positive" hybridoma cells secreting die desired monoclonal antibodies against die PRO polypeptide 
is widiin die skill in die an. 

■nie positive hybridoma cells can be injected miraperitoneally into syngeneic Balb/c mice to produce ascites 
containing the anti-PRO polypepode monoclonal antibodies. Aliemaavely. the hybridoma cells can be grown in assue 
culnire flasks or roBcr botdes. Purificanon of die monoclonal antibodies produced in die ascites can be accomplished 
using ammonium sulfate precipitation, followed by gel exclusion chromatography. .Alternatively, affinity 
chromatography based upon binding of antibody to protein A or protein G can be employed. 

EXAMPLE 105: Chimerir PRO Pnlvn.miH>« 

PRO polypeptides may be expressed as chimeric proteins widi one or more additional polypeptide domains 
added to facilitote protein purification. Such purification facilitating domains include, but are not limited to. metal 
chelating peptides such as hisudine -tryptophan modules diat allow purification on immobilized metals, protein A 
domains diat allow purification on unmobilized immunoglobulin, and die domain utilized in die FLAGS™ 
extension/affinity purification system (Immunex Corp.. Seattle Wash.). The inclusion of a cleavable linker sequence 
such as Factor XA or emcrokinase (Invitrogen. San Diego Calif.) between die purificauon domam and die PRO 
polypeptide sequence may be useftil lo facilitate expression of DNA encoding die PRO polypeptide. 
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RXAMPLE 106 : purificanon of pRO Polvpepiides Using Specific Amibodics 

Native or recombinant PRO polypeptides may be punficd by a vancry ot standard techniques in tbc an of 
protein purificanon. For example, pro-PRO polypeptide, mature PRO polypeptide, or pre-PRO polypeptide is 
purified by iniraunoaffiniiy chromaiogniphy usmg antibodies specific for the PRO poiypcpude of interest. In general, 
an immunoaffinity column is constructed by covalenily coupling the anti-PRO polypeptide anybody to an activated 

5 chromatographic resin. 

Polyclonal immunoglobulins are prepared ft-om immune sera either by precipitauon with ammonium sulfate 
or by purification on immobilized Protem A (Pharmacia LKB Biotechnology, Piscaiaway, N.J.). Likewise, 
monoclonal antibodies are prepared from mouse ascites fluid by ammomum sulfate precipitation or chromatography 
on immobilized Protein A. Partially purified immunoglobulin is covalently attached to a chromatographic resin such 
10 as CnBr-acnvatcd SEPHAROSE^^ (Pharmacia LKB Biotechnology). The antibody is coupled to the resin, the resin 
is blocked, and the derivative resin is washed according to the manufacwrer s instructions. 

Such an immunoaffinity column is utilized in the purification of PRO polypeptide by prepanng a ft^action 
from cells containing PRO polypeptide in a soluble form. This preparation is derived by solubilization of the whole 
cell or of a subcellular fraction obtained via differential cemriftigauon by the addition of detergent or by other 
15 methods weU known m the art. Aliemanvely, soluble PRO polypeptide coniammg a signal sequence may be secreted 
in useful quantity into the medium m which the ceils are grown. 

A soluble PRO polypeptide-contaimng preparation is passed over die immunoaffinity column, and the 
column is washed under conditions that allow the preferential absorbance of PRO polypeptide (e.g., high ionic 
strength buffers in the presence of detergent). Then, die column is eluied under conditions that disrupt antibody/PRC 
20 polypeptide binding (e.g. , a low pH buffer such as approxmiaicly pH 2-3, or a high concentrauon of a chaoiropc such 
as urea or tiiiocyanate ion), and PRO polypeptide is collected. 



FXAMPLE 107: Dnig Screening 

This mvention is panicularly useful for screening compounds by using PRO polypeptides or binding 
25 fragmem tiiereof in any of a variety of drug screening techniques. The PRO polypeptide or fragment employed in 
such a test may cither be free in solution, affixed to a solid support, borne on a cell surface, or located iniracellularly. 
One method of drug screemng utilizes eukaryoiic or prokaryoiic host cells which are stably transformed with 
recombinant nucleic acids expressing the PRO polypeptide or fragment. Drugs arc screened against such iransfomicd 
cells in competitive binding assays. Such cells, eitiier in viable or fixed form, can be used for standard binding 
30 assays. One may measure, for example, the formation of complexes between PRO polypeptide or a ftagmcm and die 
agent being tested. Alternatively, one can examine the diminution in complex formation between die PRO polypeptide 
and its target cell or target receptors caused by the agent being tested. 

Thus, the present invenuon provides methods of screening for drugs or any otiier agents which can affect 
a PRO polypcpiide-associaied disease or disorder. These mctiiods comprise contacting such an agent witii an PRO 
35 polypeptide or fragment thereof and assaying (I) for the presence of a complex between die agent and the PRO 
polypeptide or fragroint. or (u) for the presence of a complex between die PRO polypeptide or fragmem and the cell, 
by methods weU known in die an. In such competitive binding assays, die PRO polypeptide or fragment is typically 
labeled. After suitable incubation, free PRO polypeptide or fragment is separated from that present in bound form, 
and the amount of free or uncomplexed label is a measure of die ability of the parucular agent to bind to PRO 
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polypcpude or to interrere wiih ihc PRO polypepude/ceil complex. 

Another technique for drug screening provides high ihroughput screetiing for compounds havuig suitable 
binding affinity to a polypeptide and is descnbed in detail in WO 84/03564. published on September 13. 1984. 
Briefly stated. large numbers of different small peptide test compounds arc synthesized on a solid substrate, such as 
plastic pins or some other surface. As applied to a PRO polypeptide, ihe peptide test compounds arc reacted with 
PRO polypcpode and washed. Bound PRO polypeptide is detected by methods well known in the an. Purified PRO 
polypeptide can also be coated direcUy onto plates for use in the aforementioned drug screenmg techniques. In 
addition, non-neutralizing antibodies can be used to capture the peptide and immobilize it on the solid suppon. 

This invention also contemplates the use of competitive drug screening assays in which neutralizing 
antibodies capable of binding PRO polypeptide specificaJly compete with a test compound for binding to PRO 
polypeptide or fragments thereof, in this manner, the antibodies can be used to detect the presence of any peptide 
which shares one or more anugenic deiemunants with PRO polypeptide. 

EXAMPLE 108: Rational Dmp np.iyn 

The goal of raaonal drug design is to produce smictural analogs of biologically active poiypepude of mterest 
(/.e., a PRO polypeptide) or of small molecules widi which they interact, e.g., agomsts. antagomsts, or inhibitors. 
Any of these examples can be used to fashion drugs which are more active or stable forms of the PRO polypeptide 
or which enhance or interfere with the function of the PRO polypeptide in vivo (c.f., Hodgson, Bio/Technnloyy j: 
19-21 (1991)). 

In one approach, the three-dimensional stnicmrc of the PRO polypeptide, or of an PRO polypeptidc-inhibitor 
complex, is determined by x-ray crystallography, by computer modeling or. most typically, by a combination of the 
two approaches. Botii the shape and charges of the PRO polypeptide must be ascertained to elucidate the strucmre 
and to detemune active site(s) of the molecule. Less often, useful information regarding the strucmre of the PRO 
polypeptide may be gained by modeling based on the sinicturc of homologous proteins. In both cases, relevant 
smiciural informanon is used to design analogous PRO polypeptide-like molecules or to identify efficient inhibitors. 
Uscfiil examples of rational drug design may include molecules which have unproved activity or stability as shown 
by Braxton and Wells. Biochgmisiry. 21:7796-7801 (1992) or which act as inhibitors, agonists, or antagonists of 
native peptides as shown by Athauda et aL, J. Biochcm. . 112:742-746 (1993). 

It is also possible to isolate a target-specific antibody, selected by functional assay, as described above, and 
then to solve its crystal strucmre. This approach, m principle, yields a pharmacore upon which subsequent drug 
design can be based, h is possible to bypass protein crystallography altogether by generating anti-idiotypic antibodies 
(anti-ids) to a functional, pharmacologically active antibody As a mirror image of a mirror image, the binding site 
of the anti^ids would be expected to be an analog of die original receptor. The anti-id could then be used to identify 
and isolate peptides from banlcs of chemically or biologically produced peptides. The isolated peptides would dien 
act as the pharmacore. 

By virme of die present invention, sufficient amounts of the PRO polypeptide may be made available to 
pcrfonn such anaiyncal studies as X-ray crysiaUography . In addition, knowledge of die PRO polypeptide ammo acid 
sequence provided herein will provide guidance to those employing computer modeling techniques in place of or in 
addition to x-ray crystallography. 
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PVAMPf E 109 - f\h\m fff P P" P"'T^""' ' " '"h'h" Va^rniar Fnrtoihclial Growth Facwr (VF<1F) Sl i mu la wd 

PmlifcraP"" En<lothelial Cell Growjh 

The ability of various PRO polypeptides to inhibit VEGF stimulated prolifcrauon of eiutothelial ceils was 
tested. Specifically, bovine adrenal cortical capillary endothelial (ACE) cells (from primary culture. ma«nmm 12-14 
passages) were plated on 96-well tiucroiiter plates (Amersham Ufe Sciencei at a density of 500 celU/wcU per 100 
5 fiL in lov^' glucose DMEM. 10% calf setum. 2 mM glutamine. Ix pen/sirepi and fungizone, supplemented with 3 
ng/mL VEGF. Controls were plated the same way but some did not include VEGF. A test sample of the PRO 
polypeptide of interest was added in a 100 mI volume for a 200 ,.1 final volume. Cells were incubated for 6-7 days 
at 37"C. nte media was aspirated and the cells washed Ix wi* PBS. An acid phosphatase reaction mixture (100 
^L. O.IM sodium acetate. pH 5.5. 0. 1 % Triton-100. 10 mM p-niirophenyl phosphate) was added. After incubation 
10 for 2 hours at 37'C. the leacnon was stopped by addition of 10 m1 1 N NaOH. OD was measured on microliter plate 
reader at 405 nm. Controls were no cells, cells alone, cells + FGF (5 ng/mL). cells + VEGF (3 ng/mL). celU + 
VEGF (3 ng/tnl) + TGF-P (I ng/ml). and cells + VEGF (3ng/mL) + LIF (5 ng/mL). aOF-p at a 1 ng/ml 
concentration is known to block 70-90% of VEGF stimulated cell proliferation.) 

The results were assessed by calculating the percentage inhibition of VEGF (3 ng/ml) stunulated cells 
15 proliferation, determined by measuring acid phosphatase activity at OD405 nm. (1) relative to cells without 
stimulation, and (2) relative to the reference TGF-p inhibition of VEGF sumulated acuvity. THe results are indicative 
of the utiUty of the PRO polypeptides in cancer therapy and specifically in inhibiting tumor angiogenesU. 

The PRO polypeptides demonstrated as being capable of inhibiting VEGF stimulated proUfcration of 
endothelial cell growth at various concentranons include PRO200 and PRO320. 

20 

PYAMPt K 110: Rwinal Neuron Survival 

TWs example demonsoates that various PRO polypeptides have efficacy in enhancing the survival of retinal 

neuron cells. 

Sprague Dawley rat pups at postnatal day 7 (mixed population: gUa and retinal neuronal types) are killed 
25 by decapitation following CO, anesthesia and the eyes are removed under sterile conditions. The neural retina is 
dissected away from the pigment cpiiheUum and other ocular tissue and then dissociated mto a single cell suspension 
using 0.25% trypsin in Ca'*. Mg-*-free PBS. The retinas are incubated at 37'C for 7-10 minutes after which the 
trypsin is inactivated by adding I ml soybean trypsin inhibitor. TTb cells are plated at 100.000 cells per weU in 96 
weU plates in DMEM/F12 supplemented with N2 and with or without the specific test PRO polypeptide. Cells for 
30 all cxpetiroenis are grown at 37»C in a water sanirated atmosphere of 5 % CO.. After 2-3 days in culnire. cells arc 
stained with calcein AM then fixed using 4% paraformaldehyde and stained with DAPl for determination of toul cell 
count. Tlje total cells (fhiorcscenu are quantified at 20X objective magnification using CCD camera and NIH image 
software for Macintosh. Fields in the well are chosen at random. 

The effea of various concentrauon of PRO polypeptides is calculated by dividing the total number of calcem 
35 AM positive cells at 2-3 days in culmre by the toul number of DAPl-labeled cells at 2-3 days in cutaire. Anything 
above 30% survival is considered posiiive. The following PRO polypeptides were positive in tiiis assay: PRO200. 
PRO540. PR0846 and PR0617. 
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EXAMPt.R 1 1 1 ■ Rod Phntftpffrrrmr ^iint-iil 

This example demonsirates ihat vanous PRO polypeptides have efficacy tn enhancing the survival of rod 
photoreceptor cells. 

Sprague Dawlcy rat pups at 7 day posmatal tmued population: glia and retinal neuronal cell t>'pes) are killed 
by decapitauon following CO, ancsthesis and the eyes are removed under sterile conditions. The neural reiiiia is 
dissected away tbmi the p.gmera epidielium and other ocular tissue and then dissociated into a single cell suspension 
using 0.25% toDSin m Ca^*. Mg'*-free PBS. The retinas are incubated at 37-C for 7-10 minutes after which the 
trypsin is inactivated by adding I ml soybean trypsin inhibitor. The cells are plated at 100.000 cells per well in 96 
weu plates in DMEM/F12 supplemented with N2 and with or without the specific test PRO polypeptide. Cells for 
all expenmems are grown at 37»C in a water saniraicd atmosphere of 5% C02. After 2-3 days in culnire. cells are 
fixed using 4% paraformaldehyde, and then stained using CellTracker Green CMFDA. Rho 4D2 (ascites or IgG 
1:100). a monoclonal antibody directed towards the visual pigment rhodopsin is used to detect rod photoreceptor cells 
by indirect unmunofluoresccncc. The results are reponed as % survival: total number of calcein/CellTrackcr - 
rhodopsin posiove cells at 2-3 days in culnire. divided by the total number of rhodopsin positive cells at time 2-3 days 
in culture. The total cells (fluorescent) are quantified at 20x objective magnification usmg a CCD camera and NIH 
15 image software for Macintosh. Fields in die well are chosen at random. 

With regard to the effect of various concentration of PRO polypeptides, anything above 10% survival is 
considered posinve. The foUowing PRO polypeptides tested positive in this assay: PRO200. PRO540. PR0846 and 
PR0617. 
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EXAMPf F I V : AbiliiY PRO PolvDemides to .Stimulate .h. BM.,,. of Pn,t>^,ivp,n. t^..J ^ ^ ^ nil n f T 

TTic ability of vanous PRO polypeptides to stimulate die release of proteoglycans from cartilage ussue was 
tested as follows. 

The metacarphophalangeal joint of month old pigs was aseptically dissected, and articular carulage was 
removed by free hand slicmg being careftil to avoid die underlying bone. TTie canilage was minced and culmrcd in 
bulk for 24 hours in a humidified amnsphere of 95% air. S% CO, in serum free (SF) media (DME/F12 1:1) wotii 
0.1% BSA and lOOU/ml penicillin and lOO^g/ml streptomycin. After washing three times, approximately 100 mg 
of articular cartilage was alit)uoted into micronics mbes and incubated for an additional 24 hours in die above SF 
media. PRO polypeptides were then added at I % either alone or in combination with 18 ng/ml intcrleukin-la. a 
known stmnilator of proteoglycan release from cartilage tissue. The supernatant was then harvested and assayed for 
the amount of proteoglycans using the 1 .9wlimethyl-meihylene blue (DMB) colotimetric assay (Famdale and ButUe. 
B i ochcm , Bim)hvs ftr t a 883:173-177 (1985)). A positive result in this assay indicates diat the test polypeptide will 
find use. for example, in the treatment of sports-related joint problems, articular canilage defects, osteoanhrios or 
rheumatoid arduius. 

When PRO200 polypepndes were tested in the above assay, die polypeptides demonstrated a marked ability 
to stimulate release of proteoglycans from carulage tissue both basaily and after stimulation with interleukin-lo and 
at 24 and 72 hours after treamient. thereby indicating that PRO200 polypeptides are usehil for sumulating 
proteoglycan release from canilage tissue. 
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FVAMPi.g 113 - In vjtrff Antiprftlifrr?"^^ ^^^^^ 

The aimprolifcrauvc acuvity of vanous PRO polypcpiides was deicrmincd in the invesugauonal, discasc- 
oricmcd in vitro anti-cancer drug discovery assay of ihc National Cancer Insdtuic (NCF). using a suiforhodaraine B 
(SRB) dye binding assay essenrially as described by Skehan et al., /. NaiL Cancer fnst. 82: 1 107-1 1 12 (1990). The 
60 aimor ceU lines employed in this study ("the NCI panel"), as well as conditions for their maintenance and culture 
5 in vitro have been described by Monks et al.. /. NaiL Cancer Inst, ^3:151-166 (1991). The purpose of this screen 
is 10 inidaUy evaluate the cytotoxic andVor cytostatic acuvity of the test compounds against different types of tumors 
(Monks et al., supra; Boyd, Cancer: Princ. Pract. OncoL Update 3(10): M2 (19891). 

CeUs from approximately 60 human tumor cell lines were harvested with irypsin/EDTA (Gibco). washed 
once, resuspendcd in IMEM and their viability was determined. The cell suspensions were added by pipci (100 mL 
10 volume) into separate 96-well micronier plates. Hie cell density for the 6.day mcubation was less than for the 2-day 
incubation to prevent overgrowth. Inoculates were allowed a preincubation period of 24 hours at 37*C for 
stabilization. Dilutions at twice the intended test concentration were added at time zero in 100 pL aliquots to the 
microtitcr plate wells (1:2 dilution). Test compounds were evaluated at five half-log dilutions (1000 to 100,000-fold). 
Incubations took place for two days and six days in a 5% CO, aunosphcrc and 100% humidity. 
15 After incubation, the medium was removed and the cells were fixed in 0. 1 ml of 10% irichloroaceuc acid 

at 40"C. The plaies were rinsed five limes with deionized water, dried, stamcd for 30 minutes with 0. 1 ml of 0.4% 
sulforhodamine B dye (Sigma) dissolved in I % acetic acid, rinsed four times with 1 % acetic acid to remove unbound 
dye, dried, and the stain was extracted for five minutes with 0.1 ml of 10 mM Tris base 
(tris(hydroxymcthyl)aniinomeihanel. pH 10.5. The absorbancc (OD) of sulforhodamine B at 492 nm was measured 
20 using a computer-interfaced. 96-weU microliter plate reader. 

A lesi sample is considered positive if it shows at least 50% growth inhibitory effect at one or more 
conccmrations. The foUowmg PRO polypeptides gave positive results in at least one mmor cell line: PR0I8K 
PR0237. PR0526. PR0362 and PR0866. 

25 FX AMPLE 114: Gene Amnlification 

This example shows that genes encoding various PRO polypcpiides arc amplified in ihe genome of certain 
human cancers. Amplification is associated wiih overexpression of the gene product, indicating that the PRO 
polypeptide is a useful target for therapeutic intervention in certain cancers such as colon, lung and other cancers, 
-merapeuric agent may take the form of antagonists of PRO polypeptide-encoding genes, for example, murine-human 
30 chimeric, humanized or human antibodies agamst the PRO polypeptide. 

The starting material for the screen was genomic DNA isolated from a variety cancers. The DNA is 
quandtated precisely, e.g., nuoromctrically. As a negative control. DNA was isolated from the cells of ten normal 
healthy individuals which was pooled and used as assay controls for the gene copy m healthy individuals (NorHu). 
m 5' nuclease assay (for example. TaqMan'^) and real-time quantitative PCR (for example, ABI Prizm 
35 7700 Sequence Detection System^"* (Perkin Elmer, Applied Biosystcms Division. Foster City. CA)), were used to 
lind genes potentially amplified in certain cancers. The results were used to deicrmme whether die DNA encoding 
the PRO polypepdde is over-represented in any of the lung and colon cancers diat were screened. The resuh was 
reported in Delia CT units. One unit corresponds 1 PCR cycle or approximately a 2-fold amplificauon relative to 
normal, two umu corresponds to 4.fold. 3 units to S-fold and so on. Quanoiauon was obtained using primers and 
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a Taqman™ fluorescent derived from the PRO polypeptidc-encodmg gene. Regions of the PRO polypepnde which 
are most likely to contain unique nucleic acid sequences and which are least likely to have spliced out inirons arc 
preferred for the primer derivation, e.g.. 3 '-untranslated region. 

The 5" nuclease assay reacnon is a fluorescera PCR-based technique which makes use of the 5' exonuclease 
activity of Taq DNA polymerase enzyme to momior amplification m real time. Two ohgonucleodde primers are used 
to generate an amplicon typical of a PCR reacnon. A third oligonucleoude. or probe, is designed to detect nucleotide 
sequence located between the two PCR pnmers. The probe is non-extendible by Taq DNA polymerase enzyme, and 
is labeled with a reponcr fluorescem dye and a quencher fluorescent dye. Any laser-induced emission the 
reporter <fye is quenched by the quenching dye when the two dyes are located close together as they are on the probe. 
During ttw anqitiiication reaction, the probe is cleaved by the Taq DNA polymerase enzyme in a template-iiepeiideu 
manner. The resultant probe fiagments disassociate in soludon, and signal from the released reporter dye is free from 
the quenching effect of the second fluorophore. One molecule of reponer dye is liberated for each new molecule 
synihesizcd. and detection of the unqucnched reporter dye provides the basis for quantitative interpretation of the data. 

The 5' nuclease procedure is run on a real-time quanntaiive PCR device such as the ABI Prism 7700TM 
Sequence Detection. The system consists of a thermocyler, laser, charge-coupled device (CCD) camera and 
con^juter. The system amplifies samples in a 96-well format on a thermocycler. During amplificauon, laser-induced 
nuorescent signal is coUecied in real-dmc through fiber optics cables for all 96 wells, and detected at the CCD. The 
system iiKludes software for running the inirumem and for analyzing the data. 

5* Nuclease assay data are initially expressed as Ct, or the threshold cycle. This b defined as the cycle at 
which the reporter signal accumulates above the background level of fluorescence. The Ct values are used as 
quantitaUve measurement of the relative number of starting copies of a particular target sequence in a nucleic acid 
sample. 

Genes encoding the foUowmg PRO polypeptides were found to be amplified in die above assay: PR0213-1 , 
PR0237. PR0324. PR0351. PR0362. PR0853. PR0615. PR0531. PR0618. PR0772. PRO703. PR0474. 
PR01017 and PR0792. 

EXAMPLE 115 : Induction nf e-fns in Fndnfh,.|ial TolU 

Human venous umbilical vein endothelial cells (HUVEC. Cell Systems) in growth media (50:50 without 
glycine. 1% gluuunine. lOmM Hepes. 10% FBS. 10 ng/ml bFGF), were plated on 96-well microUter plates at a cell 
density of IxlO'cellsAveU. The day alter plating, the cells were surved by removing the growth media and treating 
the cells widi 100 »il/well test samples and controls (positive comrol: growth media: negative control: Protein 32). 
TTjc cells were incubated for 30 minutes at 37'C. in 5« CO,. The samples were removed, and the first pan of the 
bDNA kit protocol (Chiron Diagnostics, cat. Ji'6005-037) was followed. 

Briefly, the amounts of the TM Lysis Buffer and Probes needed for the tests were calculated based on 
infomiation provided by the manufacnirer. The appropriate amounts of thawed Probes were added to the TM Lysis 
Buffer, The Capmre Hybridization Buffer was warmed to room lempcramre. The bDNA strips were set up in die 
metal strip holders, and 100 m1 of Capture Hybridizauon Buffer were added to each b-DNA well needed, followed 
by incubation for at least 30 minuies. The test plates with the cells were removed from the incubator, and die media 
was gently removed using the vacuum manifold. 100^1 of Lysis Hybridization Buffer widi Probes were quickly 
pipened into each well of the microliter plates. The plates were dien incubated at 5S°C for 15 minutes. Upon 
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removal from ihc mcubaior, ihe plates were placed on the vonex mixer with the microliter adapter head and voncx 
on the #2 senmg for one mmuie. 80 ft\ of the lysaie were removed and added lo Ae bDNA wells comaming the 
Capture Hybridization Buffer* and pipetted up and down to mix. The plates were incubated at 53**C for at least 16 
hours. 

On the next day. the second pan of the bDNA kit protocol was followed. Specificaily. the Plates were 

5 removed from the incubator and placed on the bench to cool for tO minutes. The volumes of additions needed were 
calculated based upon information provided by the manufacturer. An Amplifier Workmg Solution was prepared by 
making a 1:100 dilution of the Amplifier Concentrate (20 ftn/pH in AL Hybridization Buffer. The hybridization 
mixnire was removed from the plates and washed twice with Wash A. 50 fi\ of Amplifier Working Solution were 
added to each well and the wells were incubated at 53°C for 30 minutes. The plates were then removed from the 

10 incubator and allowed to cool for 10 minutes. The Label Probe Working Solution was prepared by making a 1 : 100 
dilution of Label Concentrate (40 pmolcs/Ml) in AL Hybridization Buffer. After the 10 minutes cool down period, 
the amplifier hybridization mixmre was removed and the plates washed twice with Wash A. 50^1 of Label Probe 
Working Solution were added to each well and the wells were incubated at S^^'C for 15 mimites. After coohng for 
10 minutes, the Substrate was warmed to room temperature. Upon addition of 3 n\ of Substrate Enhancer to each 

15 ml of Substrate needed for the assay, the plates were allowed to cool for 10 minutes, the label hybridization nuxture 
was removed, and the plates were washed twice with Wash A and three-times with Wash D. 50^1 of the Substrate 
Sohuion with Enhancer were added to each well. The plates were incubated for 30 minutes at 37*'C and RLU read 
in an appropriate Itiminometer. 

The replicates were averaged and the coefficient of variation was determined. The meastire of activity of 

20 the fold increase over the Protein 32 (buffer control) value indicated by chcmoluminescence units (RLU). Samples 
which showed an at least two-fold value over the Protein 32 value were considered positive. PR0938 was positive 
in the above assay. 

py AMPLE 1 16 : Prnl»fi>»rinn nf Rat Ulricniar SunnoTting Cells 
25 In an effort to identify PRO polypeptides that act as potent mitogens for inner ear supporting ceils which 

are hair cell progenitors (related to auditory hair cell regeneration), various PRO polypeptides were tested in the 
following assay. 

The rat utricular epithelial cell line (UEC-4 cells) are aliquoted into 96 well plates with a density of 3000 
ceUs/weU in 200 ^1 of serum-containing medium at 33"C. After overnight, the cultures are switched to serum-free 

30 medhmi at 3TC and the PRO polypeptide samples arc added at various dilutions. After 24h incubation. ^H-ihymidine 
(1 mCi/wcU) is adcd to the cultures for an additional 24h. The cells arc then harvested using a Tomtec cell harvester. 
Because the epithelial cells are grown on a polylysine substrate, trypsin (1 mg/ml) is added to the culmre wells for 
30 mtn at 3rC to lift the cells before cell harvest. Cpm/well are counted with a matrix 9600 gas coumer (Packard 
Instrument Company, Downers Grove, IL). Data is collected from 3 culture wells from each of the experimental 

35 groups and expressed as mean ± SEM. A two-tailed, unpaired t-test is tised for statistical analysis, as compared to 
the control group (treatment with TGF-a). 

Average cpm counts which are at least 30% higher than the control values are considered positive for the 
assay. The following PRO polypeptides were positive in this assay: PR0337, PR0363 and PRO1012. 
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EXAMPLE 117: h sttu Hvt^rjrii^^^i^f, 

In sttu hybridizauon is a powerful and versatile technique for the detection and localization of nucleic acid 
sequences within ceil or tissue preparations. It may be useful, for example, to identify sues of gene expression, 
analyze the tissue distribution of iranscnpnon. identify and localize viral infection, follow changes in specific mRNA 
synthesis and aid in chromosome mapping. 

In situ hybridization was performed following an optimized version of the protocol by Lu and Gillea. CfiU 
VisiSiD 1:169-176 (1994). using PCR-generated "P-labeled riboprobes Briefly, formalin-fixed, paraffin-embedded 
human tissues were sectioned, deparaffinized. deproteinated m proteuiase K (20 g/ifU) for 15 minutes at 37°C. and 
ftirther processed for in situ hybridization as described by Lu and Gillett. supra. A ("-P] UTP-labcled antisense 
riboprobe was generated from a PCR product and hybridized at SS'C overnight. TTie slides were dipped in Kodak 
NTB2 nuclear track emulsion and exposed for 4 weeks. 
"F-Riboprohe svinhixi.; 

6.0 Ml (125 mCi) of "P-UTP (Amcrsham BF 1002. SA<2000 Ci/mmol) were speed vac dried. To each 
tube containing dried ''P-UTP. the following ingredients were added: 
2.0 /il 5x transcription buffer 
15 l O^tl DTT(lOOmM) 

2.0 Ml NTP mix (2.5 mM • 10 /i: each of 10 mM GTP. CTP & ATP + 10 ^1 H,0) 

1.0m1UTP(50mM) 

I.OmI Rnasin 

I.O/jI DNA template (iMg) 
20 l.O/ilHjO 

1.0 Ml RNA polymerase (for PCR products T3 = AS. T7 = S. usually) 

nie nibes were incubated at 37-C for one hour. 1.0 mI RQl DNase were added, followed by incubation 
at 37»C for 15 minutes. 90 m1 TE (10 mM Tris pH 7.6/lmM EDTA pH 8.0) were added, and the mixnire was 
pipetted onto DE8I paper. The remaining soluuon was loaded in a Microcon-50 ultrafiltration unit, and spun using 
program 10 (6 minutes). The filtration urat was invened over a second tube and spun using program 2 (3 minutes) 
After the final recovery spin. 100 m1 TE were added. 1 m1 of the final product was pipetted on DE81 paper and 
counted in 6 ml of Biofluor II. 

TTie probe was nm on a TBE/urea gel. 1-3 m1 of the probe or 5 mI of RNA Mrk HI were added to 3 mI of 
loading buffer. Al^er heating on a 95'C heat block for diree minutes, the gel was immediately placed on ice. TTie 
weDs of gd were flushed, the sample loaded, and tun at 180-250 volts for 45 minutes. The gel was wrapped in saran 
wrap and exposed to XAR film with an intensifying screen in -TO'C freezer one hour to overnight. 

"P.HvhriiM«.rt«n 

A. Pretreatment of frozen sectinnt 

The slides were removed from the freezer, placed on aluminium trays and thawed at room temperature for 
5 minutes. The trays were placed in 55°C incubator for five minutes to reduce condensation. TTie slides were fixed 
for 10 minutes in 4% paraformaldehyde on ice in the fume hood, and washed in 0.5 x SSC for 5 minutes, at room 
temperamre (25 ml 20 x SSC + 975 ml SQ H,0). After dcprotcination in 0.5 Mg/ml prouinasc K for 10 minutes 
at 37-C (12.5 m1 of 10 mg/ml stock in 250 ml prewarmed RNase-frce RNAsc buffer), the sections were washed in 
0.5 x SSC for 10 minutes at room temperamrc. TTie sections were dehydrated in 70%. 95%. 100% ethanol. 2 
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minutes each. 

B. Pretreatment of paraffin-embedded sections 

The slides were deparaffirazed. placed m SQ H,0, and rinsed iwice in 2 x SSC ai room temperature, for 
5 minutes each lime The sections were deprotemaied in 20 /ig'mi protemase K (500 /il of 10 mg/mJ m 250 ml 
RNase-free RNase buffer: VC, 15 minutes) - human embr>'o. or 8 x proteinase K (100 ^il in 250 ml Rnase buffer, 
5 37"C, 30 minutes) - formalin tissues. Subsequent nnsing in 0.5 x SSC and dehydration were performed as described 
above. 

C. Prehvbridization 

The slides were laid out in a plastic box lined with Box buffer (4 x SSC. 50% formamide) - saturated filter 
paper. The tissue was covered with 50 /xl of hybridization buffer (3.75g Dextran Sulfate + 6 ml SQ H,0). vonexed 
10 and heated in the microwave for 2 minutes with the cap loosened. After cooling on ice, 18 75 ml formamide, 3 75 
ml 20 X SSC and 9 ml SQ H^O were added, die tissue was vonexed well, and incubated at 42**C for 1-4 hours. 

D. Hybfidj^augn 

1 .0 x 10* cpm probe and 1 ,0 /xl iRNA (50 mg/ml stock) per slide were heated at 95 ''C for 3 minutes. The 
slides were cooled on ice, and 48 hybridization buffer were added per slide. After vonexing. 50 /il ''P mix were 
15 added to 50 n\ prehybridizanon on slide. The slides were incubated overnight at 55°C. 

E. Washes 

Washing was done 2 x 10 minutes with 2xSSC. EDTA at room temperature (400 ml 20 x SSC + 16 ml 
0.25M EDTA, Vj-4L), followed by RNaseA treatment at 37''C for 30 minutes (500 /xl of 10 mg/ml in 250 ml Rnase 
buffer = 20 ;ig/ml). The slides were washed 2 x 10 minutes with 2 x SSC, EDTA at room icmperacure. The 
20 stringercy wash conditions were as follows: 2 hours at 55"C, 0. 1 x SSC, EDTA (20 ml 20 x SSC + 16 ml EDTA, 
Vr=4L). 

F. OligPHMckoadcs 

In situ analysis was performed on a variety of DNA sequences disclosed herein. The oligonucleotides 
cn^lpycd for these analyses were derived from the nucleotide sequences disclosed herein and generally range from 
25 about 40 to 55 nucleotides in length. 

G. Results 

In Sim analysis was performed on a variety of DNA sequences disclosed herein. The results from these 
analyses are as follows. 
(1) pNA291Ql-n22 (PRQ2Q0> 

30 Fetal : Lower limb expression in developing lower limb bones at the edge of the cartilagenous anlage (i.e. 

around the outside edge); in developing tendons, in vascular smooth muscle and in cells embracing developmg skeletal 
muscle myocytes and myotubes. Expression also observed at the epiphyseal growth plate. Lymph node expression 
in marginal sinus of developing lymph nodes. Thymus expression in the subcapsular region of the thymic conex, 
possibly repitsenting either the subcapsular cpidielial cells or the proliferating, double negative, thymocytes that are 

35 found in this region. Spleen is negative. Trachea expression in smooUi muscle. Brain (cerebral cortex) focal 
expression in conical neurones. Spinal cord negative. Small intestine expression in smooth muscle Thyroid - 
generalized expression overthyroid epithelium. Adrenal is negative. Liver expression in ductal plate cells. Stomach 
expression in mural smooth muscle. Fetal skin expression in basal layer of squamous epithelium. Placenta 
expression in interstitial cells in trophoblastic villi. Cord expression in wall of arteries and vein. 

279 



BNSOOaO' <W0__»46281A2J.> 



10 



15 



WO 99/46281 i^j^ PCT/US99/05028 

Cprnrptn;^ : Expression pattern suggests that PRO200 may be involved in cell differemiaiion/proiiferation 
High expression was observed at the following additional sites: Chimp ovary - granulosa cells of nutunng 
follicles, lower intensity signal observed over thecal cells. Chimp parathyroid - high expression over chief cells. 
Human fetal testis - moderate expression over stromal cells surrounding developing tubules. Human fetal lung - high 
expression over chondrocytes in developing bronchial tree, and low level expression over branching bronchial 
epithelium. Specific expression was not observed over the renal cell, gastric and colorac carcinomas. Fetal tissues 
examined (E12-E16 weeks) include: placenta, umbilical cord, liver, kidney, adrenals, thyroid, lungs, heart, great 
vessels, oesophagus, stomach, small intestine, spleen, thymus, pancreas, bram. eye. spinal cord, body wall, pelvis 
and lower limb. Adult tissues examined: liver, kidney, adrenal, myocardium, aoru. spleen, lymph node, pancitas. 
lung. skin, cerebral cortex (nn). hippocampus(tm). cerebellum(tm). penis, eye. bladder, stomach, gastric carcmoma. 
colon, colonic carcinoma and chondrosarcoma. Acetominophen induced liver injury and hepatic cirrhosis. 

(2) DNA30«67.niS fPROyiB) 
Low level expression over numerous epithelia including fetal small intestine, fetal thyroid, chimp gastric 

epithelium. Expression also seen over malignant cells in a renal cell carcinoma. Expression in fetal brain, over 
cortex. The distribution does not suggest an obvious fiinoion. Human fetal tissues exammed (E12-E16 weeks) 
include: placenta, umbilical cord, liver, kidney, adrenals, thyroid, lungs, hean. great vessels, oesophagus, stomach, 
small intestine, spleen, diymus. pancreas, brain, eye. spinal cord, body wall, pelvis and lower limb. Aduh human 
tissues examined: kidney (normal and end-stage), bladder, adrenal, spleen, lymph node, pancreas, hing. skin, eye 
(inc. reuna). colon, bladder, liver (normal, cirrhotic, acute failure), heart, clear cell carcinoma of kidney, gastric 
adenocarcinoma, colorectal carcinoma. Non^uman primate tissues examined: CHiirap tissues: salivary gland, 
stomach, thyroid, parathyroid, tongue, thymus, ovary, lymph node, peripheral nerve. Rhesus Monkey tissues: 
cerebral cortex, hippocampus, cerebellum, penis. 

(3) DNA4002I.HS4 (Pnryjfi'i) 
Low levels of expression observed in the placenta and over hematopoietic cells in the mouse fetal liver. No 

expr«sion was detected in either human fetal, adult or chimp lymph node and no expression was detected in human 
fetal or human adub spleen. Fetal tissues examined (E12-EI6 weeks) include: placena. umbilical cord, liver, kidney, 
adrenals, thyroid, hmgs. heart, great vessels, oesophagus, stomach, small intestine, spleen, thymus, pancreas, brain, 
eye. spinal cord, body walL pelvis and lower limb. Adult tissues examined: liver, kidney, adrenal, myocardium. 
30 aona. spleen, lymph node, pancreas, lung. skin, cerebral cortex (rm). hippocaropus(rm). cerebeUum(nn). brain 
infarct (human), cerebritis (human), penis, eye, bladder, stomach, gasuic carcinoma, colon, colonic carcinoma, 
thyroid (chimp), parathyroid (chimp) ovary (chimp) and chondrosarcoma. Acetominophen induced Kver injury and 
hepatic cirrhosis. 

35 (4) DNA39S2^-ll(>7fPRfi77^) 

Expression over epithelium of mouse embryo skin as well as over basal epithelium and dermis of human 
fetal skin. Basal epitheUal pegs of the squamous mucosa of the chimp tongue are also positive. Expression over a 
subset of cells b devekipiag glomeruli of fetal kidney, adult renal mbules. and over "thyroidized" epithelium in end- 
stage renal disease, tow expression in a renal cell carcinoma, probably over the epidieliai cells. Low level expression 
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over stromal cells in fetal lung. Expression over stromal cells in the apical portion of gastric glands. High expression 
in the lamina propria of the fetal small intestinal villi, normal colonic mucosa and over stromal cells in a colonic 
carcinoma. Strong expression over benign connective tissue cells in the hylanizcd stroma of a sarcoma. Expression 
over suoraal cells in the placental villi and the splenic red pulp. In the brain, expression over conical neurones. 
Connective tissue surrounding developing bones and over nerve sheath cells in the ferns. Fcul tissues cxanuned 

5 (E12-E16 weeks) include: placenta, umbilical cord, liver, kidney, adrenals, thyroid, lungs, hean, great vessels, 
oesophagus, stomach, small intesune. spleen, thymus, pancreas, brain, eye, spinal cord, body wall, pelvis and lower 
limb. Aduli ossues examined: liver, kidney, adrenal, myocardium, aorta, spleen, lymph node, pancreas, lung, skin, 
cerebral cortex (rm), hippocampus(rm), eye, stomach, gasuic carcinoma, colon, colonic carcinoma, thyroid (chimp), 
parathyroid (chimp) ovary (chimp) and chondrosarcoma. Acetominophen induced liver injury and hepatic cirrhosis. 

10 Expression was present m many cells in the outer layers (I and II) of the monkey cerebral cortex. A small 

subset of cells in the deeper conical layers also expressed mRNA for this chemokine homolog. Scanered cells within 
the molecular layers of the hippocampus and bordering the inner edge of the dentate gyrus contained chemokine 
homolog mRNA. No expression was detected within the cerebellar conex. Chemokine homolog expression is not 
observed in infarctcd brain, where cell death has occurred in the regions where the chemokine homolog normally is 

15 expressed. This probe could possibly serve as a marker of a subset of neurons of outer layers of the cerebral cortex 
and could possibly reveal neuronal migration disorders. Abnormal neuronal migration is a possible cause of some 
seizure disorders and schizophrciua. In order lo gain a bener appreciation of the disuibuiion of this mRNA we will 
test whether the probe will cross-hybridize with mouse brain tissue. 

Also shows intriguing and specific paiicms of hybridization within postnatal day (P)10 and adult mouse 

20 brains. In one sagittal section of PIO mouse brain, strong signal was observed scattered within the molecular layer 
of the hippocampus and inner edges of die dentate gyrus. Cells in the presubiculum were moderately labeled; the 
signal extended in a strong band through outer layers of the retrosplenial cones to the occipital cortex, where the 
signal diiranished to background levels. A small set of positive neurons were detected in deeper regions of PIO motor 
cortex: neurons in outer layers of PIO cortex did not exhibit signal above background levels. Moderate hybridization 

25 signal was also detected in the inferior colliculus. Chemokine homolog signal in the adult mouse brain was evaluated 
in three coronal sccdons at different levels. Strong signal was detected in the sepmm and in scanered neurons in the 
pontine nuclei and motor root of the trigeminal nerve; moderate signal was seen in the molecular layers of the 
hippocampus and outer layers of the retrosplenial conex. 

30 (5) nNA39979-1213 rPRQ296> 

Widespread expression in fetal in adult tissues. Expressed in a variety of fetal and adult epithelia, skeletal 
and cardiac muscle, developing (including retina) and adult CNS. thymic epithelium, placental villi, hepatocytcs in 
cirrhotic and acetaminophen induced toxicity. Highly expressed in hypertrophic chondrocytes in developing skeletal 
system.The overall expression panem, while not completely ovelapping (not expressed in glomeruli, more widely 

35 expressed in CNS). is not disimilar to VEGF. A possible role in angiogenesis should therefore be considered. Human 
fetal tissues examined (E12-E16 weeks) include: placenta, umbilical cord, liver, kidney, adrenals, thyroid, lungs, 
great vessels, stomach, small intesune, spleen, thymus, pancreas, brain, eye. spinal cord, body wall, pelvis, testis 
and lower limb. Adult human tissues examined: kidney (normal and cnd-suge). adrenal, spleen, lymph node, 
pancreas, lung, eye (inc. retina), bladder, liver (normal, cirrhotic, acute failure). Non-human primate tissues 
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examined: Chimp tissues: adrenal. Rhesus Monkey tissues: cerebral cortex, hippocampus, cerebellum 

(6) DNA52S9A-117ff fPffOffffl 
Expression over neuronal ccUs in fetal dorsal root ganglia, spinal cord, developing enteric neurons, cortical 

neurons. Low level expression also seen m placental trophoblast. In adult tissues the only site where notable 
expression was observed was the normal adult prostate; as such it may represent a possible prostate cell surface 
receptor target antigen. Studies to further characterize the expression in adult tissues seem warranted. Low level 
expression also observed in a liposarcoma. Fetal tissues examined (E12-EI6 weeks) include: placenta, umbilical 
cord, liver, kidney, adrenals, thyroid, lungs, heart, great vessels, oesophagus, stomach, small intestine, spleen, 
thymus, pancreas, bram. eye, spinal cord, body wall, pelvis and lower limb. Adult human tissues examined: liver, 
kidney, adrenal, myocardium, aorta, spleen, lung, skin, chondrosarcoma, eye, stomach, gastric carcinoma, colon, 
colonic carcinoma, renal cell carcinoma, prostate, bladder mucosa and gall bladder. Acetominophen induced liver 
injury and hepatic cirrhosis. Rhesus tissues examined: cerebral cortex (rm), hippocampus(rm), cerebellum. Chmip 
tissues examined: thyroid, parathyroid, ovary, nerve, tongue, thymus, adrenal, gastric mucosa and salivary gland. 
WIG.l(WISP.l), WIG-2 (WISP-2) and WIG-5 (WISP-3) expression in human breast carcinoma and normal breast 
15 tissue, Wig-2 in lung carcinoma, and Wig-5 in colon carcmoma. 

(7) DNA649Q7.n63 n>Rnn^9) 

In human fetal tissues there was strong specific expression over artrerial. venous, capillary and smusoidal 
endothelium in all tissues examined, except for fetal brain. In normal adult tissues expression was low to absent, but 
when present appeared expression was confined to the vasculanjre. Highest expression in adult tissues was observed 
regionally in vessels running within the white matter of rhesus brain - the significance of diis paoem is unclear. 
Elevated expression observed in vasculamre of many inflamed and diseased tissues, including tumor vasculature. 
In some of these tissues it was unclear if expression was soley confined to vascular endothelium. In the 15 lung 
tumors examined no expression was seen over the malignant epithelium, however, vascular expression was observed 
25 in many of the tumors and adjacent lung tissue. Moderate, apparently non-specific background, was seen with this 
probe over hyaliniscd collagen and sites of tissue necrosis. In the absccnce of a sense control, however, it is not 
possible to be absolutely certain that ail of this signal is non-specific. Some signal, also thought to be non-specific, 
was seen over the chimp gastric mucosa, transitional cell epithelium of human aduU bladder and fetal retina, 

30 (8) DNA49624,n7q fppn^4^) 

Expression of the ADAM family molecule, ADAM 12 (DNA49624-1279) observed in normal human lung, 
lung nunor, normal colon and colon carcinoma. 



20 



(9) DNA59?94.n«1 (PR^Ift^l) 

The expression of tfiis IL17 homologuc was evaluated m a panel consisting of normal adult and fetal tissues 
and tissues with inflammation, predominantly chronic lymphocytic inflammation. This panel is designed to 
specificaUy evahiate the expression pattern in immune mediated inflammatory disease of novel proteins that modulate 
T lymphocyte ftmction (stimulatory or inhibitory). This protein when expressed as an Ig-fusion protein was 
immunosiimulaiory in a dose dependent fashion in the human mixed lymphocyte reaction (MLR): it caused a 285% 

282 



BNSOOCID: <W0. 



> 994628 1A2.L> 



PCT/US«)9/05028 

WO 99/46281 



and 147% increase above ihe baseline sdmuiadon index when ualized ai two different conceniraiions (1 .0% and 0. 1 % 
of a 560 nM stock). Summan". expression was resiricied to muscle, certain types of smooth muscle in the adult and 
in skeletal and smooth muscle in the human fetus. Expression in adult human was in smooth muscle of mbular organs 
evaluated including colon and gall bladder. There no expression in the smooth muscle of vessels or bronchi. No 
adult human skeletal muscle was evaluated. In fetal tissues there was moderate to high diffuse expression in skeletal 
5 muscle the axial skeleton and limbs. There was weak expression in the smooth muscle of the intestinal wall but no 
expression in cardiac muscle, Aduh human tissues with expression: Colon, there was low level diffuse expression 
in the smooth muscle (tunica musculaiis) in 5 specimens widi chronic inflammatory bowel disease. Gall bladder: there 
was weak to low level expression in the smooth muscle of the gall bladder. Fetal human (issues with expression: 
there was moderate diffuse expression in skeletal muscle and weak tolow expression in smooth muscle; there was 
10 no expression in fetal hcan or any other fetal organ including liver, spleen, CNS, kidney, gut, lung. Human tissues 
with no expression: lung with chronic granulomatous inflammation and chronic bronchitis (5 patients), peripheral 
nerve, prostate, heart, placenta, liver (disease multiblock), brain (cerbrum and cerebellum), tonsil (reactive 
hyperplasia), peripheral lymph node, thymus. 
(10) nNA4S4l6.ll51 rPR0362^ 
15 jhe expression of this novel protein was evaluated in a variety of human and non-human primate tissues and 

was found to be highly restricted. Expression was present only in alveolar macrophages in the lung and in Kupffer 
cells of the hepatic smusoids. Expression in these cells was significantly increased when these distinct cell 
populations were activated. Though these two subpopulations of tissue macrophages are located in different organs, 
they have similar biological functions. Both types of these phagocytes act as biological filters to remove material from 
20 the blood stream or airways including pathogens, senescent cells and proteins and both are capable of secreting a 
wide variery of important proinflammatory cytokines. In inflamed lung (7 patient samples) expression was prominent 
in reactive alveolar macrophage cell populations defined as large, pale often vacuolated cells present singly or m 
aggregates within alveoli and was weak to negative in normal, non-reactive macrophages (single scanered cells of 
normal size). Expression in alveolar macrophages was increased during inflammation when these cells were both 
25 increased in numbers and size (activated). Despite the presence of histocytes in areas of inierstial inflammiion and 
peribronchial lymphoid hyperplasia in these tissues, expression was restricted to alveolar macrophages. Many of the 
inflamed lungs also had some degree of suppurative inflammation; expression was not present in neutrophilic 
granulocytes. In liver, there was strong expression in reactive/activated Kupffer cells in livers with acute 
cenirilobular necrosis (acctominophen toxicity) or fairly marked periportal inflammiion. However there was weak 
30 or no expression in Kupffer ceUs in normal liver or in liver with only mild inflammation or mild to moderate lobular 
hyperplasia/hypenrophy. Thus, as in the hmg, there was increased expression in acivatcd/reaiive cells. There was 
no cxpresskm of (his molecule in histiocytes/macropahges present in inflamed bowel, hyperplastic/reactive tonsil or 
normal lymph node. The lack of expression in these tissues which all contained histiocytic inflammation or resident 
macrophage populations strongly supports restricted expression to the unique macrophage subset populations defined 
35 as aWedar macrophage and hepatic Kupffer cells. Spleen or bone marrow was not available for evaluation. Human 
tissues evaluated which had no detectable expression included: Inflammatory bowel disease (7 patient samples with 
moderate to severe disease), tonsil with reactive hyperplasia, peripheral lymphnode. psoriatic skin (2 patient samples 
with mild to moderate disease), heart, peripheral nerve. Chimp tissues evaluated which had no detectable expression 
included: tongue, stomach, tiiymus. 
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(11) DNA52196-n4« ^ppnyii) 

Generalked low level signal in many tissues and in many cell lypes. While endoihelial cell expression was 
observed ii was not a promincm feature in eiiher feial, normal or diseased tissues. Human tissues: moderate 
expression over fetal Uver (mainly hepaiocyies), lung, skin, adrenal and heart. Fetal spleen, small mtesiinc. brain and 
eye arc ncgaiive. Aduh normal kidney, bladder epithelium, lung, adrenal, pancreas, skin - all negative. Expression 
in adult human liver (normal and diseased), renal tubules m end-stage renal disease, adipose tissue, sarcoma, colon, 
renal cell carcmoma, hepatocellular carcinoma, squamous cell carcinoma. Non human pnmaie tissues- chimp 
salivary gland, vessels, stomach, tongue, peripheral nerve, thymus, lymph node, thyroid and parathyroid. Rhesus 
spinal cord negative, conical and hippocampal neurones positive. 



10 Deposit of Material 

The following materials have been deposited with the American Type Culture Collection, 12301 Parklawn 
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Drive. Rockville. MD. 


USA (ATCC): 






ATCC 


P?p. No. 


Deposit Date 


DNA39987.1I84 


ATCC 


209786 


April 21, 1998 


DNA40625-1189 


ATCC 


209788 


April 21, 1998 


DNA23318-I21i 


ATCC 


209787 


April 21, 1998 


DNA39979-1213 


ATCC 


209789 


April 21, 1998 


DNA40594-1233 


ATCC 


209617 


February 5, 1998 


DNA45416-125I 


ATCC 


209620 


February 5, 1998 


DNA4541 9-1252 


ATCC 


209616 


February 5, 1998 


DNA52594-1270 


ATCC 


209679 


March 17, 1998 


DNA45234-1277 


ATCC 


209654 


March 5, 1998 


DNA49624.1279 


ATCC 


209655 


March 5, 1998 


DNA48309-1280 


ATCC 


209656 


March 5. 1998 


DNA46776-1284 


ATCC 


209721 


March 31, 1998 


DNA50980-1286 


ATCC 


209717 


March 31. 1998 


DNA50913-1287 


ATCC 


209716 


March 31. 1998 


DNA50914-I289 


ATCC 


209722 


March 31. 1998 


DNA48296-1292 


ATCC 


209668 


March 11. 1998 


DNA32284-1307 


ATCC 


209670 


March 11. 1998 


DNA36343-1310 


ATCC 


209718 


March 31, 1998 


DNA40571-1315 


ATCC 


209784 


April 21, 1998 


DNA41386.1316 


ATCC 


209703 


March 26, 1998 


DNA44194-1317 


ATCC 


209808 


April 28, 1998 


DNA45415-1318 


ATCC 


209810 


April 28, 1998 


DNA44I89-1322 


ATCC 


209699 


March 26, 1998 


DNA48304.1323 


ATCC 


209811 


April 28. 1998 


DNA49I52-1324 


ATCC 


209813 


April 28, 1998 


DNA49646-1327 


ATCC 


209705 


March 26. 1998 


DNA49631-1328 


ATCC 


209806 


April 28, 1998 


DNA49645-1347 


ATCC 


209809 


April 28, 1998 


DNA45493-1349 


ATCC 


209805 


April 28. 1998 


DNA48227.1350 


ATCC 


209812 


April 28. 1998 


DNA41404.1352 


ATCC 


209844 


May 6, 1998 


DNA44 196-1353 


ATCC 


209847 


May 6, 1998 


DNA52187-1354 


ATCC 


209845 


May 6, 1998 


DNA48328-1355 


ATCC 


209843 


May 6. 1998 


DNA56352.1358 


ATCC 


209846 


May 6. 1998 


DNA53971-1359 


ATCC 


209750 


April 7. 1998 


DNA509I9-136I 


ATCC 


209848 


May 6. 1998 


DNA44179-1362 


ATCC 


209851 


May 6, 1998 
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DNA54002-1367 


ATCC 209754 


April 7. 1998 




DNA53906-1368 


ATCC 209747 


April 7. 1998 




DNA52185.1370 


ATCC 209861 


May 14, 1998 




DNA53977-1371 


ATCC 209862 


May 14. 1998 




DNA57253-I382 


ATCC 209867 


May 14, 1998 


5 


DNA58847-1383 


ATCC 209879 


May 20. 1998 




DNA58747-1384 


ATCC 209868 


May 14, 1998 




DNA57689-1385 


ATCC 209869 


May 14, 1998 




DNA23330-1390 


ATCC 209775 


AprU 14. 1998 




DNA26847-1395 


ATCC 209772 


AprU 14, 1998 


10 


DNA53974-1401 


ATCC 209774 


April 14, 1998 




DNA57039-1402 


ATCC 209777 


April 14, 1998 




DNA57O33-1403 


ATCC 209905 


May 27, 1998 




DNA34353-1428 


ATCC 209855 


May 12, 1998 




DNA45417-1432 


ATCC 209910 


May 27. 1998 


15 


DNA39523-1192 


ATCC 209424 


October 31. 1997 




DNA44205-1285 


ATCC 209720 


March 31, 1998 




DNA5091 1-1288 


ATCC 209714 


March 31. 1998 




DNA48329-1290 


ATCC 209785 


April 21. 1998 




DNA48306-1291 


ATCC 209911 


May 27. 1998 


20 


DNA48336-1309 


ATCC 209669 


March 11. 1998 




DNA44184-1319 


ATCC 209704 


March 26. 1998 




DNA48314.1320 


ATCC 209702 


March 26. 1998 




DNA48333-1321 


ATCC 209701 


March 26. 1998 




DNA50920.1325 


ATCC 209700 


March 26. 1998 


25 


DNA5a988-1326 


ATCC 209814 


April 28, 1998 




DNA48331-1329 


ATCC 209715 


March 31. 1998 




DNA30867-1335 


ATCC 209807 


April 28. 1998 




DNA55737-1345 


ATCC 209753 


April 7. 1998 




DNA49829-1346 


ATCC 209749 


April 7. 1998 


30 


DNA52196.1348 


ATCC 209748 


April 7. 1998 




DNA56965.1356 


ATCC 209842 


May 6. 1998 




DNA56405-1357 


ATCC 209849 


May 6. 1998 




DNA57530-1375 


ATCC 209880 


May 20. 1998 




DNA56439-1376 


ATCC 209864 


May 14. 1998 


35 


DNA56409-1377 


ATCC 209882 


May 20, 1998 




DNA56n2-1379 


ATCC 209883 


May 20, 1998 




DNA56O45-1380 


ATCC 209865 


May 14, 1998 




DNA59294.1381 


ATCC 209866 


May 14. 1998 




DNA56433-1406 


ATCC 209857 


May 12. 1998 


40 


DNA53912-1457 


ATCC 209870 


May 14, 1998 




DNA5092M458 


ATCC 209859 


May 12. 1998 




DNA29101-1122 


ATCC 209653 


March 5, 1998 




DNA4002M154 


ATCC 209389 


October 17. 1997 




DNA42663-U54 


ATCC 209386 


October 17, 1997 


45 


DNA30943-l-n63-l 


ATCC 209791 


April 21, 1998 




DNA64907-116M 


ATCC 203242 


September 9. 1998 




DNA64908-1 163-1 


ATCC 203243 


September 9. 1998 




DNA39975-1210 


ATCC 209783 


April 21, 1998 




DNA43316-1237 


ATCC 209487 


November 21. 1997 


50 


DNA55800-1263 


ATCC 209680 


March 17. 1998 



55 



These deposit were made imder the provisions of the Budapest Treaty on the International Recognition of 
the Deposit of Microorganisms for the Purpose of Patem Procedure and the Regulations thereunder (Budapest 
Treaty). This assures maintenance of a viable culture of the deposit for 30 years from the dale of deposit. The 
deposits will be made available by ATCC under the tcmns of die Budapest Treaty, and subject to an agreement 
between Genentech. Inc. and ATCC, which assures permanem and unrestricted availabiUty of die progeny of the 
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culnire oi the deposit to the public upon issuance ot the pertinent U.S. patent or upon laying open to the public of any 
U.S. or for«en patent application, whichever comes first, and assures availability of the progeny to one detertnined 
by the U S. Commissioner of Patents and Trademarks to be enuUed thereto according to 35 USC § 122 andthe 
Commissioner s rules pursuant thereto (including 37 CFR § 1.14 with parucular reference to 886 OG 638). 

The assignee of the present application has agreed that if a culture of the matenals on deposit should die or 
be lost or destroyed when culnvated under suitable condiuons. the materials will be promptly replaced on notification 
with another of the same. Availability of the deposited material is not to be construed as a license to practice the 
invention m contravenaon of the rights granted under the authority of any government in accordance with its patem 
laws. 

The foregomg wntien specification is considered to be sufficient to enable one skilled in the art to practice 
the invention. The present invenuon is not to be limited m scope by the construct deposited, since the deposited 
embodiment is intended as a single illustration of certain aspects of the invention and any constructs that are 
functionaUy equivalent are withm the scope of this invention. The deposit of material herein does not consomte an 
admission that the wnncn description herem contained is inadequate to enable the practice of any aspect of the 
invention, mcluding the best mode thereof, nor is it to be construed as limiting the scope of the clauns to the specific 
iUustraoons that it represents. Indeed, various modificaaons of the mvention in addiuon to those shown and described 
herein will become apparent to those skilled in the art from the foregoing descripuon and fall within the scope of the 
appended claims. 
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I , Isolated nucleic acid having at least %Q% sequence idenuty to a nuclcoade sequence that encodes 
a polypeptide composing an amino acid sequence selected from the group consisimg of the amino acid sequci»ce 
shown in Figure 2 (SEQ ID NO 2). Figure 4 (SEQ ID N0:7). Figure 9 (SEQ ID NO:I9). Figure 11 (SEQ ID 
NO:28). Figure 15 (SEQ ID ^^0:36). Figure 20 fSEQ ID NO:45). Figure 22 (SEQ ID NO:52). Figure 24 (SEQ ID 
S NO:59). Figure 26 (SEQ ID NO:64). Figure 28 (SEQ ID NO:69). Figure 30 (SEQ ID NO:74). Figure 33 (SEQ ID 
NO:85). Figure 35 (SEQ ID NO:90). Figure 37 (SEQ ID NO:97). Figure 39 (SEQ ID NO: 102). Figure 41 (SEQ ID 
NO:I09). Figure 43 (SEQ ID NOUM). Figure 45 (SEQ ID N0:n9). Figure 47 (SEQ ID NO:124). Figure 49 (SEQ 
ID NO:132), Figure 51 (SEQ ID NO:137). Figure 53 (SEQ ID NO:145). Figure 55 (SEQ ID NO:150). Figure 59 
(SEQ ID NO:I57). Figure 61 (SEQ ID NO:I62). Figure 63 (SEQ ID NO:I69). Figure 66 (SEQ ID NO:178). Figure 
10 68 (SEQ ID NO: 183). Figure 70 (SEQ ID NO: 190). Figure 73 (SEQ ID NO: 196), Figure 75 (SEQ ID NO:206). 
Figure 77 (SEQ ID N0:211). Figure 79 (SEQ ID NO:216). Figure 81 (SEQ ID NO:22i). Figure 83 (SEQ ID 
N0.226). Figure 85 (SEQ ID N0.23n, Figure 87 (SEQ ID N0.236). Figure 89 (SEQ ID NO:245). Figure 91 (SEQ 
ID NO:254). Figure 93 (SEQ ID NO:259). Figure 95 (SEQ ID N0:264), Figure 98 (SEQ ID NO:270), Figure 109 
(SEQ ID NO:284). Figure 118 (SEQ ID NO:296). Figure 120 (SEQ ID N0:301). Figure 122 (SEQ ID NO:303). 
15 Figure 125 (SEQ ID NO:309), Figure 129 (SEQ ID NO:322). Figure 132 (SEQ ID NO:330). Figure 136 (SEQ ID 
NO:337), Figure 139 (SEQ ID NO:346), Figure 142 (SEQ ID NO:352). Figure 145 (SEQ ID NO:358). Figure 147 
(SEQ ID NO:363). Figure 149 (SEQ ID NO:370). Figure 151 (SEQ ID NO:375). Figure 153 (SEQ ID NO:380). 
Figure 155 (SEQ ID NO:385), Figure 157 (SEQ ID NO:390). Figure 159 (SEQ ID NO:395), Figure 161 (SEQ ID 
NO:40(9. Figure 163 (SEQ ID NO:405). Figure 165 (SEQ ID NO:410). Figure 167 (SEQ ID NO:415). Figure 169 
20 (SEQ ID NO:420). Figure 171 (SEQ ID NO:425). Figure 173 (SEQ ID NO:430). Figure 177 (SEQ ID NO:437). 
Figure 179 (SEQ ID NO:442). Figure 181 (SEQ ID N0.447). Figure 183 (SEQ ID NO:452). Figure 185 (SEQ ID 
NO:454). Figure 187 (SEQ ID NO:456). Figure 190 (SEQ ID NO:459), Figure 192 (SEQ ID NO:464). Figure 194 
(SEQ ID NO:466), Figure 196 (SEQ ID NO:468). Figure 198 (SEQ ID NO:470). Figure 200 (SEQ ID NO:472), 
Figure 202 (SEQ ID NO;477), Figure 204 (SEQ ID NO:483), Figure 207 (SEQ ID NO:488). Figure 209 (SEQ ID 
25 NO:496). Figure 211 (SEQ ID NO:498), Figure 213 (SEQ ID NO:506). Figure 215 (SEQ ID NO:508). Figure 217 
(SEQ ID NO:510). Figure 219 (SEQ ID N0:515). Figure 222 (SEQ ID NO:523) and Figure 225 (SEQ ID NO:526). 

2. The nucleic acid sequence of Claim 1, wherein said nucleotide sequence comprises a nucleotide 
sequence selected from the group consisting of the sequence shown in Figure 1 (SEQ ID NO: I), Figure 3 (SEQ ID 

30 N0:6). Figure 8 (SEQ ID N0:18). Figure 10 (SEQ ID NO:27). Figure 14 (SEQ ID NO:35). Figure 19 (SEQ ID 
NO:44). Figure 21 (SEQ ID N0:51). Figure 23 (SEQ ID NO:58). Figure 25 (SEQ ID NO:63). Figure 27 (SEQ ID 
mm. Figure 29 (SEQ ID NO:73). Figure 32 (SEQ ID NO:84). Figure 34 (SEQ ID N0:89). Figure 36 (SEQ ID 
NO:96). Figure 38 (SEQ ID NO: 101). Figure 40 (SEQ ID NO: 108). Figure 42 (SEQ ID NO: 1 13). Figure 44 (SEQ 
ID N0:118). Figure 46 (SEQ ID NO:123). Figure 48 (SEQ ID N0:l3l). Figure 50 (SEQ ID N0:136). Figure 52 

35 (SEQ ID NO: 144). Figure 54 (SEQ ID NO: 149). Figure 58 (SEQ ID NO: 156), Figure 60 (SEQ ID NO: 161). Figure 
62 (SEQ ID NO: 168). Figure 65 (SEQ ID NO: 177). Figure 67 (SEQ ID NO: 182), Figure 69 (SEQ ID NO: 189). 
Figure 72 (SEQ ID NO: 195). Figure 74 (SEQ ID NO:205). Figure 76 (SEQ ID NO:210). Figure 78 (SEQ ID 
NO:215). Figure 80 (SEQ ID NO:220). Figure 82 (SEQ ID NO:225). Figure 84 (SEQ ID NO:230). Figure 86 (SEQ 
ID NO:235). Figure 88 rSEQ ID NO:244). Figure 90 (SEQ ID NO:253). Figure 92 (SEQ ID NO:258). Figure 94 
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(SEQ ID NO:263). Figure 97 fSEQ ID NO:269). Figure 108 fSEQ ID NO:283). Figure 117 fSEQ ID NO:295). 
Figure 119 (SEQ ID NO:300). Figure 121 (SEQ ID NO:302). Figure 124 (SEQ ID NO:308). Figure 128 (SEQ ID 
NO:32n. Figure 131 (SEQ ID NO:329). Figure 135 (SEQ ID NO:336). Figure 138 (SEQ ID NO:345). Figure 141 
(SEQ ID NO:35n. Figure 144 (SEQ ID NO:3571. Figure 146 (SEQ ID NO:362). Figure 148 (SEQ ID NO:369). 
Figure 150 (SEQ ID NO:374). Figure 152 (SEQ ID NO:379). Figure 154 (SEQ ID NO:384). Figure 156 (SEQ ID 
NO:389). Figure 158 (SEQ ID NO:394). Figure 160 (SEQ ID NO:399). Figure 162 (SEQ ID NO-404), Figure 164 
(SEQ ID NO:409). Figure 166 (SEQ ID NO:4i4). Figure 168 (SEQ ID NO:419). Figure 170 (SEQ ID NO:424). 
Figure 172 (SEQ ID NO:429). Figure 176 (SEQ ID NO:436). Figure 178 (SEQ ID N0:44l). Figure 180 (SEQ ID 
NO:446). Figure 182 (SEQ ID NO:451). Figure 184 (SEQ ID NO:453). Figure 186 (SEQ ID NO:455), Figure 189 
(SEQ ID NO:458). Figure 191 (SEQ ID NO:463). Figure 193 (SEQ ID NO:465). Figure 195 (SEQ ID NO:467). 
Figure 197 (SEQ ID NO:469). Figure 199 (SEQ ID NO:471). Figure 201 (SEQ ID NO:476). Figure 203 (SEQ ID 
NO:482). Figure 206 (SEQ ID NO:487). Figure 208 (SEQ ID NO:495), Figure 210 (SEQ ID NO:497). Figure 212 
(SEQ ID NO:505). Figure 214 (SEQ ID NO:507). Figure 216 (SEQ ID NO:509). Figure 218 (SEQ ID NO:514). 
Figure 221 (SEQ ID NO:522) and Figure 224 (SEQ ID NO:525). 

^- nucleic acid of Claim 1 . wherein said nucleotide sequence compnses a nucleotide sequence 

selected from the group consisting of the fuU-length coding sequence of the sequence shown in Figure 1 (SEQ ID 
N0:1). Figure 3 (SEQ ID N0:6). Figure 8 (SEQ ID NO: 18). Figure 10 (SEQ ID NO:27). Figure 14 (SEQ ID 
NO:35). Figure 19 (SEQ ID NO:44). Figure 21 (SEQ ID N0:51). Figure 23 (SEQ ID NO:58). Figure 25 (SEQ ID 
NO:63). Figure 27 (SEQ ID NO:68). Figure 29 (SEQ ID NO:73). Figure 32 (SEQ ID NO:84). Figure 34 (SEQ ID 
20 NO:89). Figure 36 (SEQ ID NO:96). Figure 38 (SEQ ID NO: 101). Figure 40 (SEQ ID NO: 108). Figure 42 (SEQ 
ID NO: 113). Figure 44 (SEQ ID N0:1I8). Figure 46 (SEQ ID NO: 123). Figure 48 (SEQ ID NO: 131). Figure SO 
(SEQ ID N0:I36). Figure 52 (SEQ ID NO:144). Figure 54 (SEQ ID NO:149). Figure 58 (SEQ ID NO:156). Figure 
60 (SEQ ID N0:161). Figure 62 (SEQ ID N0:I68). Figure 65 (SEQ ID NO:177). Figure 67 (SEQ ID NO:182). 
Figure 69 (SEQ ID NO: 189). Figure 72 (SEQ ID NO: 195), Figure 74 (SEQ ID NO:205). Figure 76 (SEQ ID 
25 NO:2IO). Figure 78 (SEQ ID NO:215). Figure 80 (SEQ ID NO:220). Figure 82 (SEQ ID NO:225). Figure 84 (SEQ 
ID NO:230). Figure 86 (SEQ ID NO:235). Figure 88 (SEQ ID NO:244). Figure 90 (SEQ ID NO:253). Figure 92 
(SEQ ID NO:258). Figure 94 (SEQ ID NO:263). Figure 97 (SEQ ID NO:269). Figure 108 (SEQ ID NO:283). Figure 
117 (SEQ ID NO:295). Figure 119(SEQ ID NO:300). Figure 121 (SEQ ID NO:302), Figure 124 (SEQ ID NO:308). 
Figure 128 (SEQ ID NO:321). Figure i3l (SEQ ID NO:329). Figure 135 (SEQ ID NO:336). Figure 138 (SEQ ID 
30 NO:345). Figure 141 (SEQ ID NO:351). Figure 144 (SEQ ID NO:357). Figure 146 (SEQ ID NO:362). Figure 148 
(SEQ ID NO:369). Figure 150 (SEQ ID NO:374). Figure 152 (SEQ ID NO:379). Figure 154 (SEQ ID NO:384). 
Figure 156 (SEQ ID NO:389). Figure 158 (SEQ ID NO:394). Figure 160 (SEQ ID NO:399). Figure 162 (SEQ ID 
NO:404). Figure 164 (SEQ ID NO:409). Figure 166 (SEQ ID NO:414), Figure 168 (SEQ ID NO:4I9). Figure 170 
(SEQ ID NO:424). Figure 172 (SEQ ID NO:429). Figure 176 (SEQ ID NO:436). Figure 178 (SEQ ID NO:44|). 
Figure 180 (SEQ ID NO:446), Figure 182 (SEQ ID NO:451). Figure 184 (SEQ ID NO:453). Figure 186 (SEQ ID 
NO:455). Figure 189 (SEQ ID NO:458). Figure 191 (SEQ ID NO:463). Figure 193 (SEQ ID NO:465). Figure 195 
(SEQ ID NO:467). Figure 197 (SEQ ID NO:469). Figure 199 (SEQ ID NO:471). Figure 201 (SEQ ID NO:476). 
Figure 203 (SEQ ID NO:482). Figure 206 (SEQ ID NO:487). Figure 208 (SEQ ID NO:495). Figure 210 (SEQ ID 
NO:497). Figure 212 (SEQ ID NO:505). Figure 214 (SEQ ID NO:507). Figure 216 (SEQ ID NO:509). Figure 218 

288 



35 



(SEQ ID N0:514). Figure 221 (SEO ID NO:522) or Figure 224 (SEQ ID NO:525) 



PCT/US99/05028 

WO 99/46281 



4. Isolated nucleic acid which compnses the full-length coding sequence of the DNA deposited under 
accession number ATCC 209791. ATCC 209786, ATCC 209788. ATCC 209787. ATCC 209789. ATCC 209617, 
ATCC 209620. ATCC 209616. ATCC 209679. ATCC 209654. ATCC 209655. ATCC 209656. ATCC 20972L 

5 ATCC 209717. ATCC 209716. ATCC 209722. ATCC 209668, ATCC 209670. ATCC 209718, ATCC 209784, 
ATCC 209703. ATCC 209808. ATCC 209810, ATCC 209699, ATCC 209811. ATCC 209813, ATCC 209705. 
ATCC 209806. ATCC 209809. ATCC 209805. ATCC 209812, ATCC 209844, ATCC 209847. ATCC 209845, 
ATCC 209843, ATCC 209846. ATCC 209750, ATCC 209848. ATCC 209851. ATCC 209754. ATCC 209747. 
ATCC 209861. ATCC 209862, ATCC 209867. ATCC 209879, ATCC 209868, ATCC 209869. ATCC 209775. 

10 ATCC 209772. ATCC 209774. ATCC 209777, ATCC 209905. ATCC 209855. ATCC 209910. ATCC 209424, 
ATCC 209720. ATCC 209714, ATCC 209785. ATCC 209911. ATCC 209669. ATCC 209704, ATCC 209702, 
ATCC 209701. ATCC 209700. ATCC 209814. ATCC 209715, ATCC 209807, ATCC 209753. ATCC 209749, 
ATCC 209748. ATCC 209842. ATCC 209849. ATCC 209880, ATCC 209864. ATCC 209882. ATCC 209883. 
ATCC 209865. ATCC 209866. ATCC 209857. ATCC 209870. ATCC 209859. ATCC 209653. ATCC 209389. 

15 ATCC 209386. ATCC 203242, ATCC 203243, ATCC 209783. ATCC 209487 or ATCC 209680. 

5. A vector comprising the nucleic acid of Claim 1 . 

6. The vector of Claim 5 operably linked to control sequences recognized by a host cell transformed 
20 with the vector. 

7. A host cell comprising the vector of Claim 5. 

8 The host cell of Claim 7 wherein said cell is a CHO cell. 

25 

9. The host cell of Claim 7 wherein said cell is an E, coU, 



10 The host cell of Claim 7 wherein said cell is a yeast cell. 



30 11. A process for producmg a PRO poiypcpudes comprising culturing the host cell of Claim 7 under 

conditions suitable for expression of said PRO polypeptide and recovermg said PRO polypeptide from the cell culture. 

12. Isolated nanve sequence PRO polypeptide having at least 80% sequence identity to an ammo actd 
sequence selected from the group consisting of the anuno acid sequence shown in Figure 2 (SEQ ID N0:2). Figure 
35 4 (SEQ ID N0:7). Figure 9 fSEQ ID NO: 19), Figure 1 1 (SEQ ID NO:28). Figure 15 (SEQ ID NO:36), Figure 20 
(SEQ ID NO:45). Figure 22 (SEQ ID NO:52), Figure 24 (SEQ ID NO:59). Figure 26 (SEQ ID NO:64). Figure 28 
(SEQ ID NO:69), Figure 30 (SEQ ID NO:74). Figure 33 (SEQ ID NO:85). Figure 35 (SEQ ID NO:90), Figure 37 
(SEQ ID NO:97). Figure 39 (SEQ ID NO: 102). Figure 41 (SEQ ID NO: 109). Figure 43 (SEQ ID N0:114). Figure 
45 (SEQ ID N0:I19). Figure 47 (SEQ ID NO:124). Figure 49 (SEQ ID NO:l32), Figure 51 (SEQ ID NO:l37). 
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Figure 53 (SEQ ID NO:145). Figure 55 (SEQ ID NO:150). Figure 59 (SEO ID^:I57). Figure 61 (SEQ ID 
NO:162). figure 63 fSEQ ID NO:169). Figure 66 (SEQ ID NO: 178). Figure 68 (SEQ ID NO: 183). Figure 70 (SEQ 
ID NO: 190), Figure 73 (SEQ ID NO: 196). Figure 75 (SEQ ID NO:206). Figure 77 (SEQ ID N0:2ll), Figure 79 
(SEQ ID NO:216). Figure 81 (SEQ ID N0:221). Figure 83 (SEQ ID N0.226). Figure 85 (SEQ ID N0:231). Figure 
87 (SEQ ID N0:236). Figure 89 (SEQ ID NO:245). Figure 91 (SEQ ID N0.254), Figure 93 (SEQ ID N0:259). 
Figure 95 (SEQ ID NO:264). Figure 98 (SEQ ID NO:270). Figure 109 (SEQ ID NO:284). Figure 118 (SEQ ID 
NO:296). Figure 120 (SEQ ID NO-301). Figure 122 (SEQ ID NO:303). Figure 125 (SEQ ID NO:309). Figure 129 
(SEQ ID NO:322), Figure 132 (SEQ ID NO:330). Figure 136 (SEQ ID NO:337). Figure 139 (SEQ ID NO:346). 
Figure 142 (SEQ ID NO:352). Figure 145 (SEQ ID NO:358), Figure 147 (SEQ ID NO:363). Figure 149 (SEQ ID 
NO:370), Figure 151 (SEQ ID NO:375). Figure 153 (SEQ ID NO:380). Figure 155 (SEQ ID NO:385>. Figure 157 
(SEQ ID NO:390). Figure 159 (SEQ ID NO:395). Figure 161 (SEQ ID NO:400). Figure 163 (SEQ ID NO:405), 
Figure 165 (SEQ ID NO.4I0). Figure 167 (SEQ ID NO:415), Figure (69 (SEQ ID NO:420), Figure 171 (SEQ ID 
N0.425). Figure 173 (SEQ ID NO 430). Figure 177 (SEQ ID NO:437). Figure 179 (SEQ ID NO:442). Figure 181 
(SEQ ID NO:447), Figure 183 (SEQ ID NO:452), Figure 185 (SEQ ID NO 454). Figure 187 (SEQ ID NO:456). 
Figure 190 (SEQ ID N0.459). Figure 192 (SEQ ID NO:464). Figure 194 (SEQ ID NO:466). Figure 196 (SEQ ID 
1 5 NO:468). Figure 198 (SEQ ID NO-470). Figure 200 (SEQ ID NO:472). Figure 202 (SEQ ID NO:477). Figure 204 
(SEQ ID NO:483). Figure 207 (SEQ ID NO:488). Figure 209 (SEQ ID NO:496). Figure 211 (SEQ ID NO:498). 
Figure 213 (SEQ ID NO:506). Figure 215 (SEQ ID NO:508). Figure 217 (SEQ ID NO:510>. Figure 219 (SEQ ID 
NO:5I5). Figure 222 (SEQ ID NO:523) and Figure 225 (SEQ ID NO:526). 
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'3. Isolated PRO polypepode having at least 80% sequence identity to the amino acid sequence encoded 
by the nucleotide deposited under accession number ATCC 209791 . ATCC 209786, ATCC 209788. ATCC 209787, 
ATCC 209789. ATCC 209617. ATCC 209620. ATCC 209616. ATCC 209679. ATCC 209654. ATCC 209655. 
ATCC 209656. ATCC 209721. ATCC 209717. ATCC 209716, ATCC 209722. ATCC 209668. ATCC 209670. 
ATCC 209718. ATCC 209784. ATCC 209703. ATCC 209808, ATCC 209810. ATCC 209699. ATCC 209811. 
25 ATCC 209813. ATCC 209705. ATCC 209806. ATCC 209809. ATCC 209805. ATCC 209812. ATCC 209844. 
ATCC 209847. ATCC 209845. ATCC 209843. ATCC 209846. ATCC 209750. ATCC 209848. ATCC 209851. 
ATCC 209754. ATCC 209747. ATCC 209861. ATCC 209862. ATCC 209867. ATCC 209879. ATCC 209868. 
ATCC 209869, ATCC 209775. ATCC 209772. ATCC 209774. ATCC 209777. ATCC 209905. ATCC 209855. 
ATCC 209910. ATCC 209424, ATCC 209720. ATCC 209714. ATCC 209785. ATCC 209911. ATCC 209669. 
ATCC 209704. ATCC 209702, ATCC 209701. ATCC 209700, ATCC 209814. ATCC 209715. ATCC 209807. 
ATCC 209753, ATCC 209749, ATCC 209748. ATCC 209842, ATCC 209849. ATCC 209880, ATCC 209864, 
ATCC 209882. ATCC 209883. ATCC 209865. ATCC 209866, ATCC 209857. ATCC 209870. ATCC 209859. 
ATCC 209653. ATCC 209389. ATCC 209386. ATCC 203242. ATCC 203243. ATCC 209783, ATCC 209487 or 
ATCC 209680. 



14. A chimcnc molecule comprising a polypeptide according to Claim 12 fiised to a heterologous amino 
acid sequence. 



15. 



The chimeric molecule of Claun 14 wherem said heterologous ammo acid sequence is an epitope 
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tag sequence. 

!6. The chimeric molecule of Claim 14 wherem said heterologous amino acid sequence is a Fc region 
of an immunoglobulin. 

5 17. An antibody which specifically bmds to a PRO polypepude according to Claim 12. 

18. The anubody of Claim 17 wherein said antibody is a monoclonal antibody 

19. An isolated nucleic acid molecule which has at least 80% sequence identicy to a nucleic acid which 
10 comprises a nucleodde sequence selected from the group consisting of that shown in Figure 5 (SEQ ID NO: 8). Figure 

6 (SEQ ID N0:9). Figure 7 (SEQ ID NO:10). Figure 12 (SEQ ID NO:29). Figure 13 {SEQ ID NO-30), Figure 16 
(SEQ ID NO:37), Figure 17 (SEQ ID NO:38), Figure 18 (SEQ ID NO:39). Figure 31 (SEQ ID NO:75). Figure 64 
(SEQ ID NO:170), Figure 71 (SEQ ID N0:19n. Figure 96 (SEQ ID NO:265), Figure 99 (SEQ ID NO 271), Figure 
100 (SEQ ID N0:272), Figure 101 (SEQ ID N0:273), Figure 102 (SEQ ID N0:274). Figure 103 (SEQ ID N0:275). 

15 Figure 104 (SEQ ID N0:276). Figure 105 (SEQ ID N0:277). Figure 106 (SEQ ID NO:278>. Figure 107 (SEQ ID 
NO:279>. Figure 1 10 (SEQ ID NO:285), Figure III (SEQ ID N0:286), Figure 112 (SEQ ID NO:287>. Figure 113 
(SEQ ID NO:288). Figure 114 (SEQ ID NO:289). Figure 115 (SEQ ID NO:290), Figure 116 (SEQ ID NO:291). 
Figure 123 (SEQ ID NO:304). Figure 126 (SEQ ID NO:310), Figure 127 (SEQ ID N0:3in. Figure 130 (SEQ ID 
NO:323). Figure 133 (SEQ ID N0:33l). Figure 134 (SEQ ID N0;332), Figure 137 (SEQ ID NO:3381. Figure 140 

20 (SEQ ID NO:347). Figure 143 (SEQ ID NO:353). Figure 174 (SEQ ID NO:43l), Figure 175 (SEQ ID N0:432)> 
Figure 188 (SEQ ID NO:457), Figure 205 (SEQ ID N0:484), Figure 220 (SEQ ID N0:5I6V Figure 223 (SEQ ID 
NO:524>. Figure 226 (SEQ ID N0:527). Figure 227 (SEQ ID NO:528) and Figure 228 (SEQ ID N0:529). 
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FIGURE 1 

CCAGGTCCAACTGCACCTCGGTTCTATCGATTGAATTCCCCGGGGATCCTCTAGAGATCCCT 
CGACCTCGACCCACGCGTCCGCCAAGCTGGCCCTGCACGGCTGCAAGGGAGGCTCCTGTGGA 
CAGGCCAGGCAGGTGGGCCTCAGGAGGTGCCTCCAGGCGGCCAGTGGGCCTGAGGCCCCAGC 
AAGGGCTAGGGTCCATCTCCAGTCCCAGGACACAGCAGCGGCCACCATGGCCACGCCTGGGC 
TCCAGCAGCATCAGCAGCCCCCAGGACCGGGGGAGGCACAGGTGGCCCCCACCACCCGGAGG 
AGCAGCTCCTGCCCCTGTCCGGGGGMSACTGATTCTCCTCCGCCAGGCCACCCAGAGGAGA 




GTCCCGGGCTCACGGGGACCCTGTCTCCGAGTCGTTCGTGCAGCGTGTGTACCAGCCCTTCC 
TCACCACCTGCGACGGGCACCGGGCCTGCAGCACCTACCGAACCATTTATAGGACCGCCTAC 
CGCCGCAGCCCTGGGCTGGCCCCTGCCAGGCCTCGCTACGCGTGCTGCCCCGGCTGGAAGAG 
GACCAGCGGGCTTCCTGGGGCCTGTGGAGCAGCAATATGCCAGCCGCCATGCCGGAACGGAG 




L ^r^^r^ X >j 1 vjur^ I urtrt i uuAo i i AQj(jA(jij(j(jL(3C3CTeTCCCCAGCGCTGCATCAACACCGC 
CGGCAGTTACTGGTGCCAGTGTTGGGAGGGGCACAGCCTGTCTGCAGACGGTACACTCTGTG 
TGCCCAAGGGAGGGCCCCCCAGGGTGGCCCCCAACCCGACAGGAGTGGACAGTGCAATGAAG 
GAAGAAGTGCAGAGGCTGCAGTCCAGGGTGGACCTGCTGGAGGAGAAGCTGCAGCTGGTGCT 
GGCCCCACTGCACAGCCTGGCCTCGCAGGCACTGGAGCATGGGCTCCCGGACCCCGGCAGCC 
TCCTGGTGCACTCCTTCCAGCAGCTCGGCCGCATCGACTCCCTGAGCGAGCAGATTTCCTTC 
CTGGAGGAGCAGCTGGGGTCCTGCTCCTGCAAGAAAGACTCGIS&CTGCCCAGCGCCCCAGG 
CTGGACTGAGCCCCTCACGCCGCCCTGCAGCCCCCATGCCCCTGCCCAACATGCTGGGGGTC 
CAGAAGCCACCTCGGGGTGACTGAGCGGAAGGCCAGGCAGGGCCTTCCTCCTTTTCCTCCTC 
CCCTTCCCTCGGGAGGGTCCCCAGACCCTGGCATGGGATGGGCTGGGATTTTTTTTGTGAAT 
CCACCCCTGGCTACCCCCACCCTGGTTACCCCAACGGCATCCCAAGGCCAGGTGGGCCCTCA 
GCTGAGGGAAGGTACGAGTTCCCCTGCTGGAGCCTGGGACCCATGGCACAGGCCAGGCAGCC 
CGGAGGCTGGGTGGGGCCTCAGTGGGGGCTGCTGCCTGACCCCCAGCACAATAAAAATGAAA 
CGTGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCGACTCT 

AGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATGGT 
TACAAAT 
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ElQimE-2 



MTDSPPPGHPEEKATPPGGTGHEGLSGGAADVASGVGSGRHRARLPARPLGCVLSRAHGDPV 
SESFVQRVYQPFLTTCDGHRACSTYRTIYRTAYRRSPGLAPARPRYACCPGWKRTSGLPGAC 
GAAICQPPCRNGGSCVQPGRCRCPAGWRGDTCQSDVDECSARRGGCPQRCINTAGSYWCQCW 
EGHSLSADGTLCVPKGGPPRVAPNPTGVDSAMKEEVQRLQSRVDLLEEKLQLVLAPLHSLAS 
QALEHGLPDPGSLLVKSFQQLGRIDSLSEQISFLEEQLGSCSCKKDS 
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FIGURE 3 

CGCTCGCCCCGTCGCCCCTCGCCTCCCCGCAGAGTCCCCTCGCGGCAGCAGATGTGTGTGGG 
GTCAGCCCACGGCGGGGACTAISGTGAAATTCCCGGCGCTCACGCACTACTGGCCCCTGATC 
CGGTTCTTGGTGCCCCTGGGCATCACCAACATAGCCATCGACTTCGGGGAGCAGGCCTTGAA 
CCGGGGCATTGCTGCTGTCAAGGAGGATGCAGTCGAGATGCTGGCCAGCTACGGGCTGGCGT 
ACTCCCTCATGAAGTTCTTCACGGGTCCCATGAGTGACTTCAAAAATGTGGGCCTGGTGTTT 
GTGAACAGCAAGAGAGACAGGACCAAAGCCGTCCTGTGTATGGTGGTGGCAGGGGCCATCGC 
TGCCGTCTTTCACACACTGATAGCTTATAGTGATTTAGGATACTACATTATCAATAAACTGC 
ACCATGTGGACGAGTCGGTGGGGAGCAAGACGAGAAGGGCCTTCCTGTACCTCGCCGCCTTT 
CCTTTCATGGACGCAATGGCATGGACCCATGCTGGCATTCTCTTAAAACACAAATACAGTTT 
CCTGGTGGGATGTGCCTCAATCTCAGATGTCATAGCTCAGGTTGTTTTTGTAGCCATTTTGC 
TTCACAGTCACCTGGAATGCCGGGAGCCCCTGCTCATCCCGATCCTCTCCTTGTACATGGGC 
GCACTTGTGCGCTGCACCACCCTGTGCCTGGGCTACTACAAGAACATTCACGACATCATCCC 
TGACAGAAGTGGCCCGGAGCTGGGGGGAGATGCAACAATAAGAAAGATGCTGAGCTTCTGGT 
GGCCTTTGGCTCTAATTCTGGCCACACAGAGAATCAGTCGGCCTATTGTCAACCTCTTTGTT 
TCCCGGGACCTTGGTGGCAGTTCTGCAGCCACAGAGGCAGTGGCGATTTTGACAGCCACATA 
CCCTGTGGGTCACATGCCATACGGCTGGTTGACGGAAATCCGTGCTGTGTATCCTGCTTTCG 
ACAAGAATAACCCCAGCAACAAACTGGTGAGCACGAGCAACACAGTCACGGCAGCCCACATC 
AAGAAGTTCACCTTCGTCTGCATGGCTCTGTCACTCACGCTCTGTTTCGTGATGTTTTGGAC 
ACCCAACGTGTCTGAGAAAATCTTGATAGACATCATCGGAGTGGACTTTGCCTTTGCAGAAC 
TCTGTGTTGTTCCTTTGCGGATCTTCTCCTTCTTCCCAGTTCCAGTCACAGTGAGGGCGCAT 
CTCACCGGGTGGCTGATGACACTGAAGAAAACCTTCGTCCTTGCCCCCAGCTCTGTGCTGCG 
GATCATCGTCCTCATCGCCAGCCTCGTGGTCCTACCCTACCTGGGGGTGCACGGTGCGACCC 
TGGGCGTGGGCTCCCTCCTGGCGGGCTTTGTGGGAGAATCCACCATGGTCGCCATCGCTGCG 
TGCTATGTCTACCGGAAGCAGAAAAAGAAGATGGAGAATGAGTCGGCCACGGAGGGGGAAGA 
CTCTGCCATGACAGACATGCCTCCGACAGAGGAGGTGACAGACATCGTGGAAATGAGAGAGG 
AGAATGAAIAAGGCACGGGACGCCATGGGCACTGCAGGGACGGTCAGTCAGGATGACACTTC 
GGCATCATCTCTTCCCTCTCCCATCGTATTTTGTTCCCTTTTTTTTGTTTTGTTTTGGTAAT 
GAAAGAGGCCTTGATTTAAAGGTTTCGTGTCAATTCTCTAGCATACTGGGTATGCTCACACT 
GACGGGGGGACCTAGTGAATGGTCTTTACTGTTGCTATGTAAAAACAAACGAAACAACTGAC 
TTCATACCCCTGCCTCACGAAAACCCAAAAGACACAGCTGCCTCACGGTTGACGTTGTGTCC 
TCCTCCCCTGGACAATCTCCTCTTGGAACCAAAGGACTGCAGCTGTGCCATCGCGCCTCGGT 
CACCCTGCACAGCAGGCCACAGACTCTCCTGTCCCCCTTCATCGCTCTTAAGAATCAACAGG 
TTAAAACTCGGCTTCCTTTGATTTGCTTCCCAGTCACATGGCCGTACAAAGAGATGGAGCCC 
CGGTGGCCTCTTAAATTTCCCTTCTGCCACGGAGTTCGAAACCATCTACTCCACACATGCAG 
GAGGCGGGTGGCACGCTGCAGCCCGGAGTCCCCGTTCACACTGAGGAACGGAGACCTGTGAC 
CACAGCAGGCTGACAGATGGACAGAATCTCCCGTAGAAAGGTTTGGTTTGAAATGCCCCGGG 
GGCAGCAAACTGACATGGTTGAATGATAGCATTTCACTCTGCGTTCTCCTAGATCTGAGCAA 
GCTGTCAGTTCTCACCCCCACCGTGTATATACATGAGCTAACTTTTTTAAATTGTCACAAAA 
GCGCATCTCCAGATTCCAGACCCTGCCGCATGACTTTTCCTGAAGGCTTGCTTTTCCCTCGC 
CTTTCCTGAAGGTCGCATTAGAGCGAGTCACATGGAGCATCCTAACTTTGCATTTTAGTTTT 
TACAGTGAACTGAAGCTTTAAGTCTCATCCAGCATTCTAATGCCAGGTTGCTGTAGGGTAAC 
TTTTGAAGTAGATATATTACCTGGTTCTGCTATCCTTAGTCATAACTCTGCGGTACAGGTAA 
TTGAGAATGTACTACGGTACTTCCCTCCCACACCATACGATAAAGCAAGACATTTTATAACG 
ATACCAGAGTCACTATGTGGTCCTCCCTGAAATAACGCATTCGAAATCCATGCAGTGCAGTA 
TATTTTTCTAAGTTTTGGAAAGCAGGTTTTTTCCTTTAAAAAAATTATAGACACGGTTCACT 
AAATTGATTTAGTCAGAATTCCTAGACTGAAAGAACCTAAACAAAAAAATATTTTAAAGATA 

TAAATATATGCTGTATATGTTATGTAATTTATTTTAGGCTATAATACATTTCCTATTTTCGC 
ATTTTCAATAAAATGTCTCTAATACAAAAAA 
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MVKFPALTHYWPLIRFLVPLGITNIAIDFGEQALNRGIAAVKEDAVEMLASYGLAYSLMKFF 
TGPMS D F KNVGL VF VNS KRDRTKAVLCMWAGAI AAVFHTL I AYSDLGYYI I NKLHHVDES V 
GSKTRRAFLYLAAFPFMDAMAWTHAGILLKHKYSFLVGCASISDVIAQWFVAILLHSHLEC 
REPLL I P I LS L YMGALVRCTTLCLGYYKN I HD 1 1 PDRSGPELGGDAT IRKMLS FWWPLAL IL 
ATQR I S R P I VNLFVSRDLGGSSAATEAVAI LTATYPVGHMP YGWLTE I RAVYPAFDKNNPSN 
KLVSTSNTVTAAHIKKFTFVCMALSLTLCFVMFWTPNVSEKILIDIIGVDFAFAELCWPLR 
I FS FF PVPVTVRAHLTGWLMTLKKTFVLAPSSVLRI I VLI ASLWLP YLGVHGATLGVGSLL 
AGFVGESTMVAIAACYVYRKQKKKMENESATEGEDSAMTDMPPTEEVTDIVEMREENE 
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CCTGACAGAAGTGCCCCGGAGCTGGGGGAGATNCAACATTAAGAAGATGCTGAGCTTCTGGT 

GCCNTTTGGCTCTAATTCTGGCCACACAGAGAANCAGTCGGCCTATTGTCAACCTCTTTGTT 

TCCCGGGACCTTGGTGGCAGTTCTGCAGCCACAGAGGCAGTGGCGATTTTGACAGCCACATA 

CCCTGTGGGTCACATGCCATACGGCTGGTTGACGGAAATCCGTGCTGTGTATCCTGCTTTCG 

ACAAGAATAACCCCAGCAACAAACTGGTGAGCACGAGCAACACAGTCACGGCGGCCCACATC 

AAGAAGTTCACCTTCGTCTGCATGGCTCTGTCACTCACGCTCTGTTTCGTGATGTTTTGGAC 

ACCCAACGTGTCTGNGAAAATCTTGATAGACATCATCGGAGTGGACTTTGCCTTTGCAGAAC 

TCTGTGTTGTTCCTTTGCGGATCTTCTCCTTCTTCCCAGTTCCAGTCACAGTGAGGGCGCAT 
CTCACCGGGTGGCTGATGACACTGAAGAAAACCTTCGTC 
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FIGURE 6 

TGACGGAATCCCGGGCTGGGTATCCTGGTTTNGACAAGATAAACCCCCAGCAANAAATTGGG 
GAGCAGGGCAAAACAGTNACGGGCAGCCCACATCAAGAAGTTCACCTTNGTTTGNATGGNTC 
TGTCAACTCACGCTNTGTTTCGTGATGTTTTGGACACCCAAAGTGTTTGAGAAAATTTTGAT 
AGACATNATCGGAGTGGANTTTGCCTTTGCAGAANTTTGNGNTGTTCCTTTGCGGATTTTCT 
CCTTTTTCCCAGTTCCAGTCACAGNGAGGGCGCATCTCACCGGGNGGNTGATGACANTGAAG 
AAAACCTTTGTCCTTGCCCCCAGCTNTTTGGTGCGGATCATTGTCCTNATNGCCAGCCTTGT 
GGTCCTACCCTACCTGGGGGTGCACGGTGCGACCCTGGGCGTGGGTTCCCTCCTGGCGGGCA 
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FIGURE 7 

TATTCCCAGTTCCGGTCACGGGGAGGGCGCATNTCACCGGGTGGCTGANGACACTGAAGAAA 

ACCTTNGTCCTTGCCCCCAGNTTTGTGNTGCGGATNATCGTCCTCATCGCCAGCCTNGTGGT 
CCTACCCTACCTGGGGGTGCACGGTGAGAC 
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FIGURE 8 



GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCMSGGGGCCTGCCTGGGA 

GCCTGCTCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAG 

CTGCTGCCCCGCCAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCC 

TGGGGGTGCTGGTGTCCATCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTG 

CCCTGGGTGTGTGAGGAGGGGGCCGGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGG 

CTCCCTGCTTGGCTACCGCGCTGTCTACCGCATGTGCTTCGCCACGGCGGCCTTCTTCTTCT 

TCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCAGCCGGGACCCCCGGGCTGCCATCCAG 

AATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTCACCGTGGGTGCCTTCTACAT 

CCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGGCTCCTTCCTCTTCA 

TCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGTGGCTGGGC 

AAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTCTT 

CTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCT 

GCCACGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCT 

GCTGTCCTGCCCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCAT 

CACCCTCTACACCATGTTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCA 

ACCCCCATTTGCCAACCCAGCTGGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAG 

ACCCAGTGGTGGGATGCCCCGAGCATTGTGGGCCTCATCATCTTCCTCCTGTGCACCCTCTT 

CATCAGTCTGCGCTCCTCAGACCACCGGCAGGTGAACAGCCTGATGCAGACCGAGGAGTGCC 

CACCTATGCTAGACGCCACACAGCAGCAGCAGCAGCAGGTGGCAGCCTGTGAGGGCCGGGCC 

TTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCCACTTCTGCCTGGTGCT 

GGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAGACCCGGAAGA 

TGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCTCCTC 

TACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCISAGGCAGCCT 

CACAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCT 

GCCCCCTCCCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTG 

CCCCTGAGCCGGGCCTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCA 

GAGCCCCATCCCCCCGCCACACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGT 

TGCCCATACTCAGCATCTCGGATGAAAGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGG 

CTGCTGGAGAGAGCGGGGAACTCCCACCACAGTGGGGCATCCGGCACTGAAGCCCTGGTGTT 

CCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTGGACTTCGTGCCTTACTGAGTCTCT 

AAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA 
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FIGURE 9 

MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVE 

SQLYKLPWVCEEGAGIPTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFFTLLMLCVSSSRD 

PRAAIQNGFWFFKFLILVGLTVGAFYIPDGSFTNIWFYFGWGSFLFILIQLVLLIDFAHSW 

NQRWLGKAEECDSRAWYAGLFFFTLLFYLLSIAAVALMFMYYTEPSGCHEGKVFISLNLTFC 

VCVS lAAVLPKVQDAQPNSGLLQASVITLYTMFVTWSALSS IPEQKCNPHLPTQLGNETWA 

GPEGYETQWWDAPSIVGLI I FLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVA 

ACEGRAFDNEQDGVTYSYSFFHFCLVLASLHVMMTLTNWYKPGETRKMISTWTAVWVKICAS 
WAGLLLYLWTLVAPLLLRNRDFS 
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FIGURE 10 

GAGCGAGGCCGGGGACTGAAGGTGTGGGTGTCGAGCCCTCTGGCAGAGGGTTAACCTGGGTC 

AAATGCACGGATTCTCACCTCGTACAGTTACGCTCTCCCGCGGCACGTCCGCGAGGACTTGA 

AGTCCTGAGCGCTCAAGT7TGTCCGTAGGTCGAGAGAAGGCC&ISGAGGTGCCGCCACCGGC 

ACCGCGGAGCTTTCTCTGTAGAGCATTGTGCCTATTTCCCCGAGTCTTTGCTGCCGAAGCTG 

TGACTGCCGATTCGGAAGTCCTTGAGGAGCGTCAGAAGCGGCTTCCCTACGTCCCAGAGCCC 

TATTACCCGGAATCTGGATGGGACCGCCTCCGGGAGCTGTTTGGCAAAGATGAACAGCAGAG 

AATTTCAAAGGACCTTGCTAATATCTGTAAGACGGCAGCTACAGCAGGCATCATTGGCTGGG 

TGTATGGGGGAATACCAGCTTTTATTCATGCTAAACAACAATACATTGAGCAGAGCCAGGCA 

GAAATTTATCATAACCGGTTTGATGCTGTGCAATCTGCACATCGTGCTGCCACACGAGGCTT 

CATTCGTTATGGCTGGCGCTGGGGTTGGAGAACTGCAGTGTTTGTGACTATATTCAACACAG 

TGAACACTAGTCTGAATGTATACCGAAATAAAGATGCCTTAAGCCATTTTGTAATTGCAGGA 

GCTGTCACGGGAAGTCTTTTTAGGATAAACGTAGGCCTGCGTGGCCTGGTGGCTGGTGGCAT 

AATTGGAGCCTTGCTGGGCACTCCTGTAGGAGGCCTGCTGATGGCATTTCAGAAGTACGCTG 

GTGAGACTGTTCAGGAAAGAAAACAGAAGGATCGAAAGGCACTCCATGAGCTAAAACTGGAA 

GAGTGGAAAGGCAGACTACAAGTTACTGAGCACCTCCCTGAGAAAATTGAAAGTAGTTTACG 

GGAAGATGAACCTGAGAATGATGCTAAGAAAATTGAAGCACTGCTAAACCTTCCTAGAAACC 

CTTCAGTAATAGATAAACAAGACAAGGACISAAAGTGCTCTGAACTTGAAACTCACTGGAGA 

GCTGAAGGGAGCTGCCATGTCCGATGAATGCCAACAGACAGGCCACTCTTTGGTCAGCCTGC 

TGACAAATTTAAGTGCTGGTACCTGTGGTGGCAGTGGCTTGCTCTTGTCTTTTTCTTTTCTT 

TTTAACTAAGAATGGGGCTGTTGTACTCTCACTTTACTTATCCTTAAATTTAAATACATACT 

TATGTTTGTATTAATCTATCAATATATGCATACATGGATATATCCACCCACCTAGATTTTAA 

GCAGTAAATAAAACATTTCGCAAAAGATTAAAGTTGAATTTTACAGTTT 
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FIGURE 1 1 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA23318 
xsubunit 1 of 1, 285 aa, 1 stop 
xMW: 32190, pi: 9.03, NX(S/T) : 2 

MEVPPPAPRSFLCRALCLFPRVFAAEAVTADSEVLEERQKRLPYVPEPYYPESGWDRLRELF 
GKDEQQRISKDLANICKTAATAGIIGWVYGGIPAFIHAKQQYIEQSQAEIYHNRFDAVQSAH 
RAATRGFIRYGWRWGWRTAVFVTIFNTVNTSLNWRNKDALSHFVIAGAVTGSLFRINVGLR 
GLVAGG 1 1 GALLGTPVGGLLMAFQKYAGETVQERKQKDRKALHELKLEEWKGRLQVTEHLPE 
KIESSLREDEPENDAKKIEALLNLPRNPSVIDKQDKD 

Important Features: 
Signal Peptide: 
amino acids 1-24 

Transmembrane domains : 

amino acids 76-96 and amino acids 171-195 

N-glycosylation site: 
amino acids 153-156 
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FIGURE 12 

CGGAAGTCCCTTGAGGAGCGTCAGAAGCGGCTTCCCTACGTCCCAGAGCCCTATTACCCGGA 
ATCTGGATGGGACCGCTCCGGGAGCTGTTTGGCAAAGATGAACAGCAGAGAATTTCAAAGGA 
CCTTGCTAATATCTGTAAGACGGCAGCTACAGCAGGCATCATTGGCTGGGTGTATGGGGGAA 
TACCAGCTTTTATTCATGCTAAACAACAATACATTGAGCAGAGCCAGGCAGAAATTTATCAT 
AACCGGTTTGATGCTGTGCAATCTGCACATCGTGCTGCCACACGAGGCTTCATTCGTTCATG 
GCTGGCGCCGAACC 
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FIGURE 13 

TCAAGTTTGTCCGTAGGTCGAGAGAAGGCCATGGAGGTGCCGCCACCGGCACCGCGGAGCTT 

TTTTCTGTAGAGCATTGTGCCTATTTCCCCGAGTTTTTGCTGCCGAAGCTGTGACTGCCGAT 

TCGGAAGTCCTTGAGGAGCGTCAGAAGCGGCTTCCCTACGTCCCAGAGCCCTATTACCCGGA 

ATTTGGATGGGACCGCCTCCGGGAGCTGTTTGGCAAAGATGAACAGCAGAGAATTTCAAAGG 

ACCTTGCTGATATNTGTAAGACGGCAGCTACAGCAGGCATCATTGGCTGGGTGTATGGGGGA 

ATACCAGCTTTTATTCATGNTAAACAACAATACATTGAGCAGAGCCAGGCAGAAATTTATNA 
TAACC 
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FIGURE 14 

GAGCCGCCGCCGCGCGCGCGCCGCGCACTGCAGCCCCAGGCCCCGGCCCCCCACCCACGTCT 

GCGTTGCTGCCCCGCCTGGGCCAGGCCCCAAAGGCAAGGACAAAGCAGCTGTCAGGGAACCT 

CCGCCGGAGTCGAATTTACGTGCAGCTGCCGGCAACCACAGGTTCCAAG&ESGTTTGCGGGG 

GCTTCGCGTGTTCCAAGAACTGCCTGTGCGCCCTCAACCTGCTTTACACCTTGGTTAGTCTG 

CTGCTAATTGGAATTGCTGCGTGGGGCATTGGCT7CGGGCTGATTTCCAGTCTCCGAGTGGT 

CGGCGTGGTCATTGCAGTGGGCATCTTCTTGTTCCTGATTGCTTTAGTGGGTCTGATTGGAG 

CTGTAAAACATCATCAGGTGTTGCTATTTTTTTATAT3ATTATTCTGTTACTTGTATTTATT 

GTTCAGTTTTCTGTATCTTGCGCTTGTTTAGCCCTGAACCAGGAGCAACAGGGTCAGCTTCT 

GGAGGTTGGTTGGAACAATACGGCAAGTGCTCGAAATGACATCCAGAGAAATCTAAACTGCT 

GTGGGTTCCGAAGTGTTAACCCAAATGACACCTGTCTGGCTAGCTGTGTTAAAAGTGACCAC 

TCGTGCTCGCCATGTGCTCCAATCATAGGAGAATATGCTGGAGAGGTTTTGAGATTTGTTGG 

TGGCATTGGCCTGTTCTTCAGTTTTACAGAGATCCTGGGTGTTTGGCTGACCTACAGATACA 

GGAACCAGAAAGACCCCCGCGCGAATCCTAGTGCATTCCTT2SATGAGAAAACAAGGAAGAT 

TTCCTTTCGTATTATGATCTTGTTCACTTTCTGTAATTTTCTGTTAAGCTCCATTTGCCAGT 

TTAAGGAAGGAAACACTATCTGGAAAAGTACCTTATTGATAGTGGAATTATATATTTTTACT 

CTATGTTTCTCTACATGTTTTTTTCTTTCCGTTGCTGAAAAATATTTGAAACTTGTGGTCTC 

TGAAGCTCGGTGGCACCTGGAATTTACTGTATTCATTGTCGGGCACTGTCCACTGTGGCCTT 

TCTTAGCATTTTTACCTGCAGAAAAACTTTGTATGGTACCACTGTGTTGGTTATATGGTGAA 

TCTGAACGTACATCTCACTGGTATAATTATATGTAGCACTGTGCTGTGTAGATAGTTCCTAC 

TGGAAAAAGAGTGGAAATTTATTAAAATCAGAAAGTATGAGATCCTGTTATGTTAAGGGAAA 

TCCAAATTCCCAATTTTTTTTGGTCTTTTTAGGAAAGATTGTTGTGGTAAAAAGTGTTAGTA 

TAAAAATGATAATTTACTTGTAGTCTTTTATGATTACACCAATGTATTCTAGAAATAGTTAT 

GTCTTAGGAAATTGTGGTTTAATTTTTGACTTTTACAGGTAAGTGCAAAGGAGAAGTGGTTT 

CATGAAATGTTCTAATGTATAATAACATTTACCTTCAGCCTCCATCAGAATGGAACGAGTTT 

TGAGTAATCAGGAAGTATATCTATATGATCTTGATATTGTTTTATAATAATTTGAAGTCTAA 

AAGACTGCATTTTTAAACAAGTTAGTATTAATGCGTTGGCCCACGTAGCAAAAAGATATTTG 

ATTATCTTAAAAATTGTTAAATACCGTTTTCATGAAATTTCTCAGTATTGTAACAGCAACTT 

GTCAAACCTAAGCATATTTGAATATGATCTCCCATAATTTGAAATTGAAATCGTATTGTGTG 

GCTCTGTATATTCTGTTAAAAAATTAAAGGACAGAAACCTTTCTTTGTGTATGCATGTTTGA 

ATTAAAAGAAAGTAATGGAAG 
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FIGURE 15 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA39979 
xsubunit 1 of 1, 204 aa, 1 stop 
XMW: 22147, pi: 8.37, NX(S/T): 3 

MVCGGFACSKNCLCALNLLYTLVSLLLIGIAAWGIGFGLI3SLRWGWIAVGIFLFLIALV 
GLIGAVKHHQVLLFFYMIILLLVFIVQFSVSCACLALNQEQQGQLLEVGWNNTASARNDIQR 

NLNCCGFRSVNPNDTCLASCVKSDHSCSPCAPIIGEYAGEVLRFVGGIGLFFSFTEILGVWL 
TYRYRNQKDPRANPSAFL 

Signal Peptide: 

amino acids 1-34 

Transmembrane domains: 

amino acids 47-63, 72-95 and 162-182 



BNSOOCIO <WO 994628)A2.I.> 



yl5/-?2>9- 



wo 99/46281 




PCT/LS99/05028 



FIGURE 16 

TGATTGGAGCTGTAAAAAANTCTTCAGGTGTTGTNATTTTTTTATATGATTATTCTGTAANT 
TGTATTTATTGTTCAGTTTTNTGTATCTTGCGCTTGTTTAGCCNTGAACCAGGAGCAACAGG 
GTCAGNTTNTGGAGGTTGGTTGGAACAATACGGCAAGTGCTCGAAATGACATCCAGAGAAAT 
NTAAACTGCTGTGGGTTCCGAAGTGTTAACCCAAATGACACCTGTNTGGCTAGCTGTGTTAA 
AAGTGACCACTNGTGCTCGCCATGTGCTCaVATCATAGGAGAATATGCTGGAGAGGTTTTGA 
GATTTGTTGGTGGCATTGGCCTGTTNTTCAGTTTTACAGAGATCCTGGGTGTTTGGCTGACC 
TACAGATACAGGAACCAG 
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FIGURE 17 

AATCCCAAATTCCCCAATTTTTTTGGNCTTTTTAGGGAAAGATGTGTTGTGGTAAAAAGTGT 

TAGTATAAAAATGATAATTTACTTGTAGTCTTTTATGATTACACCAATGTATTCTAGAATAG 

TTATGTCTTAGGAAATTGTGGTTTAATTTTTGACTTTTACAGGTAAGTGCAAAGGAGAAGTG 

GTTTCATGAAATGTTCTAATGTATAATAACATTTACCTTCAGCCTCCCATCAGAATGGAACG 

AGTTTTGAGTAATCCAGGAAGTATATCTATATGATCTTGATATTGTTTTATATAATTTGAAG 

TCTAAAAGACTGCATTTTTAAACAAGTTAGTATTAATGCGTTGGCCCACGTAGCAAAAAGAT 

ATTTGATTATCTTAAAAATTGTTAAATACCGTTTTCATGAAAGTTCTCAGTATTGTAACAGC 

AACTTGTCAAACCTAAGCATATTTGAATATGATCTCCCATAATTTGAAATTGAAATCGTATT 

GTGTGGAGGAAATGGCAATCTTATGTGTGCTGAAGGACACAGTAAGAGCACCAAGTTGTGCC 
CCACTTGC 
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FIGURE 18 

ATGATTATTCTGTTACTTGTATTTATTGTTCAGTTTTATGGTATCTTGCGCTTGTTTAGCCC 
CTGAAACCAGGAGCAACAGGGNNCAGCTTCCTGGAGGTTGGTTGGCAACAATCACGGCCAAG 
TGACTCCGCAAATGACATCCCAGAGAAATCCT.ZUSACTGCTGTGGGTTCCGAAGTGTTAACCC 
AAATGACACCTGTCTGGCTNGCTGTGTTAAAAGTGACCACTCGTGCTCGCCATGTGCTCCAA 

TCATAGGAGAATATGC 
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FIGURE 19 

CAGTCACCAISAAGCTGGGCTGTGTCCTCATGGCCTGGGCCCTCTACCTTTCCCTTGGTGTG 

CTCTGGGTGGCCCAGATGCTACTGGCTGCCAGTTTTGAGACGCTGCAGTGTGAGGGACCTGT 

CTGCACTGAGGAGAGCAGCTGCCACACGGAGGATGACTTGACTGATGCAAGGGAAGCTGGCT 

TCCAGGTCAAGGCCTACACTTTCAGTGAACCCTTCCACCTGATTGTGTCCTATGACTGGCTG 

ATCCTCCAAGGTCCAGCCAAGCCAGTTTTTGAAGGGGACCTGCTGGTTCTGCGCTGCCAGGC 

CTGGCAAGACTGGCCACTGACTCAGGTGACCTTCTACCGAGATGGCTCAGCTCTGGGTCCCC 

CCGGGCCTAACAGGGAATTCTCCATCACCGTGGTACAAAAGGCAGACAGCGGGCACTACCAC 

TGCAGTGGCATCTTCCAGAGCCCTGGTCCTGGGATCCCAGAAACAGCATCTGTTGTGGCTAT 

CACAGTCCAAGAACTGTTTCCAGCGCCAATTCTCAGAGCTGTACCCTCAGCTGAACCCCAAG 

CAGGAAGCCCCATGACCCTGAGTTGTCAGACAAAGTTGCCCCTGCAGAGGTCAGCTGCCCGC 

CTCCTCTTCTCCTTCTACAAGGATGGAAGGATAGTGCAAAGCAGGGGGCTCTCCTCAGAATT 

CCAGATCCCCACAGCTTCAGAAGATCACTCCGGGTCATACTGGTGTGAGGCAGCCACTGAGG 

ACAACCAAGTTTGGAAACAGAGCCCCCAGCTAGAGATCAGAGTGCAGGGTGCTTCCAGCTCT 

GCTGCACCTCCCACATTGAATCCAGCTCCTCAGAAATCAGCTGCTCCAGGAACTGCTCCTGA 

GGAGGCCCCTGGGCCTCTGCCTCCGCCGCCAACCCCATCTTCTGAGGATCCAGGCTTTTCTT 

CTCCTCTGGGGATGCCAGATCCTCATCTGTATCACCAGATGGGCCTTCTTCTCAAACACATG 

CAGGATGTGAGAGTCCTCCTCGGTCACCTGCTCATGGAGTTGAGGGAATTATCTGGCCACCA 

GAAGCCTGGGACCACAAAGGCTACTGCTGAAI^^AAGTAAACAGTTCATCCATGATCTCACT 

TAACCACCCCAATAAATCTGATTCTTTATTTTCTCTTCCTGTCCTGCACATATGCATAAGTA 

CTTTTACAAGTTGTCCCAGTGTTTTGTTAGAATAATGTAGTTAGGTGAGTGTAAATAAATTT 

ATATAAAGTGAGAATTAGAGTTTAGCTATAATTGTGTATTCTCTCTTAACACAACAGAATTC 

TGCTGTCTAGATCAGGAATTTCTATCTGTTATATCGACCAGAATGTTGTGATTTAAAGAGAA 

CTAATGGAAGTGGATTGAATACAGCAGTCTCAACTGGGGGCAATTTTGCCCCCCAGAGGACA 

TTGGGCAATGTTTGGAGACATTTTGGTCATTATACTTGGGGGGTTGGGGGATGGTGGGATGT 

GTGTCTACTGGCATCCAGTAAATAGAAGCCAGGGGTGCCGCTAAACATCCTATAATGCACAG 

GGCAGTACCCCACAACGAAAAATAATCTGGCCCAAAATGTCAGTTGTACTGAGTTTGAGAAA 

CCCCAGCCTAATGAAACCCTAGGTGTTGGGCTCTGGAATGGGACTTTGTCCCTTCTAATTAT 

TATCTCTTTCCAGCCTCATTCAGCTATTCTTACTGACATACCAGTCTTTAGCTGGTGCTATG 

GTCTGTTCTTTAGTTCTAGTTTGTATCCCCTCAAAAGCCATTATGTTGAAATCCTAATCCCC 

AAGGTGATGGCATTAAGAAGTGGGCCTTTGGGAAGTGATTAGATCAGGAGTGCAGAGCCCTC 

ATGATTAGGATTAGTGCCCTTATTTAAAAAGGCCCCAGAGAGCTAACTCACCCTTCCACCAT 

ATGAGGACGTGGCAAGAAGATGACATGTATGAGAACCAAAAAACAGCTGTCGCCAAACACCG 

ACTCTGTCGTTGCCTTGATCTTGAACTTCCAGCCTCCAGAACTATGAGAAATAAAATTCTGG 
TTGTTTGTAGCCTAA 
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FIGURE 20 

></usr/seqcib2/sst/DNA/Dnaseqs.min/ss .DNA4 0594 
xsubunit 1 of 1, 359 aa, 1 stop 
XMW: 38899, pi: 5.21, NX(S/T): 0 

MKLGCVLMAWALYLSLGVLWVAQMLLAASFETLQCEGPVCTEESSCHTEDDLTDAREAGFQV 
KAYTFSEPFHLIVSYDWLILQGPAKPVFEGDLLVLRCQAWQDWPLTQVTFYRDGSALGPPGP 
NREFSITWQKADSGHYHCSGIFQSPGPGIPETASWAITVQELFPAPILRAVPSAEPQAGS 
PMTLSCQTKLPLQRSAARLLFSFYKDGRIVQSRGLSSEFQIPTASEDHSGSYWCEAATEDNQ 
VWKQSPQLEIRVQGASSSAAPPTLNPAPQKSAAPGTAPEEAPGPLPPPPTPSSEDPGFSSPL 
GMPDPHLYHQMGLLLKHMQDVRVLLGHLLMELRELSGHQKPGTTKATAE 

Leucine zipper pattern sequence: 
amino acids 12-33 

Protein kinase C phosphorylation site: 
amino acids 353-355 
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FIGURE 21 

CCCACGCGTCCGCCCACGCGTCCGCCCACGGGTCCGCCCACGCGTCCGGGCCACCAGAAGTT 

TGAGCCTCTTTGGTAGCAGGAGGCTGGAAGAAAGGACAGAAGTAGCTCTGGCTGTGaiSGGG 

ATCTTACTGGGCCTGCTACTCCTGGGGCACCTAACAGTGGACACTTATGGCCGTCCCATCCT 

GGAAGTGCCAGAGAGTGTAACAGGACCTTGGAAAGGGGATGTGAATCTTCCCTGCACCTATG 

ACCCCCTGCAAGGCTACACCCAAGTCTTGGTGAAGTGGCTGGTACAACGTGGCTCAGACCCT 

GTCACCATCTTTCTACGTGACTCTTCTGGAGACCATATCCAGCAGGCAAAGTACCAGGGCCG 

CCTGCATGTGAGCCACAAGGTTCCAGGAGATGTATCCCTCCAATTGAGCACCCTGGAGATGG 

ATGACCGGAGCCACTACACGTGTGAAGTCACCTGGCAGACTCCTGATGGCAACCAAGTCGTG 

AGAGATAAGATTACTGAGCTCCGTGTCCAGAAACTCTCTGTCTCCAAGCCCACAGTGACAAC 

TGGCAGCGGTTATGGCTTCACGGTGCCCCAGGGAATGAGGATTAGCCTTCAATGCCAGGCTC 

GGGGTTCTCCTCCCATCAGTTATATTTGGTATAAGCAACAGACTAATAACCAGGAACCCATC 

AAAGTAGCAACCCTAAGTACCTTACTCTTCAAGCCTGCGGTGATAGCCGACTCAGGCTCCTA 

TTTCTGCACTGCCAAGGGCCAGGT7GGCTCTGAGCAGCACAGCGACATTGTGAAGTTTGTGG 

TCAAAGACTCCTCAAAGCTACTCAAGACCAAGACTGAGGCACCTACAACCATGACATACCCC 

TTGAAAGCAACATCTACAGTGAAGCAGTCCTGGGACTGGACCACTGACATGGATGGCTACCT 

TGGAGAGACCAGTGCTGGGCCAGGAAAGAGCCTGCCTGTCTTTGCCATCATCCTCATCATCT 

CCTTGTGCTGTATGGTGGTTTTTACCATGGCCTATATCATGCTCTGTCGGAAGACATCCCAA 

CAAGAGCATGTCTACGAAGCAGCCAGG2MGAAAGTCTCTCCTCTTCCATTTTTGACCCCGT 

CCCTGCCCTCAATTTTGATTACTGGCAGGAAATGTGGAGGAAGGGGGGTGTGGCACAGACCC 

AATCCTAAGGCCGGAGGCCTTCAGGGTCAGGACATAGCTGCCTTCCCTCTCTCAGGCACCTT 

CTGAGGTTGTTTTGGCCCTCTGAACACAAAGGATAATTTAGATCCATCTGCCTTCTGCTTCC 

AGAATCCCTGGGTGGTAGGATCCTGATAATTAATTGGCAAGAATTGAGGCAGAAGGGTGGGA 

AACCAGGACCACAGCCCCAAGTCCCTTCTTATGGGTGGTGGGCTCTTGGGCCATAGGGCACA 

TGCCAGAGAGGCCAACGACTCTGGAGAAACCATGAGGGTGGCCATCTTCGCAAGTGGCTGCT 

CCAGTGATGAGCCAACTTCCCAGAATCTGGGCAACAACTACTCTGATGAGCCCTGCATAGGA 

CAGGAGTACCAGATCATCGCCCAGATCAATGGCAACTACGCCCGCCTGCTGGACACAGTTCC 

TCTGGATTATGAGTTTCTGGCCACTGAGGGCAAAAGTGTCTGTTAAAAATGCCCCATTAGGC 

CAGGATCTGCTGACATAATTGCCTAGTCAGTCCTTGCCTTCTGCATGGCCTTCTTCCCTGCT 

ACCTCTCTTCCTGGATAGCCCAAAGTGTCCGCCTACCAACACTGGAGCCGCTGGGAGTCACT 

GGCTTTGCCCTGGAATTTGCCAGATGCATCTCAAGTAAGCCAGCTGCTGGATTTGGCTCTGG 

GCCCTTCTAGTATCTCTGCCGGGGGCTTCTGGTACTCCTCTCTAAATACCAGAGGGAAGATG 

CCCATAGCACTAGGACTTGGTCATCATGCCTACAGACACTATTCAACTTTGGCATCTTGCCA 

CCAGAAGACCCGAGGGAGGCTCAGCTCTGCCAGCTCAGAGGACCAGCTATATCCAGGATCAT 

TTCTCTTTCTTCAGGGCCAGACAGCTTTTAATTGAAATTGTTATTTCACAGGCCAGGGTTCA 

GTTCTGCTCCTCCACTATAAGTCTAATGTTCTGACTCTCTCCTGGTGCTCAATAAATATCTA 
ATCATAACAGC 
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></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA4 5416 

xsubunit 1 of 1, 321 aa, 1 stop 
XMW: 35544, pi: 8.51, NX(S/T): 0 

MGILLGLLLLGHLTVDTYGRPILEVPESVTGPWKGDVNLPCTYDPLQGYTQVLVKWLVQRGS 
DPVTIFLRDSSGDHIQQAKYQGRLHVSHKVPGDVSLQLSTLEMDDRSHYTCEVTWQTPDGNQ 
WRDKITELRVQKLSVSKPTVTTGSGYGFTVPQGMRISLQCQARGSPPISYIWYKQQTNNQE 
PIKVATLSTLLFKPAVIADSGSYFCTAKGQVGSEQHSDIVKEWKDSSKLLKTKTEAPTTMT 
YPLKATSTVKQSWDWTTDMDGYLGETSAGPGKSLPVFAIILIISLCCMWFTMAYIMLCRKT 

SQQEHVYEAAR 

Glycosaminoglycan attachment site: 
amino acids 149-152 

Transmembrane domain: 
amino acids 276-306 



<W0 9946281 A2J.> 
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F IGURE 2 3 

GCGCCGGGAGCCCATCTGCCCCCAGGGGCACGGGGCGCGGGGCCGGCTCCCGCCCGGCACAT 
GGCTGCAGCCACCTCGCGCGCACCCCGAGGCGCCGCGCCCAGCTCGCCCGAGGTCCGTCGGA 
GGCGCCCGGCCGCCCCGGAGCCAAGCAGCAACTGAGCGGGGAAGCGCCCGCGTCCGGGGATC 
GGG&2STCCCTCCTCCTTCTCCTCTTGCTAGTTTCCTACTATGTTGGAACCTTGGGGACTCA 
CACTGAGATCAAGAGAGTGGCAGAGGAAAAGGTCACTTTGCCCTGCCACCATCAACTGGGGC 
TTCCAGAAAAAGACACTCTGGATATTGAATGGCTGCTCACCGATAATGAAGGGAACCAAAAA 
GTGGTGATCACTTACTCCAGTCGTCATGTCTACAATAACTTGACTGAGGAACAGAAGGGCCG 
AGTGGCCTTTGCTTCCAATTTCCTGGCAGGAGATGCCTCCTTGCAGATTGAACCTCTGAAGC 
CCAGTGATGAGGGCCGGTACACCTGTAAGGTTAAGAATTCAGGGCGCTACGTGTGGAGCCAT 
GTCATCTTAAAAGTCTTAGTGAGACCATCCAAGCCCAAGTGTGAGTTGGAAGGAGAGCTGAC 
AGAAGGAAGTGACCTGACTTTGCAGTGTGAGTCATCCTCTGGCACAGAGCCCATTGTGTATT 
ACTGGCAGCGAATCCGAGAGAAAGAGGGAGAGGATGAACGTCTGCCTCCCAAATCTAGGATT 
GACTACAACCACCCTGGACGAGTTCTGCTGCAGAATCTTACCATGTCCTACTCTGGACTGTA 
CCAGTGCACAGCAGGCAACGAAGCTGGGAAGGAAAGCTGTGTGGTGCGAGTAACTGTACAGT 
ATGTACAAAGCATCGGCATGG7TGCAGGAGCAGTGACAGGCATAGTGGCTGGAGCCCTGCTG 
ATTTTCCTCTTGGTGTGGCTGCTAATCCGAAGGAAAGACAAAGAAAGATATGAGGAAGAAGA 
GAGACCTAATGAAATTCGAGAAGATGCTGAAGCTCCAAAAGCCCGTCTTGTGAAACCCAGCT 
CCTCTTCCTCAGGCTCTCGGAGCTCACGCTCTGGTTCTTCCTCCACTCGCTCCACAGCAAAT 
AGTGCCTCACGCAGCCAGCGGACACTGTCAACTGACGCAGCACCCCAGCCAGGGCTGGCCAC 
CCAGGCATACAGCCTAGTGGGGCCAGAGGTGAGAGGTTCTGAACCAAAGAAAGTCCACCATG 
CTAATCTGACCAAAGCAGAAACCACACCCAGCATGATCCCCAGCCAGAGCAGAGCCTTCCAA 
ACGGTCTS^TTACAATGGACTTGACTCCCACGCTTTCCTAGGAGTCAGGGTCTTTGGACTC 
TTCTCGTCATTGGAGCTCAAGTCACCAGCCACACAACCAGATGAGAGGTCATCTAAGTAGCA 
GTGAGCATTGCACGGAACAGATTCAGATGAGCATTTTCCTTATACAATACCAAACAAGCAAA 
AGGATGTAAGCTGATTCATCTGTAAAAAGGCATCTTATTGTGCCTTTAGACCAGAGTAAGGG 
AAAGCAGGAGTCCAAATCTATTTGTTGACCAGGACCTGTGGTGAGAAGGTTGGGGAAAGGTG 
AGGTGAATATACCTAAAACTTTTAATGTGGGATATTTTGTATCAGTGCTTTGATTCACAATT 
TTCAAGAGGAAATGGGATGCTGTTTGTAAATTTTCTATGCATTTCTGCAAACTTATTGGATT 
ATTAGTTATTCAGACAGTCAAGCAGAACCCACAGCCTTATTACACCTGTCTACACCATGTAC 
TGAGCTAACCACTTCTAAGAAACTCCAAAAAAGGAAACATGTGTCTTCTATTCTGACTTAAC 
TTCATTTGTCATAAGGTTTGGATATTAATTTCAAGGGGAGTTGAAATAGTGGGAGATGGAGA 
AGAGTGAATGAGTTTCTCCCACTCTATACTAATCTCACTATTTGTATTGAGCCCAAAATAAC 
TATGAAAGGAGACAAAAATTTGTGACAAAGGATTGTGAAGAGCTTTCCATCTTCATGATGTT 
ATGAGGATTGTTGACAAACATTAGAAATATATAATGGAGCAATTGTGGATTTCCCCTCAAAT 
CAGATGCCTCTAAGGACTTTCCTGCTAGATATTTCTGGAAGGAGAAAATACAACATGTCATT 
TATCAACGTCCTTAGAAAGAATTCTTCTAGAGAAAAAGGGATCTAGGAATGCTGAAAGATTA 
CCCAACATACCATTATAGTCTCTTCTTTCTGAGAAAATGTGAAACCAGAATTGCAAGACTGG 
GTGGACTAGAAAGGGAGATTAGATCAGTTTTCTCTTAATATGTCAAGGAAGGTAGCCGGGCA 
TGGTGCCAGGCACCTGTAGGAAAATCCAGCAGGTGGAGGTTGCAGTGAGCCGAGATTATGCC 
ATTGCACTCCAGCCTGGGTGACAGAGCGGGACTCCGTCTC 
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FiniJRE 24 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA45419 
xsubunit 1 of 1, 373 aa, 1 stop 
xMW: 41281, pi: 8.33, NX(S/T): 3 

MSLLLLLLLVSYYVGTLGTHTEIKRVAEEKVTLPCHHQLGLPEKDTLDIEWLLTDNEGNQKV 
VITYSSRHVYNNLTEEQKGRVAFASNFLAGDASLQIEPLKPSDEGRYTCKVKNSGRYVWSHV 
ILKVLVRPSKPKCELEGELTEGSDLTLQCESSSGTEPIVYYWQRIREKEGEDERLPPKSRID 
YNHPGRVLLQNLTMSYSGLYQCTAGNEAGKESCWRVTVQYVQSIGMVAGAVTGIVAGALLI 
FLLVWLLIRRKDKERYEEEERPNEIREDAEAPKARLVKPSSSSSGSRSSRSGSSSTRSTANS 
ASRSQRTLSTDAAPQPGLATQAYSLVGPEVRGSEPKKVHHANLTKAETTPSMIPSQSRAFQTV 

Transmembrane domain: 
amino acids 221-254 



BNSOOCIO: <W0 9946281 A2.L> 
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FIGURE 7^ 

5TE?II^'^'^^'^<^^TCTCTCGCGCCCAGTCCTCCTCCCTGGTTCTCCTCAGCCGCTGTCGGAG 

GAGAGCACCCGGAGACGCGGGCTGCAGTCGCGGCGGCTTCTCCCCGCCTGGGCGGCCTCGCC 

GCTGGGCAGGTGCTGAGCGCCCCTAGAGCCTCCCTTGCCGCCTCCCTCCTCTGCCCGGCCGC 

AGCAGTGCACATGGGGTGTTGGAGGTAGATGGGCTCCCGGCCCGGGAGGCGGCGGTGGATGC 

SSSSSI^^^^-^^^^^^^^^^^^^^TTCCAGCTGCCCCGCGCGCCCCGGGCGCCCCTGCGAG 

TCCCCGGTTCAGCC&IfiGGGACCTCTCCGAGCAGCAGCACCGCCCTCGCCTCCTGCAGCCGC 

ATCGCCCGCCGAGCCACAGCCACGATGATCGCGGGCTCCCTTCTCCTGCTTGGATTCCTTAG 

CACCACCACAGCTCAGCCAGAACAGAAGGCCTCGAATCTCATTGGCACATACCGCCATGTTG 

ACCGTGCCACCGGCCAGGTGCTAACCTGTGACAAGTGTCCAGCAGGAACCTATGTCTCTGAG 

CATTGTACCAACACAAGCCTGCGCGTCTGCAGCAGTTGCCCTGTGGGGACCTTTACCAGGCA 

TGAGAATGGCATAGAGAAATGCCATGACTGTAGTCAGCCATGCCCATGGCCAATGATTGAGA 

AATTACCTTGTGCTGCCTTGACTGACCGAGAATGCACTTGCCCACCTGGCATGTTCCAGTCT 

AACGCTACCTGTGCCCCCCATACGGTGTGTCCTGTGGGTTGGGGTGTGCGGAAGAAAGGGAC ' 

AGAGACTGAGGATGTGCGGTGTAAGCAGTGTGCTCGGGGTACCTTCTCAGATGTGCCTTCTA 

GTGTGATGAAATGCAAAGCATACACAGACTGTCTGAGTCAGAACCTGGTGGTGATCAAGCCG 

GGGACCAAGGAGACAGACAACGTCTGTGGCACACTCCCGTCCTTCTCCAGCTCCACCTCACC 

TTCCCCTGGCACAGCCATCTTTCCACGCCCTGAGCACATGGAAACCCATGAAGTCCCTTCCT 

CCACTTATGTTCCCAAAGGCATGAACTCAACAGAATCCAACTCTTCTGCCTCTGTTAGACCA 

AAGGTACTGAGTAGCATCCAGGAAGGGACAGTCCCTGACAACACAAGCTCAGCAAGGGGGAA 

GGAAGACGTGAACAAGACCCTCCCAAACCTTCAGGTAGTCAACCACCAGCAAGGCCCCCACC 

ACAGACACATCCTGAAGCTGCTGCCGTCCATGGAGGCCACTGGGGGCGAGAAGTCCAGCACG 

CCCATCAAGGGCCCCAAGAGGGGACATCCTAGACAGAACCTACACAAGCATTTTGACATCAA 

TGAGCATTTGCCCTGGATGATTGTGCTTTTCCTGCTGCTGGTGCTTGTGGTGATTGTGGTGT 

GCAGTATCCGGAAAAGCTCGAGGACTCTGAAAAAGGGGCCCCGGCAGGATCCCAGTGCCATT 

GTGGAAAAGGCAGGGCTGAAGAAATCCATGACTCCAACCCAGAACCGGGAGAAATGGATCTA 

CTACTGCAATGGCCATGGTATCGATATCCTGAAGCTTGTAGCAGCCCAAGTGGGAAGCCAGT 

GGAAAGATATCTATCAGTTTCTTTGCAATGCCAGTGAGAGGGAGGTTGCTGCTTTCTCCAAT 

GGGTACACAGCCGACCACGAGCGGGCCTACGCAGCTCTGCAGCACTGGACCATCCGGGGCCC 

CGAGGCCAGCCTCGCCCAGCTAATTAGCGCCCTGCGCCAGCACCGGAGAAACGATGTTGTGG 

AGAAGATTCGTGGGCTGATGGAAGACACCACCCAGCTGGAAACTGACAAACTAGCTCTCCCG 

ATGAGCCCCAGCCCGCTTAGCCCGAGCCCCATCCCCAGCCCCAACGCGAAACTTGAGAATTC 

CGCTCTCCTGACGGTGGAGCCTTCCCCACAGGACAAGAACAAGGGCTTCTTCGTGGATGAGT 

CGGAGCCCCTTCTCCGCTGTGACTCTACATCCAGCGGCTCCTCCGCGCTGAGCAGGAACGGT 

TCCTTTATTACCAAAGAAAAGAAGGACACAGTGTTGCGGCAGGTACGCCTGGACCCCTGTGA 

CTTGCAGCCTATCTTTGATGACATGCTCCACTTTCTAAATCCTGAGGAGCTGCGGGTGATTG 

AAGAGATTCCCCAGGCTGAGGACAAACTAGACCGGCTATTCGAAATTATTGGAGTCAAGAGC 

CAGGAAGCCAGCCAGACCCTCCTGGACTCTGTTTATAGCCATCTTCCTGACCTGCTGTAC3AA 

CATAGGGATACTGCATTCTGGAAATTACTCAATTTAGTGGCAGGGTGGTTTTTTAATTTTCT 

TCTGTTTCTGATTTTTGTTGTTTGGGGTGTGTGTGTGTGTTTGTGTGTGTGTGTGTGTGTGT 

GTGTGTGTGTGTGTTTAACAGAGAATATGGCCAGTGCTTGAGTTCTTTCTCCTTCTCTCTCT 

CTCTTTTTTTTTTAAATAACTCTTCTGGGAAGTTGGTTTATAAGCCTTTGCCAGGTGTAACT 

GTTGTGAAATACCCACCACTAAAGTTTTTTAAGTTCCATATTTTCTCCATTTTGCCTTCTTA 

TGTATTTTCAAGATTATTCTGTGCACTTTAAATTTACTTAACTTACCATAAATGCAGTGTGA 

CTTTTCCCACACACTGGATTGTGAGGCTCTTAACTTCTTAAAAGTATAATGGCATCTTGTGA 

ATCCTATAAGCAGTCTTTATGTCTCTTAACATTCACACCTACTTTTTAAAAACAAATATTAT 

TACTATTTTTATTATTGTTTGTCCTTTATAAATTTTCTTAAAGATTAAGAAAATTTAAGACC 

CCATTGAGTTACTGTAATGCAATTCAACTTTGAGTTATCTTTTAAATATGTCTTGTATAGTT 

CATATTCATGGCTGAAACTTGACCACACTATTGCTGATTGTATGGTTTTCACCTGGACACCG 

TGTAGAATGCTTGATTACTTGTACTCTTCTTATGCTAATATGCTCTGGGCTGGAGAAATGAA 

ATCCTCAAGCCATCAGGATTTGCTATTTAAGTGGCTTGACAACTGGGCCACCAAAGAACTTG 

AACTTCACCTTTTAGGATTTGAGCTGTTCTGGAACACATTGCTGCACTTTGGAAAGTCAAAA 

TCAAGTGCCAGTGGCGCCCTTTCCATAGAGAATTTGCCCAGCTTTGCTTTAAAAGATGTCTT 

GTTTTTTATATACACATAATCAATAGGTCCAATCTGCTCTCAAGGCCTTGGTCCTGGTGGGA 

TTCCTTCACCAATTACTTTAATTAAAAATGGCTGCAACTGTAAGAACCCTTGTCTGATATAT 

TTGCAACTATGCTCCCATTTACAAATGTACCTTCTAATGCTCAGTTGCCAGGTTCCAATGCA 

AAGGTGGCGTGGACTCCCTTTGTGTGGGTGGGGTTTGTGGGTAGTGGTGAAGGACCGATATC 

AGAAAAATGCCTTCAAGTGTACTAATTTATTAATAAACATTAGGTGTTTGTTAAAAAAAAAA 
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FIGURE 26 

></usr/seqdb2/sst/DNA/Dnaseqs.roin/ss.DNA52594 
xsubunit 1 of 1, 655 aa, 1 stop 
xMW: 71845, pi: 8.22, NX(S/T): 8 

MGTSPSSSTALASCSRIARRATATMIAGSLLLLGFLSTTTAQPEQKASNLIGTYRHVDRATG 
QVLTCDKCPAGTYVSEHCTNTSLRVCSSCPVGTFTRHENGIEKCHDCSQPCPWPMIEKLPCA 
ALTDRECTCPPGMFQSNATCAPHTVCPVGWGVRKKGTETEDVRCKQCARGTFSDVPSSVMKC 
KAYTDCLSQNLWIKPGTKETDNVCGTLPSFSSSTSPSPGTAIFPRPEHMETHEVPSSTYVP 
KGMNSTESNSSASVRPKVLSSIQEGTVPDNTSSARGKEDVNKTLPNLQWNHQQGPHHRHIL 
KLLPSMEATGGEKSSTPIKGPKRGHPRQNLHKHFDINEHLPWMIVLFLLLVLWIWCSIRK 
SSRTLKKGPRQDPSAIVEKAGLKKSMTPTQNREKWIYYCNGHGIDILKLVAAQVGSQWKDIY 
QFLCNASEREVAAFSNGYTADHERAYAALQHWTIRGPEASLAQLISALRQHRRNDWEKIRG 
LMEDTTQLETDKLALPMSPSPLSPSPIPSPNAKLENSALLTVEPSPQDKNKGFFVDESEPLL 
RCDSTS SGSS ALSRNGS F ITKEKKDTVLRQVRLDPCDLQP I FDDMLHFLNPEELRVIEEIPQ 
AEDKLDRLFE 1 1 G VKSQEASQTLLDS VYSHLPDLL 
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ATGGGAAGCCAGTAACACTGTGGCCTACTATCTCTTCCGTGGTGCCATCTACATTTTTGGGA 
CTCGGGAATTATGAGGTAGAGGTGGAGGCGGAGCCGGATGTCAGAGGTCCTGAAATAGTCAC 
C&IfiGGGGAAAATGATCCGCCTGCTGTTGAAGCCCCCTTCTCATTCCGATCGCTTTTTGGCC 
TTGATGATTTGAAAATAAGTCCTGTTGCACCAGATGCAGATGCTGTTGCTGCACAGATCCTG 
TCACTGCTGCCATTGAAGTTTTTTCCAATCATCGTCATTGGGATCATTGCATTGATATTAGC 
ACTGGCCATTGGTCTGGGCATCCACTTCGACTGCTCAGGGAAGTACAGATGTCGCTCATCCT 
TTAAGTGTATCGAGCTGATAGCTCGATGTGACGGAGTCTCGGATTGCAAAGACGGGGAGGAC 
GAGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCCGTGCTCCAGGTGTTCACAGCTGCTTC 
GTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCCTGTGCCCAAC 
TGGGTTTCCCAAGCTATGTGAGTTCAGATAACCTCAGAGTGAGCTCGCTGGAGGGGCAGTTC 
CGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTGACTGCATTACACCA 
CTCAGTATATGTGAGGGAGGGATGTGCCTCTGGCCACGTGGTTACCTTGCAGTGCACAGCCT 
GTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGAAACATGTCCTTGCTCTCGCAG 
TGGCCCTGGCAGGCCAGCCTTCAGTTCCAGGGCTACCACCTGTGCGGGGGCTCTGTCATCAC 
GCCCCTGTGGATCATCACTGCTGCACACTGTGTTTATGACTTGTACCTCCCCAAGTCATGGA 
CCATCCAGGTGGGTCTAGTTTCCCTGTTGGACAATCCAGCCCCATCCCACTTGGTGGAGAAG 
ATTGTCTACCACAGCAAGTACAAGCCAAAGAGGCTGGGCAATGACATCGCCCTTATGAAGCT 
GGCCGGGCCACTCACGTTCAATGAAATGATCCAGCCTGTGTGCCTGCCCAACTCTGAAGAGA 
ACTTCCCCGATGGAAAAGTGTGCTGGACGTCAGGATGGGGGGCCACAGAGGATGGAGGTGAC 
GCCTCCCCTGTCCTGAACCACGCGGCCGTCCCTTTGATTTCCAACAAGATCTGCAACCACAG 
GGACGTGTACGGTGGCATCATCTCCCCCTCCATGCTCTGCGCGGGCTACCTGACGGGTGGCG 
TGGACAGCTGCCAGGGGGACAGCGGGGGGCCCCTGGTGTGTCAAGAGAGGAGGCTGTGGAAG 
TTAGTGGGAGCGACCAGCTTTGGCATCGGCTGCGCAGAGGTGAACAAGCCTGGGGTGTACAC 
CCGTGTCACCTCCTTCCTGGACTGGATCCACGAGCAGATGGAGAGAGACCTAAAAACCTQAA 
GAGGAAGGGGACAAGTAGCCACCTGAGTTCCTGAGGTGATGAAGACAGCCCGATCCTCCCCT 
GGACTCCCGTGTAGGAACCTGCACACGAGCAGACACCCTTGGAGCTCTGAGTTCCGGCACCA 
GTAGCAGGCCCGAAAGAGGCACCCTTCCATCTGATTCCAGCACAACCTTCAAGCTGCTTTTT 
QTTTVTT:GTTTTTTTGhGGT:GGAGTCTCGCi:CTGTTGCCCAGGCTGGAGTGChGTGGCGMA 
TCCCTGCTCACTGCAGCCTCCGCTTCCCTGGTTCAAGCGATTCrCTTGCCTCAGCTTCCCCA 
GTAGCTGGGACCACAGGTGCCCGCCACCACACCCAACTAATTTTTGTATTTTTAGTAGAGAC 
AGGGTTTCACCATGTTGGCCAGGCTGCTCTCAAACCCCTGACCTCAAATGATGTGCCTGCTT 
CAGCCTCCCACAGTGCTGGGATTACAGGCATGGGCCACCACGCCTAGCCTCACGCTCCTTTC 
TGATCTTCACTAAGAACAAAAGAAGCAGCAACTTGCAAGGGCGGCCTTTCCCACTGGTCCAT 
CTGGTTTTCTCTCCAGGGTCTTGCAAAATTCCTGACGAGATAAGCAGTTATGTGACCTCACG 
TGCAAAGCCACCAACAGCCACTCAGAAAAGACGCACCAGCCCAGAAGTGCAGAACTGCAGTC 
ACTGCACGTTTTCATCTCTAGGGACCAGAACCAAACCCACCCTTTCTACTTCCAAGACTTAT 
TTTCACATGTGGGGAGGTTAATCTAGGAATGACTCGTTTAAGGCCTATTTTCATGATTTCTT 
TGTAGCATTTGGTGCTTGACGTATTATTGTCCTTTGATTCCAAATAATATGTTTCCTTCCCT 
CATTGTCTGGCGTGTCTGCGTGGACTGGTGACGTGAATCAAAATCATCCACTGAAA 
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FIGURE 28 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss.DNA45234 
xsubunit 1 of 1, 453 aa, 1 sCop 
xMW: 49334, p.1 : 6.32, NX(S/T): 1 

MGENDPPAVEAPFSFRSLFGLDDLKISPVAPDADAVAAQILSLLPLKFFPIIVIGIIALILA 
LAIGLGIHFDCSGKYRCRSSFKCIELIARCDGVSDCKDGEDEYRCVRVGGQNAVLQVFTAAS 
WKTMCSDDWKGHYANVACAQLGFPSYVSSDNLRVSSLEGQFREEFVSIDHLLPDDKVTALHH 
SVYVREGCASGHWTLQCTACGHRRGYSSRIVGGNMSLLSQWPWQASLQFQGYHLCGGSVIT 
PLWIITAAHCVYDLYLPKSWTIQVGLVSLLDNPAPSHLVEKIVYHSKYKPKRLGNDIALMKL 
AGPLTFNEM I QPVCLPNSEENFPDGKVCWTSGWGATEDGGDAS PVLNHAAVPL I SNKI CNHR 
DVYGGIISPSMLCAGYLTGGVDSCQGDSGGPLVCQERRLWKLVGATSFGIGCAEVNKPGVYT 

RVTSFLDWIHEQMERDLKT 
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FIGURE 29 

CCCACGCGTCCGTCCTAGTCCCCGGGCCAACTCGGACAGTTTGCTCATTTATTGCAACGGTC 

AAGGCTGGCTTGTGCCAGAACGGCGCGCGCGCGCGCACGCACGCACACACACGGGGGGAAAC 

TTTTTTAAAAATGAAAGGCTAGAAGAGCTCAGCGGCGGCGCGGGCGCTGCGCGAGGGCTCCG 

GAGCTGACTCGCCGAGGCAGGAAATCCCTCCGGTCGCGACGCCCGGCCCCGGCTCGGCGCCC 

GCGTGGGATGGTGCAGCGCTCGCCGCCGGGCCCGAGAGCTGCTGCACTGAAGGCCGGCGACG 

AlfiGCAGCGCGCCCGCTGCCCGTGTCCCCCGCCCGCGCCCTCCTGCTCGCCCTGGCCGGTGC 

TCTGCTCGCGCCCTGCGAGGCCCGAGGGGTGAGCTTATGGAACCAAGGAAGAGCTGATGAAG 

TTGTCAGTGCCTCTGTTCGGAGTGGGGACCTCTGGATCCCAGTGAAGAGCTTCGACTCCAAG 

AATCATCCAGAAGTGCTGAATATTCGACTACAACGGGAAAGCAAAGAACTGATCATAAATCT 

GGAAAGAAATGAAGGTCTCATTGCCAGCAGTTTCACGGAAACCCACTATCTGCAAGACGGTA 

CTGATGTCTCCCTCGCTCGAAATTACACGGGTCACTGTTACTACCATGGACATGTACGGGGA 

TATTCTGATTCAGCAGTCAGTCTCAGCACGTGTTCTGGTCTCAGGGGACTTATTGTGTTTGA 

AAATGAAAGCTATGTCTTAGAACCAATGAAAAGTGCAACCAACAGATACAAACTCTTCCCAG 

CGAAGAAGCTGAAAAGCGTCCGGGGATCATGTGGATCACATCACAACACACCAAACCTCGCT 

GCAAAGAATGTGTTTCCACCACCCTCTCAGACATGGGCAAGAAGGCATAAAAGAGAGACCCT 

CAAGGCAACTAAGTATGTGGAGCTGGTGATCGTGGCAGACAACCGAGAGTTTCAGAGGCAAG 

GAAAAGATCTGGAAAAAGTTAAGCAGCGATTAATAGAGATTGCTAATCACGTTGACAAGTTT 

TACAGACCACTGAACATTCGGATCGTGTTGGTAGGCGTGGAAGTGTGGAATGACATGGACAA 

ATGCTCTGTAAGTCAGGACCCATTCACCAGCCTCCATGAATTTCTGGACTGGAGGAAGATGA 

AGCTTCTACCTCGCAAATCCCATGACAATGCGCAGCTTGTCAGTGGGGTTTATTTCCAAGGG 

ACCACCATCGGCATGGCCCCAATCATGAGCATGTGCACGGCAGACCAGTCTGGGGGAATTGT 

CATGGACCATTCAGACAATCCCCTTGGTGCAGCCGTGACCCTGGCACATGAGCTGGGCCACA 

ATTTCGGGATGAATCATGACACACTGGACAGGGGCTGTAGCTGTCAAATGGCGGTTGAGAAA 

GGAGGCTGCATCATGAACGCTTCCACCGGGTACCCATTTCCCATGGTGTTCAGCAGTTGCAG 

CAGGAAGGACTTGGAGACCAGCCTGGAGAAAGGAATGGGGGTGTGCCTGTTTAACCTGCCGG 

AAGTCAGGGAGTCTTTCGGGGGCCAGAAGTGTGGGAACAGATTTGTGGAAGAAGGAGAGGAG 

TGTGACTGTGGGGAGCCAGAGGAATGTATGAATCGCTGCTGCAATGCCACCACCTGTACCCT 

GAAGCCGGACGCTGTGTGCGCACATGGGCTGTGCTGTGAAGACTGCCAGCTGAAGCCTGCAG 

GAACAGCGTGCAGGGACTCCAGCAACTCCTGTGACCTCCCAGAGTTCTGCACAGGGGCCAGC 

CCTCACTGCCCAGCCAATGTGTACCTGCACGATGGGCACTCATGTCAGGATGTGGACGGCTA 

CTGCTACAATGGCATCTGCCAGACTCACGAGCAGCAGTGTGTCACGCTCTGGGGACCAGGTG 

CTAAACCTGCCCCTGGGATCTGCTTTGAGAGAGTCAATTCTGCAGGTGATCCTTATGGCAAC 

TGTGGCAAAGTCTCGAAGAGTTCCTTTGCCAAATGCGAGATGAGAGATGCTAAATGTGGAAA 

AATCCAGTGTCAAGGAGGTGCCAGCCGGCCAGTCATTGGTACCAATGCCGTTTCCATAGAAA 

CAAACATCCCTCTGCAGCAAGGAGGCCGGATTCTGTGCCGGGGGACCCACGTGTACTTGGGC 

GATGACATGCCGGACCCAGGGCTTGTGCTTGCAGGCACAAAGTGTGCAGATGGAAAAATCTG 

CCTGAATCGTCAATGTCAAAATATTAGTGTCTTTGGGGTTCACGAGTGTGCAATGCAGTGCC 

ACGGCAGAGGGGTGTGCAACAACAGGAAGAACTGCCACTGCGAGGCCCACTGGGCACCTCCC 

TTCTGTGACAAGTTTGGCTTTGGAGGAAGCACAGACAGCGGCCCCATCCGGCAAGCAGAAGC 

AAGGCAGGAAGCTGCAGAGTCCAACAGGGAGCGCGGCCAGGGCCAGGAGCCCGTGGGATCGC 

AGGAGCATGCGTCTACTGCCTCACTGACACTCATCIS&GCCCTCCCATGACATGGAGACCGT 

GACCAGTGCTGCTGCAGAGGAGGTCACGCGTCCCCAAGGCCTCCTGTGACTGGCAGCATTGA 

CTCTGTGGCTTTGCCATCGTTTCCATGACAACAGACACAACACAGTTCTCGGGGCTCAGGAG 

GGGAAGTCCAGCCTACCAGGCACGTCTGCAGAAACAGTGCAAGGAAGGGCAGCGACTTCCTG 

GTTGAGCTTCTGCTAAAACATGGACATGCTTCAGTGCTGCTCCTGAGAGAGTAGCAGGTTAC 

CACTCTGGCAGGCCCCAGCCCTGCAGCAAGGAGGAAGAGGACTCAAAAGTCTGGCCTTTCAC 

TGAGCCTCCACAGCAGTGGGGGAGAAGCAAGGGTTGGGCCCAGTGTCCCCTTTCCCCAGTGA 

CACCTCAGCCTTGGCAGCCCTGATGACTGGTCTCTGGCTGCAACTTAATGCTCTGATATGGC 

TTTTAGCATTTATTATATGAAAATAGCAGGGTTTTAGTTTTTAATTTATCAGAGACCCTGCC 

ACCCATTCCATCTCCATCCAAGCAAACTGAATGGCAATGAAACAAACTGGAGAAGAAGGTAG 

GAGAAAGGGCGGTGAACTCTGGCTCTTTGCTGTGGACATGCGTGACCAGCAGTACTCAGGTT 

TGAGGGTTTGCAGAAAGCCAGGGAACCCACAGAGTCACCAACCCTTCATTTAACAAGTAAGA 

ATGTTAAAAAGTGAAAACAATGTAAGAGCCTAACTCCATCCCCCGTGGCCATTACTGCATAA 
AATAGAGTGCATTTGAAAT 
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></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA49624 
xsubunit 1 of 1, 735 aa, 1 stop 
XMW: 80177, pi: 7.08, NX(S/T): 5 

MAARPLPVSPARALLLALAGALLAPCEARGVSLWNQGRADEWSASVRSGDLWIPVKSFDSK 
NHPEVLNIRLQRESKELIINLERNEGLIASSFTETHYLQDGTDVSLARNYTGHCYYHGHVRG 

ysdsavslstcsglrglivfenesyvlepmksatnryklfpakklksvrgscgshhntpnla 
aknvf?ppsqtwarrhkretlkatkyvelvivadnrefqrqgkdlekvkqrliei;\nhvdkf 
yrplnirivlvgvevwndmdkcsvsqdpftslhefldwrkmkllprkshdnaqlvsgvyfqg 
ttigmapimsmctadqsggivmdhsdnplgaavtlahelghnfgmnhdtldrgcscqmavek 

./ggcimnastgypfpmvfsscsrkdletslekgmgvclfnlpevresfggqkcgnr'fveegee- 
cdcgepeecmnrccnattctlkpdavcahglccedcqlkpagtacrdssnscdlpefctgas 

' ...phcpanvylhdghscqdvdgycyngicqtheqqcvtlwgpgakpapgicfervnsagdpygn 
cgkvskssfakcemrdakcgkiqcqggasrpvigtnavsietniplqqggrilcrgthvylg 
• ddmpdpglvlagtkcadgki clnrqcqnisvfgvhecamqchgrgvcnnrknchceahwap'p. 
' fcdkfgfggstdsgp irqaearqeaaesnrergqgqepvgsqehastasltli 
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FIGURE 31 

TCCCAAGGCTTCTTGGATGGCAGATGATTNTGGGGTTTTGCATTGTTTCCCTGACAACGAAA 
ACAAAACAGTTTTGGGGGTTCAGGAGGGGAANTCCAGCCTACCCAGGAAGTTTGCAGAAACA 
GTGCAAGGAAGGGCAGGANTTCCTGGTTGAGNTTTTTGNTAAAACATGGACATGNTTCAGTG 
CTGCTCNTGAGAGAGTAGCAGGTTACCACTTTTGGCAGGCCCCAGCCCTGCAGCAAGGAGGA 
AGAGGACTCAAAAGTTTGGCCTTTCACTGAGCCTCCACAGCAGTGGGGGAGAAGCAAGGGTT 
GGGCCCAGTGTCCCCTTTCCCCAGTGACACCTCAGCCTTGGCAGCCCTGATAACTGGTNTNT 
GGCTGCAANTTAATGCTNTGATATGGCTTTTAGCATTTATTATATGAAAATAGCAGGGTTTT 
AGTTTTTAATTTATCAGAGACCCTGCCACCCATTCCATNTCCATCCAAG 
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FIGURE 32 



CATCCTGCAACATGGTGAAACCACGCCTGGCTAATTTTGTTGTATTTTTGGTAGAGATGGGA 

TTTCACCGTGTTAGCCAGGATTGTCTCAATCTGACCTCATGATCTGCCCGCCTCGGCCTCCC 

AAAGTGCTGGGATTACAGGCGAGTGCAACCACACCCGGCCACAAACTTTTTAAGAAGTTAAT 

GAAACCATACCTTTTACATTTTTAATGACAGGAAAATGCTCACAATAATTGTTAACCCAAAA 

TTCTGGATACAAAAGTACAATCTTTACTGTGTAAATACATGTATATGTACTATATGAAAATA 

TACCAAATATCAATAATACTTATCTCTGGGTAAAAACCTCTTCTCATACCCTGTGCTAACAA 

CTTTTAACAAAAAATTTGCATCACTTTTAAGAATCAAGAAAAATTTCTGAAGGTCATATGGG 

ACAGAAAAAAAAACCAAGGGAAAAATCACGCCACTTGGGAAAAAAAGATTCGAAATCTGCCT 

TTTTATAGATTTGTAATTAATAAGGTCCAGGCTTTCTAAGCAACTTAAATGTTTTGTTTCGA 

AACAAAGTACTTGTCTGGATGTAGGAGGAAAGGGAGTGATGTCACTGCCATTATGATGCCCC 

TTGAATATAAGACCCTACTTGCTATCTCCCCTGCACCAGCCAGGAGCCACCCATCCTCCAGC 

ACACTGAGCAGCAAGCTGGACACACGGCACACTGATCCAAAlfiGGTAAGGGGATGGTGGCGA 

TGCTCATTCTGGGTCTGCTACTTCTGGCGCTGCTCCTACCCGTGCAGGTTTCTTCATTTGTT 

CCTTTAACCAGTATGCCGGAAGCTACTGCAGCCGAAACCACAAAGCCCTCCAACAGTGCCCT 

ACAGCCTACAGCCGGTCTCCTTGTGGTCTTGCTTGCCCTTCTACATCTCTACCATia&GAGG 

CAGGTCAAGAAACAGCTACAGTTCTCCAACCCATACACTAAAACCGAATCCAAATGGTGCCT 

AGAAGTTCAATGTGGCAAGGAAAAAAACCAGGTCTTCATCAAATCTACTAATTTCACTCCTT 

ATTAACAGAGAAACGCTTGAGAGTCTCAAACTGGACTGGTTTAAAGAGCATCTGAAGGATTT 

GACTAGATGATAAATGCCTGTACTCCCAGTACTTTGGGAGGCCTAGGCCGGCGGATCACCTG 

AGGTCAGGAGTTTGAGACTAACCTGGCCAAAATGGTGAAACCCCATCTGTACTAAAAATACA 

AATATTGACTGGGCGTGGTGGTGAGTGCCTGTGATCCCAGCTACTCAGGTGGCTGAAGCAGG 

ACAATCACTTGAACTCAGGAGGCAGAGGTTGCAGTGAGCTGAGATCGCGCTACTGCACTCTA 

GCCTAGCCTGGGCAACAGAGTGAGACTTCGTCTCAAAAAAAAAAAAGCCAAGTGCAGTGGCT 

CACGCCTGTAATCCCGGCACTTTGGGAGGCCGAGGTGGGCGGATCACGAGGTCAGGAGATCA 

AGACCATCCTGGCTAATACAGTGAAACCCTGTCTCTACTAAAAATACAAAAAATTAGCCGGG 

GATGGTGGCAGGCACCTGGAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATAGCGTGAA 

CTCAGGAGGCGGAGCTTGCAGTGAGCCGAGATTGCGCTACTGCACTCCAGCCTGGGCGACAG 

CGCGAGACTCCGTCTCAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 33 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss.DNA48309 
xsubunit 1 of 1, 67 aa, 1 stop 
xMW: 6981, pi: 7.47, NX(S/T>: 0 

MGKGMVAMLILGLLLLALLLPVQVSSFVPLTSMPEATAAETTKPSNSALQPTAGLLWLLAL 
LHLYH 
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FIGURE 34 



GCCGCGGCGAGAGCGCGCCCAGCCCCGCCGCG^CCCGCGCGCCCAGGACGCCTCCTCCCG 

CTGCTGGCCCGGCCGGCGGCCCTGACTGCGCTGCTGCTGCTGCTGCTGGGCCATGGCGGCGG 

CGGGCGCTGGGGCGCCCGGGCCCAGGAGGCGGCGGCGGCGGCGGCGGACGGGCCCCCCGCGG 

CAGACGGCGAGGACGGACAGGACCCGCACAGCAAGCACCTGTACACGGCCGACATGTTCACG 

CACGGGATCCAGAGCGCCGCGCACTTCGTCATGTTCTTCGCGCCCTGGTGTGGACACTGCCA 

GCGGCTGCAGCCGACTTGGAATGACCTGGGAGACAAATACAACAGCATGGAAGATGCCAAAG 

TCTATGTGGCTAAAGTGGACTGCACGGCCCACTCCGACGTGTGCTCCGCCCAGGGGGTGCGA 

GGATACCCCACCTTAAAGCTTTTCAAGCCAGGCCAAGAAGCTGTGAAGTACCAGGGTCCTCG 

GGACTTCCAGACACTGGAAAACTGGATGCTGCAGACACTGAACGAGGAGCCAGTGACACCAG 

AGCCGGAAGTGGAACCGCCCAGTGCCCCCGAGCTCAAGC^AGGGCTGTATGAGCTCTCAGCA 

AGCAACTTTGAGCTGCACGTTGCACAAGGCGACCACTTTATCAAGTTCTTCGCTCCGTGGTG 

TGGTCACTGCAAAGCCCTGGCTCCAACCTGGGAGCAGCTGGCTCTGGGCCTTGAACATTCCG 

AAACTGTCAAGATTGGCAAGGTTGATTGTACACAGCACTATGAACTCTGCTCCGGAAACCAG 

GTTCGTGGCTATCCCACTCTTCTCTGGTTCCGAGATGGGAi^AAAGGTGGATCAGTACAAGGG 

AAAGCGGGATTTGGAGTCACTGAGGGAGTACGTGGAGTCGCAGCTGCAGCGCACAGAGACTG 

GAGCGACGGAGACCGTCACGCCCTCAGAGGCCCCGGTGCTGGCAGCTGAGCCCGAGGCTGAC 

AAGGGCACTGTGTTGGCACTCACTGAAAATAACTTCGATGACACCATTGCAGAAGGAATAAC 

CTTCATCAAGTTTTATGCTCCATGGTGTGGTCATTGTAAGACTCTGGCTCCTACTTGGGAGG 

AACTCTCTAAAAAGGAATTCCCTGGTCTGGCGGGGGTCAAGATCGCCGAAGTAGACTGCACT 

GCTGAACGGAATATCTGCAGCAAGTATTCGGTACGAGGCTACCCCACGTTATTGCTTTTCCG 

AGGAGGGAAGAAAGTCAGTGAGCACAGTGGAGGCAGAGACCTTGACTCGTTACACCGCTTTG 

TCCTGAGCCAAGCGAAAGACGAACTTT^GAACACAGTTGGAGGTCACCTCTCCTGCCCAGC 

TCCCGCACCCTGCGTTTAGGAGTTCAGTCCCACAGAGGCCACTGGGTTCCCAGTGGTGGCTG 

TTCAGAAAGCAGAACATACTAAGCGTGAGGTATCTTCTTTGTGTGTGTGTTTTCCAAGCCAA 

CACACTCTACAGATTCTTTATTAAGTTAAGTTTCTCTAAGTAAATGTGTAACTCATGGTCAC 

TGTGTAAACATTTTCAGTGGCGATATATCCCCTTTGACCTTCTCTTGATGAAATTTACATGG 

TTTCCTTTGAGACTAAAATAGCGTTGAGGGAAATGAAATTGCTGGACTATTTGTGGCTCCTG 

AGTTGAGTGATTTTGGTGAAAGAAAGCACATCCAAAGCATAGTTTACCTGCCCACGAGTTCT 

GGAAAGGTGGCCTTGTGGCAGTATTGACGTTCCTCTGATCTTAAGGTCACAGTTGACTCAAT 

ACTGTGTTGGTCCGTAGCATGGAGCAGATTGAAATGCAAAAACCCACACCTCTGGAAGATAC 

CTTCACGGCCGCTGCTGGAGCTTCTGTTGCTGTGAATACTTCTCTCAGTGTGAGAGGTTAGC 

CGTGATGAAAGCAGCGTTACTTCTGACCGTGCCTGAGTAAGAGAATGCTGATGCCATAACTT 

TATGTGTCGATACTTGTCAAATCAGTTACTGTTCAGGGGATCCTTCTGTTTCTCACGGGGTG 

AAACATGTCTTTAGTTCCTCATGTTAACACGAAGCCAGAGCCCACATGAACTGTTGGATGTC 

TTCCTTAGAAAGGGTAGGCATGGAAAATTCCACGAGGCTCATTCTCAGTATCTCATTAACTC 

ATTGAAAGATTCCAGTTGTATTTGTCACCTGGGGTGACAAGACCAGACAGGCTTTCCCAGGC 

CTGGGTATCCAGGGAGGCTCTGCAGCCCTGCTGAAGGGCCCTAACTAGAGTTCTAGAGTTTC 

TGATTCTGTTTCTCAGTAGTCCTTTTAGAGGCTTGCTATACTTGGTCTGCTTCAAGGAGGTC 

GACCTTCTAATGTATGAAGAATGGGATGCATTTGATCTCAAGACCAAAGACAGATGTCAGTG 

GGCTGCTCTGGCCCTGGTGTGCACGGCTGTGGCAGCTGTTGATGCCAGTGTCCTCTAACTCA 

TGCTGTCCTTGTGATTAAACACCTCTATCTCCCTTGGGAATAAGCACATACAGGCTTAAGCT 

CTAAGATAGATAGGTGTTTGTCCTTTTACCATCGAGCTACTTCCCATAATAACCACTTTGCA 

TCCAACACTCTTCACCCACCTCCCATACGCAAGGGGATGTGGATACTTGGCCCAAAGTAACT 

GGTGGTAGGAATCTTAGAAACAAGACCACTTATACTGTCTGTCTGAGGCAGAAGATAACAGC 

AGCATCTCGACCAGCCTCTGCCTTAAAGGAAATCTTTATTAATCACGTATGGTTCACAGATA 

ATTCTTTTTTTAAAAAAACCCAACCTCCTAGAGAAGCACAACTGTCAAGAGTCTTGTACACA 

CAACTTCAGCTTTGCATCACGAGTCTTGTATTCCAAGAAAATCAAAGTGGTACAATTTGTTT 

GTTTACACTATGATACTTTCTAAATAAACTCTTTTTTTTTAA 
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FIGURE 3R 

></usr/seqdb2/ssc/DNA/Dnaseqs .min/ss .DNA46776 
xsubunit l of 1, 432 aa, 1 stop 
xMW: 47629, pi: 5.90, NX(S/T): 0 

MPARPGRLLFLLARPAALTALLLLLLGHGGGGRWGARAQEAAAAAADGPPAADGEDGQDPHS 
KHLYTADMFTHGIQSAAHFVMFFAPWCGHCQRLQPTWNDLGDKYNSMEDAKVYVAKVDCTAH 
SDVCSAQGVRGYPTLKLFKPGQEAVKYQGPRDFQTLENWMLQTLNEEPVTPEPEVEPPSAPE 
LKQGLYELSASNFELHVAQGDHFIKFFAPWCGHCKALAPTWEQLALGLEHSETVKIGKVDCT 
QHYELCSGNQVRGYPTLLWFRDGKKVDQYKGKRDLESLREYVESQLQRTETGATETVTPSEA 
PVLAAEPEADKGTVLALTENNFDDTIAEGITFIKFYAPWCGHCKTLAPTWEELSKKEFPGLA 
GVKIAEVDCTAERNICSKYSVRGYPTLLLFRGGKKVSEHSGGRDLDSLHRFVLSQAKDEL 
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FIfitJRE 36 

CTTTTCTGAGGAACCACAGCAMfiAATGGCTTTGCATCCTTGCTTCGAAGAAACCAATTTAT 
CCTCCTGGTACTATTTCTTTTGCAAATTCAGAGTCTGGGTCTGGATATTGATAGCCGTCCTA 
CCGCTGAAGTCTGTGCCACACACACAATTTCACCAGGACCCAAAGGAGATGATGGTGAAAAA 
GGAGATCCAGGAGAAGAGGGAAAGCATGGCAAAGTGGGACGCATGGGGCCGAAAGGAATTAA 
AGGAGAACTGGGTGATATGGGAGATCAGGGCAATATTGGCAAGACTGGGCCCATTGGGAAGA 
AGGGTGACAAAGGGGAAAAAGGTTTGCTTGGAATACCTGGAGAAAAAGGCAAAGCAGGTACT 
GTCTGTGATTGTGGAAGATACCGGAAATTTGTTGGACAACTGGATATTAGTATTGCTCGGCT 
CAAGACATCTATGAAGTTTGTCAAGAATGTGATAGCAGGGATTAGGGAAACTGAAGAGAAAT 
TCTACTACATCGTGCAGGAAGAGAAGAACTACAGGGAATCCCTAACCCACTGCAGGATTCGG 
GGTGGAATGCTAGCCATGCCCAAGGATGAAGCTGCCAACACACTCATCGCTGACTATGTTGC 
CAAGAGTGGCTTCTTTCGGGTGTTCATTGGCGTGAATGACCTTGAAAGGGAGGGACAGTACA 
TGTCCACAGACAACACTCCACTGCAGAACTATAGCAACTGGAATGAGGGGGAACCCAGCGAC 
CCCTATGGTCATGAGGACTGTGTGGAGATGCTGAGCTCTGGCAGATGGAATGACACAGAGTG 
CCATCTTACCATGTACTTTGTCTGTGAGTTCATCAAGAAGAAAAAG3A&CTTCCCTCATCCT 
ACGTATTTGCTATTTTCCTGTGACCGTCATTACAGTTATTGTTATCCATCCTTTTTTTCCTG 
ATTGTACTACATTTGATCTGAGTCAACATAGCTAGAAAATGCTAAACTGAGGTATGGAGCCT 
CCATCATCAAAAAAAAAAAAAAAA 
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FIGURE 37 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss .DNA50980 
xsubunit 1 of 1, 277 aa, 1 stop 
xMW: 30645, pi: 7.47, NX(S/T): 2 

MNGFASLLRRNQFILLVLFLLQIQSLGLDIDSRPTAEVCATHTISPGPKGDDGEKGDPGEEG 
KHGKVGRMGPKGIKGELGDMGDQGNIGKTGPIGKKGDKGEKGLLGIPGEKGKAGTVCDCGRY 
RKFVGQLDISIARLKTSMKFVKNVIAGIRETEEKFYYIVQEEKNYRESLTHCRIRGGMLAMP 

KDEAANTLIADYVAKSGFFRVFIGVNDLEREGQYMSTDNTPLQNYSNWNEGEPSDPYGHEDC 
VEMLSSGRWNDTECHLTMYFVCEFI KKKK 
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FIGURE 38 



GGTTCTATCGATTCGAATTCGGCCACACTGGCCGGATCCTCTAGAGATCCCTCGACCTCGAC 

CCACGCGTCCGCTGCTCTCCGCCCGTGTGGAGTGGTGGGGGCCTGGGTGGGAAISGGCGTGT 

GCCAGCGCACGCGCGCTCCCTGGAAGGAGAAGTCTCAGCTAGAACGAGCGGCCCTAGGTTTT 

CGGAAGGGAGGATCAGGGATGTTTGCGAGCGGCTGGAACCAGACGGTGCCGATAGAGGAAGC 

GGGCTCCATGGCTGCCCTCCTGCTGCTGCCCCTGCTGCTGTTGCTACCGCTGCTGCTGCTGA 

AGCTACACCTCTGGCCGCAGTTGCGCTGGCTTCCGGCGGACTTGGCCTTTGCGGTGCGAGCT 

CTGTGCTGCAAAAGGGCTCTTCGAGCTCGCGCCCTGGCCGCGGCTGCCGCCGACCCGGAAGG 

TCCCGAGGGGGGCTGCAGCCTGGCCTGGCGCCTCGCGGAACTGGCCCAGCAGCGCGCCGCGC 

ACACCTTTCTCATTCACGGCTCGCGGCGCTTTAGCTACTCAGAGGCGGAGCGCGAGAGTAAC 

AGGGCTGCACGCGCCTTCCTACGTGCGCTAGGCTGGGACTGGGGACCCGACGGCGGCGACAG 

CGGCGAGGGGAGCGCTGGAGAAGGCGAGCGGGCAGCGCCGGGAGCCGGAGATGCAGCGGCCG 

GAAGCGGCGCGGAGTTTGCCGGAGGGGACGGTGCCGCCAGAGGTGGAGGAGCCGCCGCCCCT 

CTGTCACCTGGAGCAACTGTGGCGCTGCTCCTCCCCGCTGGCCCAGAGTTTCTGTGGCTCTG 

GTTCGGGCTGGCCAAGGCCGGCCTGCGCACTGCCTTTGTGCCCACCGCCCTGCGCCGGGGCC 

CCCTGCTGCACTGCCTCCGCAGCTGCGGCGCGCGCGCGCTGGTGCTGGCGCCAGAGTTTCTG 

GAGTCCCTGGAGCCGGACCTGCCCGCCCTGAGAGCCATGGGGCTCCACCTGTGGGCTGCAGG 

CCCAGGAACCCACCCTGCTGGAATTAGCGATTTGCTGGCTGAAGTGTCCGCTGAAGTGGATG 

GGCCAGTGCCAGGATACCTCTCTTCCCCCCAGAGCATAACAGACACGTGCCTGTACATCTTC 

ACCTCTGGCACCACGGGCCTCCCCAAGGCTGCTCGGATCAGTCATCTGAAGATCCTGCAATG 

CCAGGGCTTCTATCAGCTGTGTGGTGTCCACCAGGAAGATGTGATCTACCTCGCCCTCCCAC 

TCTACCACATGTCCGGTTCCCTGCTGGGCATCGTGGGCTGCATGGGCATTGGGGCCACAGTG 

GTGCTGAAATCCAAGTTCTCGGCTGGTCAGTTCTGGGAAGATTGCCAGCAGCACAGGGTGAC 

GGTGTTCCAGTACATTGGGGAGCTGTGCCGATACCTTGTCAACCAGCCCCCGAGCAAGGCAG 

AACGTGGCCATAAGGTCCGGCTGGCAGTGGGCAGCGGGCTGCGCCCAGATACCTGGGAGCGT 

TTTGTGCGGCGCTTCGGGCCCCTGCAGGTGCTGGAGACATATGGACTGACAGAGGGCAACGT 

GGCCACCATCAACTACACAGGACAGCGGGGCGCTGTGGGGCGTGCTTCCTGGCTTTACAAGC 

ATATCTTCCCCTTCTCCTTGATTCGCTATGATGTCACCACAGGAGAGCCAATTCGGGACCCC 

CAGGGGCACTGTATGGCCACATCTCCAGGTGAGCCAGGGCTGCTGGTGGCCCCGGTAAGCCA 

GCAGTCCCCATTCCTGGGCTATGCTGGCGGGCCAGAGCTGGCCCAGGGGAAGTTGCTAAAGG 

ATGTCTTCCGGCCTGGGGATGTTTTCTTCAACACTGGGGACCTGCTGGTCTGCGATGACCAA 

GGTTTTCTCCGCTTCCATGATCGTACTGGAGACACCTTCAGGTGGAAGGGGGAGAATGTGGC 

CACAACCGAGGTGGCAGAGGTCTTCGAGGCCCTAGATTTTCTTCAGGAGGTGAACGTCTATG 

GAGTCACTGTGCCAGGGCATGAAGGCAGGGCTGGAATGGCAGCCCTAGTTCTGCGTCCCCCC 

CACGCTTTGGACCTTATGCAGCTCTACACCCACGTGTCTGAGAACTTGCCACCTTATGCCCG 

GCCCCGATTCCTCAGGCTCCAGGAGTCTTTGGCCACCACAGAGACCTTCAAACAGCAGAAAG 

TTCGGATGGCAAATGAGGGCTTCGACCCCAGCACCCTGTCTGACCCACTGTACGTTCTGGAC 

CAGGCTGTAGGTGCCTACCTGCCCCTCACAACTGCCCGGTACAGCGCCCTCCTGGCAGGAAA 

CCTTCGAATCIGAGAACTTCCACACCTGAGGCACCTGAGAGAGGAACTCTGTGGGGTGGGGG 

CCGTTGCAGGTGTACTGGGCTGTCAGGGATCTTTTCTATACCAGAACTGCGGTCACTATTTT 

GTAATAAATGTGGCTGGAGCTGATCCAGCTGTCTCTGACCTAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAGGGCGGCCGCGACTCTAGAGTCGACCTGCAGTAGGGATAACAGGGTAATAAGC 

TTGGCCGCCATGGCCCAACTTGTTTATTGCAG 
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FIGURE 39 

></usr/seqdb2/sst/DNA/Dnaseqs . min/ss . DNA50913 
xsubunit 1 of 1, 730 aa, 1 stop 
xMW: 78644, pi: 7.65, NX{S/T) : 2 

MGVCQRTRAPWKEKSQLERAALGFRKGGSGMFASGWNQTVPIEEAGSMAALLLLPLLLLLPL 
LLLKLHLWPQLRWLPADIAFAVRALCCKRALRARAIAAAAADPEGPEGGCSLAWRLAELAQQ 
RAAHTFL I HGSRRFS YS EAERESNRAARAFLRALGWDWGPDGGDSGEGSAGEGERAAPGAGD 
AAAGSGAEFAGGDGAARGGGAAAPLSPGATVALLLPAGPEFLWLWFGLAKAGLRTAFVPTAL 
RRGPLLHCLRSCGARALVLAPEFLESLEPDLPALRAMGLHLWAAGPGTHPAGISDLLAEVSA 
EVDGPVPGYLSSPQSITDTCLYIFTSGTTGLPKAARISHLKILQCQGFYQLCGVHQEDVIYL 
ALPLYHMSGSLLGIVGCMGIGATWLKSKFSAGQFWEDCQQHRVTVFQYIGELCRYLVNQPP 
SKAERGHKVRLAVGSGLRPDTWERFVRRFGPLQVLETYGLTEGNVATINYTGQRGAVGRASW 
LYKHIFPFSLI R YD VTTGE P I RDPQGHCMATSPGEPGLLVAPVSQQS PFLGYAGGPELAQGK 
LLKDVFRPGDVFFNTGDLLVCDDQGFLRFHDRTGDTFRWKGENVATTEVAEVFEALDFLQEV 
NVYGVTVPGHEGRAGMAALVLRPPHALDLMQLYTHVSENLPPYARPRFLRLQESLATTETFK 
QQKVRMANEGFDPSTLSDPLYVLDQAVGAYLPLTTARYSALLAGNLRI 

Signal peptide: 
aa 1-42 

cAMP- and cGMP-dependent protein kinase phosphorylation site 
starting at aa 136 

CUB domain protein motif 
aa 254-261 

putative AMP-binding dOT»3.n siganture 
aa 332-343 

N^glycosylation sites 

aa 37-40 and 483-486 
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FIGURE 40 



CCTGTGTTAAGCTGAGGTTTCCCCTAGATCTCGTATA7CCCCAACACATACCTCCACGCACA 

CACATCCCCAAGAACCTCGAGCTCACACCAACAGACACACGCGCGCATACACACTCGCTCTC 

GCTTGTCCATCTCCCTCCCGGGGGAGCCGGCGCGCGCTCCCACCTTTGCCGCACACTCCGGC 

GAGCCGAGCCCGCAGCGCTCCAGGATTCTGCGGCTCGGAACTCGGATTGCAGCTCTGAACCC 

CCATGGTGGTTTTTTAAACACTTCTTTTCCTTCTCTTCCTCGTTTTGATTGCACCGTTTCCA 

TCTGGGGGCTAGAGGAGCAAGGCAGCAGCCTTCCCAGCCAGCCCTTGTTGGCTTGCCATCGT 

CCATCTGGCTTATAAAAGTTTGCTGAGCGCAGTCCAGAGGGCTGCGCTGCTCGTCCCCTCGG 

CTGGCAGAAGGGGGTGACGCTGGGCAGCGGCGAGGAGCGCGCCGCTGCCTCTGGCGGGCTTT 

CGGCTTGAGGGGCAAGGTGAAGAGCGCACCGGCCGTGGGGTTTACCGAGCTGGATTTGTATG 

TTGCACC&ISCCTTCTTGGATCGGGGCTGTGATTCTTCCCCTCTTGGGGCTGCTGCTCTCCC 

TCCCCGCCGGGGCGGATGTGAAGGCTCGGAGCTGCGGAGAGGTCCGCCAGGCGTACGGTGCC 

AAGGGATTCAGCCTGGCGGACATCCCCTACCAGGAGATCGCAGGGGAACACTTAAGAATCTG 

TCCTCAGGAATATACATGCTGCACCACAGAAATGGAAGACAAGTTAAGCCAACAAAGCAAAC 

TCGAATTTGAAAACCTTGTGGAAGAGACAAGCCATTTTGTGCGCACCACTTTTGTGTCCAGG 

CATAAGAAATTTGACGAATTTTTCCGAGAGCTCCTGGAGAATGCAGAAAAGTCACTAAATGA 

TATGTTTGTACGGACCTATGGCATGCTGTACATGCAGAATTCAGAAGTCTTCCAGGACCTCT 

TCACAGAGCTGAAAAGGTACTACACTGGGGGTAATGTGAATCTGGAGGAAATGCTCAATGAC 

TTTTGGGCTCGGCTCCTGGAACGGATGTTTCAGCTGATAAACCCTCAGTATCACTTCAGTGA 

AGACTACCTGGAATGTGTGAGCAAATACACTGACCAGCTCAAGCCATTTGGAGACGTGCCCC 

GGAAACTGAAGATTCAGGTTACCCGCGCCTTCATTGCTGCCAGGACCTTTGTCCAGGGGCTG 

ACTGTGGGCAGAGAAGTTGCAAACCGAGTTTCCAAGGTCAGCCCAACCCCAGGGTGTATCCG 

TGCCCTCATGAAGATGCTGTACTGCCCATACTGTCGGGGGCTTCCCACTGTGAGGCCCTGCA 

ACAACTACTGTCTCAACGTCATGAAGGGCTGCTTGGCAAATCAGGCTQACCTCGACACAGAG 

TGGAATCTGTTTATAGATGCAATGCTCTTGGTGGCAGAGCGACTGGAGGGGCCATTCAACAT 

TGAGTCGGTCATGGACCCGATAGATGTCAAGATTTCTGAAGCCATTATGAACATGCAAGAAA 

ACAGCATGCAGGTGTCTGCAAAGGTCTTTCAGGGATGTGGTCAGCCCAAACCTGCTCCAGCC 

CTCAGATCTGCCCGCTCAGCTCCTGAAAATTTTAATACACGTTTCAGGCCCTACAATCCTGA 

GGAAAGACCAACAACTGCTGCAGGCACAAGCTTGGACCGGCTGGTCACAGACATAAAAGAGA 

AATTGAAGCTCTCTAAAAAGGTCTGGTCAGCATTACCCTACACTATCTGCAAGGACGAGAGC 

GTGACAGCGGGCACGTCCAACGAGGAGGAATGCTGGAACGGGCACAGCAAAGCCAGATACTT 

GCCTGAGATCATGAATGATGGGCTCACCAACCAGATCAACAATCCCGAGGTGGATGTGGACA 

TCACTCGGCCTGACACTTTCATCAGACAGCAGATTATGGCTCTCCGTGTGATGACCAACAAA 

CTAAAAAACGCCTACAATGGCAATGATGTCAATTTCCAGGACACAAGTGATGAATCCAGTGG 

CTCAGGGAGTGGCAGTGGGTGCATGGATGACGTGTGTCCCACGGAGTTTGAGTTTGTCACCA 

CAGAGGCCCCCGCAGTGGATCCCGACCGGAGAGAGGTGGACTCTTCTGCAGCCCAGCGTGGC 

CACTCCCTGCTCTCCTGGTCTCTCACCTGC7VTTGTCCTGGCACTGCAGAGACTGTGCAGAI& 

ATCTTGGGTTTTTGGTCAGATGAAACTGCATTTTAGCTATCTGAATGGCCAACTCACTTCTT 

TTCTTACACTCTTGGACAATGGACCATGCCACAAAAACTTACCGTTTTCTATGAGAAGAGAG 

CAGTAATGCAATCTGCCTCCCTTTTTGTTTTCCCAAAGAGTACCGGGTGCCAGACTGAACTG 

CTTCCTCTTTCCTTCAGCTATCTGTGGGGACCTTGTTTATTCTAGAGAGAATTCTTACTCAA 

ATTTTTCGTACCAGGAGATTTTCTTACCTTCATTTGCTTTTATGCTGCAGAAGTAAAGGAAT 

CTCACGTTGTGAGGGTTTTTTTTTTCTCATTTAAAAT 
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FIGURE 41 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA50914 
xsubunit 1 of 1, 555 aa, 1 stop 
xMW: 62736, pi: 5.36. NX(S/T): 0 

MPSWIGAVILPLLGLLLSLPAGADVKARSCGEVRQAYGAKGFSLADIPYQEIAGEHLRICPQ 
EYTCCTTEMEDKLSQQSKLEFENLVEETSHFVRTTFVSRHKKFDEFFRELLENAEKSLNDMF 
VRTYGMLyMQNSEVFQDLFTELKRYYTGGNVNLEEMLNDFWARLLERMFQLINPQYHFSEDY 
LECVSKYTDQLKPFGDVPRKLKIQVTRAFIAARTFVQGLTVGREVANRVSKVSPTPGCIRAL 
MKMLYCPYCRGLPTVRPCNNYCLNVMKGCLANQADLDTEWNLFIDAMLLVAERLEGPPNIES 
VMDPIDVKISEAIMNMQENSMQVSAKVFQGCGQPKPAPALRSARSAPENFNTRFRPYNPEER 
PTTAAGTSLDRLVTDIKEKLKLSKKVWSALPYTICKDESVTAGTSNEEECWNGHSKARYLPE 
IMNDGLTNQINNPEVDVDITRPDTFIRQQIMALRVMTNKLKNAYNGNDVNFQDTSDESSGSG 
SGSGCMDDVCPTEFEFVTTEAPAVDPDRREVDSSAAQRGHSLLSWSLTCIVLALQRLCR 



BMSOOCID <W0 994&281A2J_> 
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FIGURE 42A 

CGGACGCGTGGGCGGACGCGTGGGCAAAAGAACTCGGAGTGCCAAAGCTAAATAAGTTAGCT 

GAGAAAACGCACGCAGTTTGCAGCGCCTGCGCCGGGTGCGCCAACTACGCAAAGACCAAGCG 

GGCTCCGCGCGGACCGGCCGCGGGGCTAGGGACCCGGCTTTGGCCTTCAGGCTCCCTAGCAG 

CGGGGAAAAGGAATTGCTGCCCGGAGTTTCTGCGGAGGTGGAGGGAGATCAGGAAACGGCTT 

CTTCCTCACTTCGCCGCCTGGTGAGTGTCGGGGAGATTGGCAAACGCCTAGGAAAGGACTGG 

GGAAAATAGCCCTGGGAAAGTGGAGAAGGTGATCAGGAGGCCGGTCCACTACGGCAGTTTAT 

CTGTCTGATCAGAGCCAGACGCGACGCGTCCACTTCGCAGTTCTTTCCAGGTGTGGGGACCG 

CAGGACAGACGGCCGATCCCGCCGCCCTCCGTACCAGCACTCCCAGGAGAGTCAGCCTCGCT 

CCCCAACGTCGAGGGCGCTCTGGCCACGAAAAGTTCCTGTCCACTGTGAT TCTCA ATTCCTT 

GCTTGGTTTTTTTCTCCAGAGAACTTTTGGGTGGAGATATTAACTTTTTTCTTTTTTTTTTT 

CCTTGGTGGAAGCTGCTCTAGGGAGGGGGGAGGAGGAGGAGAAAGTGAAATGTGCTGGAGAA 

GAGCGAGCCCTCCTTGTTCTTCCGGAGTCCCATCCATTAAGCCATCACTTCTGGAAGATTAA 

AGTTGTCGGACATGGTGACAGCTGAGAGGAGAGGAGGATTTCTTGCCAGGTGGAGAGTCTTC 

ACCGTCTGTTGGGTGCATGTGTGCGCCCGCAGCGGCGCGGGGCGCGTGGTTCTCCGCGTGGA 

GTCTCACCTGGGACCTGAGTGAAliSGCTCCCAGGGGCTGTGCGGGGCATCCGCCTCCGCCTT 

CTCCACAGGCCTGTGTCTGTCCTGGAAAGATGCTAGCAATGGGGGCGCTGGCAGGATTCTGG 

ATCCTCTGCCTCCTCACTTATGGTTACCTGTCCTGGGGCCAGGCCTTAGAAGAGGAGGAAGA 

AGGGGCCTTACTAGCTCAAGCTGGAGAGAAACTAGAGCCCAGCACAACTTCCACCTCCCAGC 

CCCATCTCATTTTCATCCTAGCGGATGATCAGGGATTTAGAGATGTGGGTTACCACGGATCT 

GAGATTAAAACACCTACTCTTGACAAGCTCGCTGCCGAAGGAGTTAAACTGGAGAACTACTA 

TGTCCAGCCTATTTGCACACCATCCAGGAGTCAGTTTATTACTGGAAAGTATCAGATACACA 

CCGGACTTCAACATTCTATCATAAGACCTACCCAACCCAACTGTTTACCTCTGGACAATGCC 

ACCCTACCTCAGAAACTGAAGGAGGTTGGATATTCAACGCATATGGTCGGAAAATGGCACTT 

GGGTTTTAACAGAAAAGAATGCATGCCCACCAGAAGAGGATTTGATACCTTTTTTGGTTCCC 

TTTTGGGAAGTGGGGATTACTATACACACTACAAATGTGACAGTCCTGGGATGTGTGGCTAT 

GACTTGTATGAAAACGACAATGCTGCCTGGGACTATGACAATGGCATATACTCCACACAGAT 

GTACACTCAGAGAGTACAGCAAATCTTAGCTTCCCATAACCCCACAAAGCCTATATTTTTAT 

ATACTGCCTATCAAGCTGTTCATTCACCACTGCAAGCTCCTGGCAGGTATTTCGAACACTAC 

CGATCCATTATCAACATAAACAGGAGAAGATATGCTGCCATGCTTTCCTGCTTAGATGAAGC 

AATCAACAACGTGACATTGGCTCTAAAGACTTATGGTTTCTATAACAACAGCATTATCATTT 

ACTCTTCAGATAATGGTGGCCAGCCTACGGCAGGAGGGAGTAACTGGCCTCTCAGAGGTAGC 

AAAGGAACATATTGGGAAGGAGGGATCCGGGCTGTAGGCTTTGTGCATAGCCCACTTCTGAA 

AAACAAGGGAACAGTGTGTAAGGAACTTGTGCACATCACTGACTGGTACCCCACTCTCATTT 

CACTGGCTGAAGGACAGATTGATGAGGACATTCAACTAGATGGCTATGATATCTGGGAGACC 

ATAAGTGAGGGTCTTCGCTCACCCCGAGTAGATATTTTGCATAACATTGACCCCTATACACC 

AAGGCAAAAAATGGCTCCTGGGCAGCAGGCTATGGGATCTGGAACACTGCAATCCAGTCAGC 

CATCAGAGTGCAGCACTGGAAATTGCTTACAGGAAATCCTGGCTACAGCGACTGGGTCCCCC 

CTCAGTCTTTCAGCAACCTGGGACCGAACCGGTGGCACAATGAACGGATCACCTTGTCAACT 

GGCAAAAGTGTATGGCTTTTCAACATCACAGCCGACCCATATGAGAGGGTGGACCTATCTAA 

CAGGTATCCAGGAATCGirS&AGAAGCTCCTACGGAGGCTCTCACAGTTCAACAAAACTGCAG 

TGCCGGTCAGGTATCCCCCCAAAGACCCCAGAAGTAACCCTAGGCTCAATGGAGGGGTCTGG 

GGACCATGGTATAAAGAGGAAACCAAGAAAAAGAAGCCAAGCAAAAATCAGGCTGAGAAAAA 

GCAAAAGAAAAGCAAAAAAAAGAAGAAGAAACAGCAGAAAGCAGTCTCAGGTAAACCAGCAA 

ATTTGGCTCGATAATATCGCTGGCCTAAGCGTCAGGCTTGTTTTCATGCTGTGCCACTCCAG 

AGACTTCTGCCACCTGGCCGCCACACTGAAAACTGTCCTGCTCAGTGCCAAGGTGCTACTCT 

TGCAAGCCACACTTAGAGAGAGTGGAGATGTTTATTTCTCTCGCTCCTTTAGAAAACGTGGT 

GAGTCCTGAGTTCCACTGCTGTGCTTCAGTCAACTGACCAAACACTGCTTTGAATTATAGGA 

GGAGAACAATAACCTACCATCCGCAAGCATGCTAATTTGATGGAAGTTACAGGGTAGCATGA 

TTAAAACTACCTTTGATAAATTACAGTCAAAGATTGTGTCACCTCAAAGGCCTTGAAGAATA 

TATTTTCTTGGTGAATTTTTGTATGTCTGTCATATGACACTTGGGTTTTTTAATTAATTCTA 

TTTTATATATATAAATATATGTTTCTTTTCCTGTGAAAAGCTGTTTTTCTCACATGTGAACA 

GCTTGCACCTCATTTTACCATGCGTGAGGGAATGGCAAATAAGAATGTTTGAGCACACTGCC 

CACAATGAATGTAACTATTTTCTAAACACTTTACTAGAAGAACATTTCAGTATAAAAAACCT 

AATTTATTTTTACAGAAAAATATTTTGTTGTTTTTATAAAAAGTTATGCAAATGACTTTTAT 

TTTTATTTCCTGCATACCATTAGAAGAATTTTATTTCATTTCTTCAAATTATCAAGCACTGT 

AATACTATAAATTAATGTAATACTGTGTGAATTCAGACTATAAAAAACATCATTCAGAAAAC 

TTTATAATCGTCATTGTTCAATCAAGATTTTGAATGTAATAAGATGAATATATTCCTTACAA 

BNSCXXID <W0 9946281A2J.> 
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FIGURE 42B 

ATTACTTGGAAATTCAATGTTTGTGCAGAGTTGAGACAACTTTATTGTTTCTATCATAAACT 
ATTTATGTATCTTAATTATTAAAATGATTTACTTTATGGCACTAGAAAATTTACTGTGGCTT 
TTCTGATCTAACTTCTAGCTAAAATTGTATCATTGGTCCTAAAAAATAAAAATCTTTACTAA 
TAGGCAATTGAAGGAATGGTTTGCTAACAACCACAGTAATATAATATGATTTTACAGATAGA 
TGCTTCCCCTTGGCTATGACATGGAGAAAGATTTTCCCATAATAATAACTAATATTTATATT 
AGGTTGGTGCAAAACTAGTTGCGGTTTTTCCCATTAAAAGTAATAACCTTACTCTTATACAA 
AGTGGACACTGTGGGGAGATACAGACAAATGGAAGATACGGATCCTGCCTGGAGTAGGTAAC 
CTTGCTTGGAAACCCCACATGCAAACGTCATGAGGAGAATTAAAGGAGTATTATCAGTAATG 
AAGTTTATCATGGGTCATCAATGAGCATAGATTGGTGTGGATCCTGTAGACCCTGGTGTTTT 
CTTTGAAGTGCCCTCTCCTAATGCAGAGGCCTTGAAGCTTACAGTATACACTTGAAAAGTCA 
CAGATAGCTAGAATTATGATCTTTGAAGTTATAACTGTGATCTGAAAATGTGTGTGGTGGTA 
TGACAGCATACCATTAAATACATTTACATCACAGCTCAAAGGACTGTGATATAATCCATTTA 
TATCACAACTCAAAGGACTGTGATATAATCCATTTATATCACAGCTCACAGTTTCTGAAAAT 
GTATAAAAGAATCTATAATCTAGTACTGAAATTACTAAATTGGGTAAGATGATTTAAATGAT 
TTTAATTTTAACATTTTATTTCTAGAATATATGGCTCCATTTTATTTTATAGTGTAAAGTTG 
TATTTCCTAAAGTTTGTGTTTTGTCGACAGTATCTTTTAAATGAGTCTTAAAAATAAAGGCA 
TATTGTTCATGTTTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



43/^3? 
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FIGURE 43 



></usr/seqcib2/sst/DNA/Dnaseqs .iain/ss.DNA48296 
xsubunit 1 of 1, 515 aa, i stop 
XMW: 56885, pi: 6.49, NX(S/T): 5 

MAPRGCAGHPPPPSPQACVCPGKMLAMGALAGFWILCLLTYGYLSWGQALEEEEEGALLAQA 
GEKLEPSTTSTSQPHLIFILADDQGFRDVGYHGSEIKTPTLDKLAAEGVKLENYYVQPICTP 
SRSQFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFNRKEC 
MPTRRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQ 
ILASHNPTKPIFLYTAYQAVHSPLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLA 
LKTYGFYNNSIIIYSSDNGGQPTAGGSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCK 
ELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSPRVDILHNIDPYTPRQKMAPG 
QQAMGSGTLQSSQPSECSTGNCLQEILATATGSPLSLSATWDRTGGTMNGSPCQLAKVYGFS 
TSQPTHMRGWTYLTGIQES 

Important Features: 
Signal Peptide: 
amino acids 1-37 

Sulfatases signature 1. 

amino acids 120-132 

Sulfatases signature 2. 

amino acids 168-177 

Tyrosine kinase phosphorylation site. 

amino acids 163-169 

N-glycosylation sites. 

amino acids 157-160, 306-309 and 318-321 




wo 99/46281 




PCT/US99/05028 



FIGURE 44 

CGGACGCGTGGGTGCGAGTGGAGCGGAGGACCCGAGCGGCTGAGGAGAGAGGAGGCGGCGGC 

TTAGCTGCTACGGGGTCCGGCCGGCGCCCTCCCGAGGGGGGCTCAGGAGGAGGAAGGAGGAC 

CCGTGCGAGAMfiCCTCTGCCCTGGAGCCTTGCGCTCCCGCTGCTGCTCTCCTGGGTGGCAG 

GTGGTTTCGGGAACGCGGCCAGTGCAAGGCATCACGGG7TGTTAGCATCGGCACGTCAGCCT 

GGGGTCTGTCACTATGGAACTAAACTGGCCTGCTGCTACGGCTGGAGAAGAAACAGCAAGGG 

AGTCTGTGAAGCTACATGCGAACCTGGATGTAAGTTTGGTGAGTGCGTGGGACCAAACAAAT 

GCAGATGCTTTCCAGGATACACCGGGAAAACCTGCAGTCAAGATGTGAATGAGTGTGGAATG 

AAACCCCGGCCATGCCAACACAGATGTGTGAATACACACGGAAGCTACAAGTGCTTTTGCCT 

CAGTGGCCACATGCTCATGCCAGATGCTACGTGTGTGAACTCTAGGACATGTGCCATGATAA 

ACTGTCAGTACAGCTGTGAAGACACAGAAGAAGGGCCACAGTGCCTGTGTCCATCCTCAGGA 

CTCCGCCTGGCCCCAAATGGAAGAGACTGTCTAGATATTGATGAATGTGCCTCTGGTAAAGT 

CATCTGTCCCTACAATCGAAGATGTGTGAACACATTTGGAAGCTACTACTGCAAATGTCACA 

TTGGTTTCGAACTGCAATATATCAGTGGACGATATGACTGTATAGATATAAATGAATGTACT 

ATGGATAGCCATACGTGCAGCCACCATGCCAATTGCTTCAATACCCAAGGGTCCTTCAAGTG 

TAAATGCAAGCAGGGATATAAAGGCAATGGACTTCGGTGTTCTGCTATCCCTGAAAATTCTG 

TGAAGGAAGTCCTCAGAGCACCTGGTACCATCAAAGACAGAATCAAGAAGTTGCTTGCTCAC 

AAAAACAGCATGAAAAAGAAGGCAAAAATTAAAAATG7TACCCCAGAACCCACCAGGACTCC 

TACCCCTAAGGTGAACTTGCAGCCCTTCAACTATGAAGAGATAGTTTCCAGAGGCGGGAACT 

CTCATGGAGGTAAAAAAGGGAATGAAGAGAAAISaAAGAGGGGCTTGAGGATGAGAAAAGAG 

AAGAGAAAGCCCTGAAGAATGACATAGAGGAGCGAAGCCTGCGAGGAGATGTGTTTTTCCCT 

AAGGTGAATGAAGCAGGTGAATTCGGCCTGATTCTGGTCCAAAGGAAAGCGCTAACTTCCAA 

ACTGGAACATAAAGATTTAAATATCTCGGTTGACTGCAGCTTCAATCATGGGATCTGTGACT 

GGAAACAGGATAGAGAAGATGATTTTGACTGGAATCCTGCTGATCGAGATAATGCTATTGGC 

TTCTATATGGCAGTTCCGGCCTTGGCAGGTCACAAGAAAGACATTGGCCGATTGAAACTTCT 

CCTACCTGACCTGCAACCCCAAAGCAACTTCTGTTTGCTCTTTGATTACCGGCTGGCCGGAG 

ACAAAGTCGGGAAACTTCGAGTGTTTGTGAAAAACAGTAACAATGCCCTGGCATGGGAGAAG 

ACCACGAGTGAGGATGAAAAGTGGAAGACAGGGAAAATTCAGTTGTATCAAGGAACTGATGC 

TACCAAAAGCATCATTTTTGAAGCAGAACGTGGCAAGGGCAAAACCGGCGAAATCGCAGTGG 

ATGGCGTCTTGCTTGTTTCAGGCTTATGTCCAGATAGCCTTTTATCTGTGGATGACTGAATG 

TTACTATCTTTATATTTGACTTTGTATGTCAGTTCCCTGGTTTTTTTGATATTGCATCATAG 

GACCTCTGGCATTTTAGAATTACTAGCTGAAAAATTGTAATGTACCAACAGAAATATTATTG 

TAAGATGCCTTTCTTGTATAAGATATGCCAATATTTGCTTTAAATATCATATCACTGTATCT 

TCTCAGTCATTTCTGAATCTTTCCNCATTATATTATAAAATNTGGAAANGTCAGTTTATCTC 

CCCTCCTCNGTATATCTGATTTGTATANGTANGTTGATGNGCTTCTCTCTACAACATTTCTA 

GAAAATAGAAAAAAAAGCACAGAGAAATGTTTAACTGTTTGACTCTTATGATACTTCTTGGA 

AACTATGACATCAAAGATAGACTTTTGCCTAAGTGGCTTAGCTGGGTCTTTCATAGCCAAAC 

TTGTATATTTAATTCTTTGTAATAATAA 
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FIGURE 4B 

MPLPWSLALPLLLSWVAGGFGNAASARHHGLLASARQPGVCHYGTKLACCYGVIRRNSKGVCE 
ATCEPGCKFGECVGPNKCRCFPGYTGKTCSQDVNECGMKPRPCQHRCVNTHGSYKCFCLSGH 
MLMPDATCVNSRTCAMINCQYSCEDTEEGPQCLCPSSGLRLAPNGRDCLDIDECASGKVICP 
YNRRCVNTFGSYYCKCHIGFELQYISGRYDCIDINECTMDSHTCSHHANCFNTQGSFKCKCK 
QGYKGNGLRCSAIPENSVKEVLRAPGTIKDRIKKLLAHKNSMKKKAKIKNVTPEPTRTPTPK 
VNLQPFNYEEIVSRGGNSHGGKKGNEEK 

Signal peptide: 

amino acids 1-21 

EGF-like domain cysteine pattern signature, 
amino acids 80-91 

Calcixim-binding EGF-like domains 

amino acids 103-124, 230-251 and 185-206 
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FIGURE 46 

GGGAGCTGCTGCTGTGGCTGCTGGTGCTGTGCGCGCTGCTCCTGCTCTTGGTGCAGCTGCTG 

CGCTTCCTGAGGGCTGACGGCGACCTGACGCTACTATGGGCCGAGTGGCAGGGACGACGCCC 

AGAATGGGAGCTGACTGATIOSGTGGTGTGGGTGACTGGAGCCTCGAGTGGAATTGGTGAGG 

AGCTGGCTTACCAGTTGTCTAAACTAGGAGTTTCTCTTGTGCTGTCAGCCAGAAGAGTGCAT 

GAGCTGGAAAGGGTGAAAAGAAGATGCCTAGAGAATGGCAATTTAAAAGAAAAAGATATACT 

TGTTTTGCCCCTTGACCTGACCGACACTGGTTCCCATGAAGCGGCTACCAAAGCTGTTCTCC 

AGGAGTTTGGTAGAATCGACATTCTGGTCAACAATGGTGGAATGTCCCAGCGTTCTCTGTGC 

ATGGATACCAGCTTGGATGTCTACAGAAAGCTAATAGAGCTTAACTACTTAGGGACGGTGTC 

CTTGACAAAATGTGTTCTGCCTCACATGATCGAGAGGAAGCAAGGAAAGATTGTTACTGTGA 

ATAGCATCCTGGGTATCATATCTGTACCTCTTTCCATTGGATACTGTGCTAGCAAGCATGCT 

CTCCGGGGTTTTTTTAATGGCCTTCGAACAGAACTTGCCACATACCCAGGTATAATAGTTTC 

TAACATTTGCCCAGGACCTGTGCAATCAAATATTGTGGAGAATTCCCTAGCTGGAGAAGTCA 

CAAAGACTATAGGCAATAATGGAGACCAGTCCCACAAGATGACAACCAGTCGTTGTGTGCGG 

CTGATGTTAATCAGCATGGCCAATGATTTGAAAGAAGTTTGGATCTCAGAACAACCTTTCTT 

GTTAGTAACATATTTGTGGCAATACATGCCAACCTGGGCCTGGTGGATAACCAACAAGATGG 

GGAAGAAAAGGATTGAGAACTTTAAGAGTGGTGTGGATGCAGACTCTTCTTATTTTAAAATC 

TTTAAGACAAAACAISACTGAAAAGAGCACCTGTACTTTTCAAGCCACTGGAGGGAGAAATG 

GAAAACATGAAAACAGCAATCTTCTTATGCTTCTGAATAATCAAAGACTAATTTGTGATTTT 

ACTTTTTAATAGATATGACTTTGCTTCCAACATGGAATGAAATAAAAAATAAATAATAAAAG 
ATTGCCATGAATCTTGCAAAA 
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></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA36343 
xsubunit 1 of 1, 289 aa, 1 stop 
xMW: 32268, p.1 : 9.21, NX{S/T): 0 

MVVWVTGASSGIGEELAYQLSKLGVSLVLSARRVHELERVKRRCLENGNLKEKDILVLPLDL 

TDTGSHEAATKAVLQEFGRIDILVNNGGMSQRSLCMDTSLDVYRKLIELNYLGTVSLTKC^ 

PHMIERKQGKIVTVNSILGIISVPLSIGYCASKHALRGFFNGLRTELATYPGIIVSNICPGP 

VQSNIVENSLAGEVTKTIGNNGDQSHKMTTSRCVRLMLISMANDLKEWISEQPFL^ 

QYMPTWAWWITNKMGKKRIENFKSGVDADSSYFKIFKTKHD 

Isqportant Features: 
Signal Peptide: 

amino acids 1-31 

Transmembrane domain: 
amino acids 136-157 

Tyrosine kinase phosphozylation site. 
106-113 and 107-114 

Homologous region to Short -chain alcohol dehydrogenase 

amino acids 80-90, 131-168, 1-13 and 176-185 




BNSDOCID- <W0 9946281 A2J_> 



wo 99/46281 PCT/US99/05028 

FIGURE 4a 

TG&ISCTGCTGAGCTCCCTGGTGTCTCTCGCTGGTTCTGTCTACCTGGCCTGGATCCTGTTC 

TTCGTGCTCTATGATTTCTGCATTGTTTGTATCACCACCTATGCTATCAACGTGAGCCTGAT 

?JSSJIS?^^^^^^'^^^^'^^^^^G^CCCCAGGGCAAGGCTAAGAGGCACTGAGCCCTCA 

?5?J?^2S™^^'^^^'^^'^^^'^^^^^'^'r'^^^^T'rGGTCTTCAAGCCGCTCAGCGTGCCTGTG 

S™J ^^^'^^^^^'^^^^^^^^^T^GGAGGGCAACACAGTCCCTGGCGAGTGGCCC 

JSSS^S^^^^^^^'^^^^^^^^^GGAGCCCACATCTGCAGCGGCTCCCTGGTGGCAGACAC 

CTGGGTCCTCACTGCTGCCCACTGCTTTGAAMGGCAGCAGCAACAGAACTGAATTCCTGGT 

^JS!^?I$^J?^^""CTGCAGCGTGAGGGACTCAGCCCTGGGGCCGAAGAGGTGGGGGTG 

JSISS^SISS^^^'^^^^^^^^^^^^^T^^^^^'^A^GCCAGGGCTCAGACCTGGCCCTGCT 

GCAGCTCGCCCACCCCACGACCCACACACCCCTCTGCCTGCCCCAGCCCGCCCATCGCTTCC 

^^IJ^JSSS^^^^^^^^^^^^'^^^^'^GGCTGGGATCAGGACACCAGTGATGCTCCTGGGACC 

CTACGCAATCTGCGCCTGCGTCTCATCAGTCGCCCCACATGTAACTGTATCTACAACCAGCT 

GCACCAGCGACACCTGTCCAACCCGGCCCGGCCTGGGATGCTATGTGGGGGCCCCCAGCCTG 

GGGTGCAGGGCCCCTGTCAGGGAGATTCCGGGGGCCCTGTGCTGTGCCTCGAGCCTGACGGA 

E^SISSSr^^^^^^^^^-^^^^^^A^^TTTGCATCAAGCTGTGCCCAGGAGGACGCTCCTGT 

SSJS^IS^^^^CACAGCTGCTCACAGTTCCTGGCTGCAGGCTCGAGTTCAGGGGGCAGCTT 

^^'^^^^^'^^^^^^^^^^C^GAGATGAGTGATGAGGACAGCTGTGTAGCCTGTGGA 

JSSJJS^SJ^^'^^^^^^^^^^^^^^^'^GG^^C^CCCTCCCCATGGCCCTGGGAGGCCAGGCT 

r„i5^^^^^^^^^'^A^<^TGGCCTGTGGCGGAGCCCTGGTGTCAGAGGAGGCGGTGCTAACTG 

CTGCCCACTGCTTCATTGGGCGCCAGGCCCCAGAGGAATGGAGCGTAGGGCTGGGGACCAGA 

'SS?S^SS?^^^^^^^^^°^^*^^^^^^^T^^'r^C-^TGGAGCCTACACCCACCCTGAGGGGGG 

CTACGACATGGCCCTCCTGCTGCTGGCCCAGCCTGTGACACTGGGAGCCAGCCTGCGGCCCC 

TCTGCCTGCCCTATCCTGACCACCACCTGCCTGATGGGGAGCGTGGCTGGGTTCTGGGACGG 

GCCCGCCCAGGAGCAGGCATCAGCTCCCTCCAGACAGTGCCCGTGACCCTCCTGGGGCCTAG 

GGCCTGCAGCCGGCTGCATGCAGCTCCTGGGGGTGATGGCAGCCCTATTCTGCCGGGGATGG 

TGTGTACCAGTGCTGTGGGTGAGCTGCCCAGCTGTGAGGGCCTGTCTGGGGCACCACTGGTG 

™S5*^^^°^°^^^^^^A'^^GTTCCTGGCCGGGCTGCACAGCTTCGGAGATGCTTGCCAAGG 

CCCCGCCAGGCCGGCGGTCTTCACCGCGCTCCCTGCCTATGAGGACTGGGTCAGCAGTTTGG 

ACTGGCAGGTCTACTTCGCCGAGGAACCAGAGCCCGAGGCTGAGCCTGGAAGCTGCCTGGCC 

AACATAAGCCAACCAACCAGCTGCUS&CAGGGGACCTGGCCATTCTCAGGACAAGAGAATGC 

AGGCAGGCAAATGGCATTACTGCCCCTGTCCTCCCCACCCTGTCATGTGTGATTCCAGGCAC 

CAGGGCAGGCCCAGAAGCCCAGCAGCTGTGGGAAGGAACCTGCCTGGGGCCACAGGTGCCCA 

CTCCCCACCCTGCAGGACAGGGGTGTCTGTGGACACTCCCACACCCAACTCTGCTACCAAGC 

AGGCGTCTCAGCTTTCCTCCTCCTTTACTCTTTCAGATACAATCACGCCAGCCACGTTGTTT 

TGAAAATTTCTTTTTTTGGGGGGCAGCAGTTTTCCTTTTTTTAAACTTAAATAAATTGTTAC 
AAAATAAAA 



43/-23^ 
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FIGURE 49 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA40571 

MLLSSLVSLAGSVYLAWILFFVLYDFCIVCITTYAINVSLMWLSFRKVQEPQGKAKRHG^ 
PGEWPWQASVRRQGAH I CSGSLVADTWVLTAAHCFEKAAATELNSWSWLGSLQREGLSPGA 
EEVGVAALQLPRAYNHYSQGSDLALLQIJUIPTTHTPLCLPQPAHRFPFGASCWATGWDQDTS 
DAPGTLRNLRLRLISRPTCNCIYNQLHQRHLSNPARPGMLCGGPQPGVQGPCQGDSGGPVLC 
LEPDGHWVQAGIISFASSCAQEDAPVLLTNTAAHSSWLQARVQGAAFLAQSPETPEMSDEDS 
CVACGSLRTAGPQAGAPSPWPWEARLMHQGQLACGGALVSEEAVLTAAHCFIGRQAPEEWSV 
GLGTRPEEWGLKQL I LHGA YTHPEGGYDMALLLLAQPVTLGASLRPLCLPYPDHHLPDGERG 
WVLGRARPGAGISSLQTVPVTLLGPRACSRLHAAPGGDGSPILPGMVCTSAVGELPSCEGLS 
GAPLVHEVRGTWFLAGLHSFGDACQGPARPAVFTALPAYEDWVSSLDWQVYFAEEPEPEAEP 
GSCLANISQPTSC 

Important features: 
Signal peptide: 

amino acids 1-15 

Homologous region to Serine proteases, trypsin family 
amino acids 79-95, 343-359 and 237-247 

N*glyco8ylation sites. 

amino acids 37-40 and 564-567 

Kr ingle domains 

amino acids 79-96, 343-360 and 235-247 
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FIGURE 50 

CGGGCCGCCCCCGGCCCCCATTCGGGCCGGGCCTCGCTGCGGCGGCGACTGAGCCAGGCTGG 

GCCGCGTCCCTGAGTCCCAGAGTCGGCGCGGCGCGGCAGGGGCAGCCTTCCACCACGGGGAG 

CCCAGCTGTCAGCCGCCTCACAGGAAGAISCTGCGTCGGCGGGGCAGCCCTGGCATGGGTGT 

GCATGTGGGTGCAGCCCTGGGAGCACTGTGGTTCTGCCTCACAGGAGCCCTGGAGGTCCAGG 

TCCCTGAAGACCCAGTGGTGGCACTGGTGGGCACCGATGCCACCCTGTGCTGCTCCTTCTCC 

CCTGAGCCTGGCTTCAGCCTGGCACAGCTCAACCTCATCTGGCAGCTGACAGATACCAAACA 

GCTGGTGCACAGCTTTGCTGAGGGCCAGGACCAGGGCAGCGCCTATGCCAACCGCACGGCCC 

TCTTCCCGGACCTGCTGGCACAGGGCAACGCATCCCTGAGGCTGCAGCGCGTGCGTGTGGCG 

GACGAGGGCAGCTTCACCTGCTTCGTGAGCATCCGGGATTTCGGCAGCGCTGCCGTCAGCCT 

GCAGGTGGCCGCTCCCTACTCGAAGCCCAGCATGACCCTGGAGCCCAACAAGGACCTGCGGC 

CAGGGGACACGGTGACCATCACGTGCTCCAGCTACCAGGGCTACCCTGAGGCTGAGGTGTTC 

TGGCAGGATGGGCAGGGTGTGCCCCTGACTGGCAACGTGACCACGTCGCAGATGGCCAACGA 

GCAGGGCTTGTTTGATGTGCACAGCGTCCTGCGGGTGGTGCTGGGTGCGAATGGCACCTACA 

GCTGCCTGGTGCGCAACCCCGTGCTGCAGCAGGATGCGCACRGCTCTGTCACCATCACAGGG 

CAGCCTATGACATTCCCCCCAGAGGCCCTGTGGGTGACCGTGGGGCTGTCTGTCTGTCTCAT 

TGCACTGCTGGTGGCCCTGGCTTTCGTGTGCTGGAGAAAGATCAAACAGAGCTGTGAGGAGG 

AGAATGCAGGAGCTGAGGACCAGGATGGGGAGGGAGAAGGCTCCAAGACAGCCCTGCAGCCT 

CTGAAACACTCTGACAGCAAAGAAGATGATGGACAAGAAATAGCCISaCCATGAGGACCAGG 

GAGCTGCTACCCCTCCCTACAGCTCCTACCCTCTGGCTGCAATGGGGCTGCACTGTGAGCCC 

TGCCCCCAACAGATGCATCCTGCTCTGACAGGTGGGCTCCTTCTCCAAAGGATGCGATACAC 

AGACCACTGTGCAGCCTTATTTCTCCAATGGACATGATTCCCAAGTCATCCTGCTGCCTTTT 

TTCTTATAGACACAATGAACAGACCACCCACAACCTTAGTTCTCTAAGTCATCCTGCCTGCT 

GCCTTATTTCACAGTACATACATTTCTTAGGGACACAGTACACTGACCACATCACCACCCTC 

TTCTTCCAGTGCTGCGTGGACCATCTGGCTGCCTTTTTTCTCCAAAAGATGCAATATTCAGA 

CTGACTGACCCCCTGCCTTATTTCACCAAAGACACGATGCATAGTCACCCCGGCCTTGTTTC 

TCCAATGGCCGTGATACACTAGTGATCATGTTCAGCCCTGCTTCCACCTGCATAGAATCTTT 

TCTTCTCAGACAGGGACAGTGCGGCCTCAACATCTCCTGGAGTCTAGAAGCTGTTTCCTTTC 

CCCTCCTTCCTCCCTGCCCCAAGTGAAGACAGGGCAGGGCCAGGAATGCTTTGGGGACACCG 

AGGGGACTGCCCCCCACCCCCACCATGGTGCTATTCTGGGGCTGGGGCAGTCTTTTCCTGGC 

TTGCCTCTGGCCAGCTCCTGGCCTCTGGTAGAGTGAGACTTCAGACGTTCTGATGCCTTCCG 

GATGTCATCTCTCCCTGCCCCAGGAATGGAAGATGTGAGGACTTCTAATTTAAATGTGGGAC 

TCGGAGGGATTTTGTAAACTGGGGGTATATTTTGGGGAAAATAAATGTCTTTGTAAAAAAAA 
AAAAAAAAAAAAAA 
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FIGURE 51 



></usr/seqcib2/sst/DNA/Dnaseqs.min/ss .DNA41386 
xsubunit 1 of 1, 316 aa, 1 stop, 1 unknovm 
xMW: -1, pi: 4.62, NX(S/T): 4 

MLRRRGSPGMGVHVGAALGALWPCLTGALEVQVPEDPWALVGTDATLCCSFSPEPGFSLAQ 
LNLIWQLTDTKQLVHSFAEGQDQGSAYANRTALFPDLLAQGNASLRLQRVRVADEGSFTCFV 
SIRDFGSAAVSLQVAAPYSKPSMTLEPNKDLRPGDTVTITCSSYQGYPEAEVFWQDGQGVPL 
TGNVTTSQMANEQGLFDVHSVLRVVLGANGTYSCLVRNPVLQQDAHXSVTITGQPMTFPPE^ 
LWVTVGLSVCLIALLVALAFVCWRKIKQSCEEENAGAEDQDGEGEGSKTALQPLKHSDSKED 

DGQEIA 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 
amino acids 251-270 

N-glycosylation site. 

amino acids 91-94, 104-107, 189-192 and 215-218 

Homologous region to Immunoglobulins and MHC 
amino acids 217-234 
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TTCGTGACCCTTGAGAAAAGAGTTGGTGGTAAATGTGCCACGTCTTCTAAGAAGGGGGAGTC 

CTGAACTTGTCTGAAGCCCTTGTCCGTAAGCCTTGAACTACGTTCTTAAATCTATGAAGTCG 

AGGGACCTTTCGCTGCTTTTGTAGGGACTTCTTTCCTTGCTTCAGCAACAISAGGCTTTTCT 

TGTGGAACGCGGTCTTGACTCTGTTCGTCACTTCTTTGATTGGGGCTTTGATCCCTGAACCA 

GAAGTGAAAATTGAAGTTCTCCAGAAGCCATTCATCTGCCATCGCAAGACCAAAGGAGGGGA 

TTTGATGTTGGTCCACTATGAAGGCTACTTAGAAAAGGACGGCTCCTTATTTCACTCCACTC 

ACAAACATAACAATGGTCAGCCCATTTGGTTTACCCTGGGCATCCTGGAGGCTCTCAAAGGT 

TGGGACCAGGGCTTGAAAGGAATGTGTGTAGGAGAGAAGAGAAAGCTCATCATTCCTCCTGC 

TCTGGGCTATGGAAAAGAAGGAAAAGGTAAAATTCCCCCAGAAAGTACACTGATATTTAATA 

TTGATCTCCTGGAGATTCGAAATGGACCAAGATCCCATGAATCATTCCAAGAAATGGATCTT 

AATGATGACTGGAAACTCTCTAAAGATGAGGTTAAAGCATATTTAAAGAAGGAGTTTGAAAA 

ACATGGTGCGGTGGTGAATGAAAGTCATCATGATGCTTTGGTGGAGGATATTTTTGATAAAG 

AAGATGAAGACAAAGATGGGTTTATATCTGCCAGAGAATTTACATATAAACACGATGAGTTA 

lAfiAGATACATCTACCCTTTTAATATAGCACTCATCTTTCAAGAGAGGGCAGTCATCTTTAA 

AGAACATTTTATTTTTATACAATGTTCTTTCTTGCTTTGTTTTTTATTTTTATATATTTTTT 

CTGACTCCTATTTAAAGAACCCCTTAGGTTTCTAAGTACCCATTTCTTTCTGATAAGTTATT 

GGGAAGAAAAAGCTAATTGGTCTTTGAATAGAAGACTTCTGGACAATTTTTCACTTTCACAG 

ATATGAAGCTTTGTTTTACTTTCTCACTTATAAATTTAAAATGTTGCAACTGGGAATATACC 

ACGACATGAGACCAGGTTATAGCS^CAAATTAGCSICCCTATATTTCTGCTTCCCTCTATTTTC 

TCCAAGTTAGAGGTCAACATTTGAAAAGCCTTTTGCAATAGCCCAAGGCTTGCTATTTTCAT 

GTTATAATGAAATAGTTTATGTGTAACTGGCTCTGAGTCTCTGCTTGAGGACCAGAGGAAAA 

TGGTTGTTGGACCTGACTTGTTAATGGCTACTGCTTTACTAAGGAGATGTGCAATGCTGAAG 

TTAGAAACAAGGTTAATAGCCAGGCATGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAG 

GCTGAGGCGGGCGGATCACCTGAGGTTGGGAGTTCGAGACCAGCCTGACCAACACGQAGAAA 

CCCTATCTCTACTAAAAATACAAAGTAGCCCGGCGTGGTGATGCGTGCCTGTAATCCCAGCT 

ACCCAGGAAGGCTGAGGCGGCAGAATCACTTGAACCCGAGGCCGAGGTTGCGGTAAGCCGAG 

ATCACCTNCAGCCTGGACACTCTGTCTCGAAAAAAGAAAAGAACACGGTTAATACCATATNA 

ATATGTATGCATTGAGACATGCTACCTAGGACTTAAGCTGATGAAGCTTGGCTCCTAGTGAT 

TGGTGGCCTATTATGATAAATAGGACAAATCATTTATGTGTGAGTTTCTTTGTAATAAAATG 

TATCAATATGTTATAGATGAGGTAGAAAGTTATATTTATATTCAATATTTACTTCTTAAGGC 

TAGCGGAATATCCTTCCTGGTTCTTTAATGGGTAGTCTATAGTATATTATACTACAATAACA 

TTGTATCATAAGATAAAGTAGTAAACCAGTCTACATTTTCCCATTTCTGTCTCATCAAAAAC 

TGAAGTTAGCTGGGTGTGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGGGCCAAGGAGGG 

TGGATCACTTGAGATCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCTTGTCTCTA 

CTAAAAATACAAAAATTAGCCAGGCGTGGTGGTGCACACCTGTAGTCCCAGCTACTCGGGAG 

GCTGAGACAGGAGATTTGCTTGAACCCGGGAGGCGGAGGTTGCAGTGAGCCAAGATTGTGCC 

ACTGCACTCCAGCCTGGGTGACAGAGCAAGACTCCATCTCAAAAAAAAAAAAAAGAAGCAGA 
CCTACAGCAGCTACTATTGAATAAATACCTATCCTGGATTTT 
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></usr/seqdb2/ssc/DNA/Dnaseqs . min/ss . DNA44194 
xsubunit 1 of 1, 211 aa, 1 stop 
xMW: 24172, pi: 5.99, NX(S/T): 1 

MRLFLWNAVLTLFVTSLIGALIPEPEVKIEVLQKPFICHRKTKGGDLMLVHYEGYLEKDGSL 
FHSTHKHNNGQ P I WFTLG I LEALKGWDQGLKGMCVGEKRKL 1 1 PPALGYGKEGKGKI PPEST 
LIFNIDLLEIRNGPRSHESFQEMDLNDDWKLSKDEVKAYLKKEFEKHGAWNESHHDALVED 
I FDKEDEDKDGF I SAREFTYKHDEL 

Important features: 
Signal peptide: 

amino acids 1-20 

N-glycosylation site. 

amino acids 176-179 

Casein kinase II phosphorylation site. 

amino acids 143-146, 156-159, 178-181 and 200-203 

Endoplasmic reticulum targeting sequence. 

amino acids 208-211 

FKBP-type peptidyl -prolyl cis- trans isomerase 

amino acids 78-114 and 118-131 

EF-hand calcixim-binding domain. 

amino acids 191-203, 184-203 and 140-159 

S-lOO/ICaBP type calcium binding domain 
amino acids 183-203 
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FIGURE 54 

AATAAAGCTTCCTTAATGTTGTATATGTCTTTGAAGTACATCCGTGCATTTTTTTTTAGCAT 

CCAACCATTCCTCCCTTGTAGTTCTCGCCCCCTCAAATCACCCTCTCCCGTAGCCCACCCGA 

CTAACATCTCAGTCTCTGAAAAISCACAGAGATGCCTGGCTACCTCGCCCTGCCTTCAGCCT 

CACGGGGCTCAGTCTCTTTTTCTCTTTGGTGCCACCAGGACGGAGCATGGAGGTCACAGTAC 

CTGCCACCCTCAACGTCCTCAATGGCTCTGACGCCCGCCTGCCCTGCACCTTCAACTCCTGC 

TACACAGTGAACCACAAACAGTTCTCCCTGAACTGGACTTACCAGGAGTGCAACAACTGCTC 

TGAGGAGATGTTCCTCCAGTTCCGCATGAAGATCATTAACCTGAAGCTGGAGCGGTTTCAAG 

ACCGCGTGGAGTTCTCAGGGAACCCCAGCAAGTACGATGTGTCGGTGATGCTGAGAAACGTG 

CAGCCGGAGGATGAGGGGATTTACAACTGCTACATCATGAACGCCCCTGACCGCCACCGTGG 

CCATGGCAAGATCCATCTGCAGGTCCTCATGGAAGAGCCCCCTGAGCGGGACTCCACGGTGG 

CCGTGATTGTGGGTGCCTCCGTCGGGGGCTTCCTGGCTGTGGTCATCTTGGTGCTGATGGTG 

GTCAAGTGTGTGAGGAGAAAAAAAGAGCAGAAGCTGAGCACAGATGACCTGAAGACCGAGGA 

GGAGGGCAAGACGGACGGTGAAGGCAACCCGGATGATGGCGCCAAGl&STGGGTGGCCGGCC 

CTGCAGCCTCCCGTGTCCCGTCTCCTCCCCTCTCCGCCCTGTACAGTGACCCTGCCTGCTCG 

CTCTTGGTGTGCTTCCCGTGACCTAGGACCCCAGGGCCCACCTGGGGCCTCCTGAACCCCCG 

ACTTCGTATCTCCCACCCTGCACCAAGAGTGACCCACTCTCTTCCATCCGAGAAACCTGCCA 

TGCTCTGGGACGTGT6GGCCCTGGGGAGAGGAGAGAAAGGGCTCCCACCTGCCAGTCCCTGG 

GGGGAGGCAGGAGGCACATGTGAGGGTCCCCAGAGAGAAGGGAGTGGGTGGGCAGGGGTAGA 

GGAGGGGCCGCTGTCACCTGCCCAGTGCTTGCCTGGCAGTGGCTTCAGAGAGGACCTGGTGG 

GGAGGGAGGGCTTTCCTGTGCTGACAGCGCTCCCTCAGGAGGGCCTTGGCCTGGCACGGCTG 

TGCTCCTCCCCTGCTCCCAGCCCAGAGCAGCCATCAGGCTGGAGGTGACGATGAGTTCCTGA 

AACTTGGAGGGGCATGTTAAAGGGATGACTGTGCATTCCAGGGCACTGACGGAAAGCCAGGG 

CTGCAGGCAAAGCTGGACATGTGCCCTGGCCCAGGAGGCCATGTTGGGCCCTCGTTTCCATT 

GCTAGTGGCCTCCTTGGGGCTCCTGTTGGCTCCTAATCCCTTAGGACTGTGGATGAGGCCAG 

ACTGGAAGAGCAGCTCCAGGTAGGGGGCCATGTTTCCCAGCGGGGACCCACCAACAGAGGCC 

AGTTTCAAAGTCAGCTGAGGGGCTGAGGGGTGGGGCTCCATGGTGAATGCAGGTTGCTGCAG 

GCTCTGCCTTCTCCATGGGGTAACCACCCTCGCCTGGGCAGGGGCAGCCAAGGCTGGGAAAT 

GAGGAGGCCATGCACAGGGTGGGGCAGCTTTCTTTGGGGCTTCAGTGAGAACTCTCCCAGTT 

GCCCTTGGTGGGGTTTCCACCTGGCTTTTGGCTACAGAGAGGGAAGGGAAAGCCTGAGGCCG 

GCATAAGGGGAGGCCTTGGAACCTGAGCTGCCAATGCCAGCCCTGTCCCATCTGCGGCCACG 

CTACTCGCTCCTCTCCCAACAACTCCCTTCGTGGGGACAAAAGTGACAATTGTAGGCCAGGC 

ACAGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGCGGGTGGATTACCTCCAT 

CTGTTTAGTAGAAATGGGCAAAACCCCATCTCTACTAAAAATACAAGAATTAGCTGGGCGTG 

GTGGCGTGTGCCTGTAATCCCAGCTATTTGGGAGGCTGAGGCAGGAGAATCGCTTGAGCCCG 

GGAAGCAGAGGTTGCAGTGAACTGAGATAGTGATAGTGCCACTGCAATTCAGCCTGGGTGAC 
ATAGAGAGACTCCATCTCAAAAAAAA 
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FIGURE 55 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA45415 
<subunit 1 of 1, 215 aa, 1 scop 
<MW: 24326, pi: 6.32, NX(S/T) : 4 

MHRDAWLPRPAFSLTGLSLFFSLVPPGRSMEVTVPATLNVU^GSDARLPCTFNSCYTVNHKQ 
FSLNWTYQECNNCSEEMFLQFRMKIINLKLERFQDRVEFSGNPSKYDVSVMLRNVQPEDEGI 
YNCYIMNPPDRHRGHGKIHLQVLMEEPPERDSTVAVIVGASVGGFLAWILVLMWKCVR^ 
KEQKLSTDDLKTEEEGKTDGEGNPDDGAK 

Important features: 
Signal peptide: 

amino acids 1-20 

Transaiesibrane domain: 

amino acids 161-179 

Immunoglobulin- like fold : 
amino acids 83-127 

N-glycoaylation sites. 

amino acids 42-45, 66-69 and 74-77 
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FIGURE 56 



GTTGTATATGTCCTGAAGTACATCCGTGCATTTTTTTTAGCATCCAACCATCCTCCCTTGTA 

GTTCTCGCCCCCTCAAATCACCTTCTCCCTTAGCCCACCCNACTAACATCTCAGTCTCTGAA 

AATGCACAGAGATGCCTGGCTACCTCGCCCTGCCTTCAGCCTCACGGGGCTCAGTCTCTTTT 

TCTCTTTGGTGCCACCAGGACGGAGCATGGAGGTCCACAGTACCTGNCCACCCTCAACGTCC 

TCAATGGCTCTGACGCCCGCCTGCCCTGCCCTTCAACTCCTGCTACACAGTGAACCACAAAC 

AGTTCTCCCTGAACTGGACTTACCAGGAGTGCAACAACTGCTCTGAGGAGATGTTCCTCCAG 

TTCCGCATGAAGATCATTAACCTGAAGCTGGAGCGGTTTCAAGACCGCGTGGAGTTCTCAGG 

GAACCCCAGCAAGTACGATGTGTCGGTGATGCTGAGAAACGTGCAGCCGGAGGATGAGGGGA 
TTTACAACTGCTACATCATGAACCCCCC 
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FIQURE 57 



TCACGGGGCTCATCTCTTTTTCTCTTTGGTGCCCACCAGGACGGAGCATGGAGGTNCACATA 
CCTGCCACCCTCAACGTCCTCAATGGCTTTGACGCCCGCCTGCCCTGCACCTTCAACTCCNG 
CTACACAGTGAACCACAAACAGTTCTCCCTGAACTGGATTTACCAGGAGTGCAACAACTGGC 
TCTGAGGAGATGTTCCTCCAGTTCCCGCATGGAAGATCATTTAACCTGAAAGCTGGAAGCGG 
TTTTCAAGAACCGCGTGGAAGTTTCTCAGGGAACCCCAGCAAGTACGATGTGTCGGTGATGC 
TGAGAAACGTGCAGCCGGAGGATGAGGGGATTTACAACTGCTACATCATGAACCCCCC 
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TGCGGCGACCGTCGTACACCa^SGGCCTCCACCTCCGCCCCTACCGTGTGGGGCTGCTCCCG 

GATGGCCTCCTGTTCCTCTTGCTGCTGCTAATGCTGCTCGCGGACCCAGCGCTCCCGGCCGG 

ACGTCACCCCCCAGTGGTGCTGGTCCCTGGTGATTTGGGTAACCAACTGGAAGCCAAGCTGG 

ACAAGCCGACAGTGGTGCACTACCTCTGCTCCAAGAAGACCGAAAGCTACTTCACAATCTGG 

CTGAACCTGGAACTGCTGCTGCCTGTCATCATTGACTGCTGGATTGACAATATCAGGCTGGT 

TTACAACAAAACATCCAGGGCCACCCAGTTTCCTGATGGTGTGGATGTACGTGTCCCTGGCT 

TTGGGAAGACCTTCTCACTGGAGTTCCTGGACCCCAGCAAAAGCAGCGTGGGTTCCTATTTC 

CACACCATGGTGGAGAGCCTTGTGGGCTGGGGCTACACACGGGGTGAGGATGTCCGAGGGGC 

TCCCTATGACTGGCGCCGAGCCCCAAATGAAAACGGGCCCTACTTCCTGGCCCTCCGCGAGA 

TGATCGAGGAGATGTACCAGCTGTATGGGGGCCCCGTGGTGCTGGTTGCCCACAGTATGGGC 

AACATGTACACGCTCTACTTTCTGCAGCGGCAGCCGCAGGCCTGGAAGGACAAGTATATCCG 

GGCCTTCGTGTCACTGGGTGCGCCCTGGGGGGGCGTGGCCAAGACCCTGCGCGTCCTGGCTT 

CAGGAGACAACAACCGGATCCCAGTCATCGGGCCCCTGAAGATCCGGGAGCAGCAGCGGTCA 

GCTGTCTCCACCAGCTGGCTGCTGCCCTACAACTACACATGGTCACCTGAGAAGGTGTTCGT 

GCAGACACCCACAATCAACTACACACTGCGGGACTACCGCAAGTTCTTCCAGGACATCGGCT 

TTGAAGATGGCTGGCTCATGCGGCAGGACACAGAAGGGCTGGTGGAAGCCACGATGCCACCT 

GGCGTGCAGCTGCACTGCCTCTATGGTACTGGCGTCCCCACACCAGACTCCTTCTACTATGA 

GAGCTTCCCTGACCGTGACCCTAAAATCTGCTTTGGTGACGGCGATGGTACTGTGAACTTGA 

AGAGTGCCCTGCAGTGCCAGGCCTGGCAGAGCCGCCAGGAGCACCAAGTGTTGCTGCAGGAG 

CTGCCAGGCAGCGAGCACATCGAGATGCTGGCCAACGCCACCACCCTGGCCTATCTGAAACG 

TGTGCTCCTTGGGCCCIJ5ACTCCTGTGCCACAGGACTCCTGTGGCTCGGCCGTGGACCTGCT 

GTTGGCCTCTGGGGCTGTCATGGCCCACGCGTTTTGCAAAGTTTGTGACTCACCATTCAAGG 

CCCCGAGTCTTGGACTGTGAAGCATCTGCCATGGGGAAGTGCTGTTTGTTATCCTTTCTCTG 

TGGCAGTGAAGAAGGAAGAAATGAGAGTCTAGACTCAAGGGACACTGGATGGCAAGAATGCT 

GCTGATGGTGGAACTGCTGTGACCTTAGGACTGGCTCCACAGGGTGGACTGGCTGGGCCCTG 

GTCCCAGTCCCTGCCTGGGGCCATGTGTCCCCCTATTCCTGTGGGCTTTTCATACTTGCCTA 

CTGGGCCCTGGCCCCGCAGCCTTCCTATGAGGGATGTTACTGGGCTGTGGTCCTGTACCCAG 

AGGTCCCAGGGATCGGCTCCTGGCCCCTCGGGTGACCCTTCCCACACACCAGCCACAGATAG 

GCCTGCCACTGGTCATGGGTAGCTAGAGCTGCTGGCTTCCCTGTGGCTTAGCTGGTGGCCAG 

CCTGACTGGCTTCCTGGGCGAGCCTAGTAGCTCCTGCAGGCAGGGGCAGTTTGTTGCGTTCT 

TCGTGGTTCCCAGGCCCTGGGACATCTCACTCCACTCCTACCTCCCTTACCACCAGGAGCAT 

TCAAGCTCTGGATTGGGCAGCAGATGTGCCCCCAGTCCCGCAGGCTGTGTTCCAGGGGCCCT 

GATTTCCTCGGATGTGCTATTGGCCCCAGGACTGAAGCTGCCTCCCTTCACCCTGGGACTGT 

GGTTCCAAGGATGAGAGCAGGGGTTGGAGCCATGGCCTTCTGGGAACCTATGGAGAAAGGGA 

ATCCAAGGAAGCAGCCAAGGCTGCTCGCAGCTTCCCTGAGCTGCACCTCTTGCTAACCCCAC 

CATCACACTGCCACCCTGCCCTAGGGTCTCACTAGTACCAAGTGGGTCAGCACAGGGCTGAG 

GATGGGGCTCCTATCCACCCTGGCCAGCACCCAGCTTAGTGCTGGGACTAGCCCAGAAACTT 

GAATGGGACCCTGAGAGAGCCAGGGGTCCCCTGAGGCCCCCCTAGGGGCTTTCTGTCTGCCC 

CAGGGTGCTCCATGGATCTCCCTGTGGCAGCAGGCATGGAGAGTCAGGGCTGCCTTCATGGC 

AGTAGGCTCTAAGTGGGTGACTGGCCACAGGCCGAGAAAAGGGTACAGCCTCTAGGTGGGGT 

TCCCAAAGACGCCTTCAGGCTGGACTGAGCTGCTCTCCCACAGGGTTTCTGTGCAGCTGGAT 

TTTCTCTGTTGCATACATGCCTGGCATCTGTCTCCCCTTGTTCCTGAGTGGCCCCACATGGG 

GCTCTGAGCAGGCTGTATCTGGATTCTGGCAATAAAAGTACTCTGGATGCTGTAAAAAAAAA 

AAAAAAAAAAAAAA 
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FIGURE 59 

></usr/seqcib2/sst/DNA/Dnaseqs.min/ss.DNA44189 
xsubunit 1 of 1, 412 aa, 1 stop 
XMW: 46658, pi: 6.65, NX(S/T): 4 

MGLHLRPYRVGLLPDGLLFLLLLIidLLADPALPAGRHPPVVLVPGDLGNQLEAKLDKPTVVH 
YLCSKKTESYFTIWLNLELLLPVIIDCWIDNIRLVYNKTSRATQFPDGVDVRVPGFGKTFSL 
EFLDPSKSSVGSYFHTMVESLVGWGYTRGEDVRGAPYDWRRAPNENGPYFLALREMIEEMYQ 
LYGGPWLVAHSMGNMYTLYFLQRQPQAWKDKYIRAFVSLGAPWGGVAKTLRVIASGDNNRI 
PVIGPLKIREQQRSAVST5WLLPYNYTWSPEKVFVQTPTINYTLRDYRKFFQDIGFEDGWLM 
RQDTEGLVEATMPPGVQLHCLYGTGVPTPDSFYYESFPDRDPKICFGDGDGTVNLKSALQCQ 
AWQSRQEHQVLLQELPGSEHIEMLANATTLAYLKRVLLGP 

Important features: 
Signal peptide: 
amino acids 1-28 

Potential lipid substrate binding site: 

amino acids 147-164 

N-glycosylation sites. 

amino acids 99-102, 273-276, 289-292 and 398-401 

Lipases, serine proteins 

amino acids 189-201 

Beta-transducin family Trp-Asp repeat 
amino acids 353-365 
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FIGURE 60 

CGGACGCGTGGGCGGACGCGTGGGGCGGCGGCAGCGGCGGCGACGGCGACAISGAGAGCGGG 

GCCTACGGCGCGGCCAAGGCGGGCGGCTCCTTCGACCTGCGGCGCTTCCTGACGCAGCCGCA 

GGTGGTGGCGCGCGCCGTGTGCTTGGTCTTCGCCTTGATCGTGTTCTCCTGCATCTATGGTG 

AGGGCTACAGCAATGCCCACGAGTCTAAGCAGATGTACTGCGTGTTCAACCGCAACGAGGAT 

GCCTGCCGCTATGGCAGTGCCATCGGGGTGCTGGCCTTCCTGGCCTCGGCCTTCTTCTTGGT 

GGTCGACGCGTATTTCCCCCAGATCAGCAACGCCACTGACCGCAAGTACCTGGTCATTGGTG 

ACCTGCTCTTCTCAGCTCTCTGGACCTTCCTGTGGTTTGTTGGTTTCTGCTTCCTCACCAAC 

CAGTGGGCAGTCACCAACCCGAAGGACGTGCTGGTGGGGGCCGACTCTGTGAGGGCAGCCAT 

CACCTTCAGCTTCTTTTCCATCTTCTCCTGGGGTGTGCTGGCCTCCCTGGCCTACCAGCGCT 

ACAAGGCTGGCGTGGACGACTTCATCCAGAATTACGTTGACCCCACTCCGGACCCCAACACT 

GCCTACGCCTCCTACCCAGGTGCATCTGTGGACAACTACCAACAGCCACCCTTCACCCAGAA 

CGCGGAGACCACCGAGGGCTACCAGCCGCCCCCTGTGTACISAGTGGCGGTTA6CGTGGGAA 

GGGGGACAGAGAGGGCCCTCCCCTCTGCCCTGGACTTTCCCATCAGCCTCCTGGAACTGCCA 

GCCCCTCTCTTTCACCTGTTCCATCCTGTGCAGCTGACACACAGCTAAGGAGCCTCATAGCC 

TGGCGGGGGCTGGCAGAGCCACACCCCAAGTGCCTGTGCCCAGAGGGCTTCAGTCAGCCGCT 

CACTCCTCCAGGGCACTTTTAGGAAAGGGTTTTTAGCTAGTGTTTTTCCTCGCTTTTAATGA 

CCTCAGCCCCGCCTGCAGTGGCTAGAAGCCAGCAGGTGCCCATGTGCTACTGACAAGTGCCT 

CAGCTTCCCCCCGGCCCGGGTCAGGCCGTGGGAGCCGCTATTATCTGCGTTCTCTGCCAAAG 

ACTCGTGGGGGCCATCACACCTGCCCTGTGCAGCGGAGCCGGACCAGGCTCTTGTGTCCTCA 

CTCAGGTTTGCTTCCCCTGTGCCCACTGCTGTATGATCTGGGGGCCACCACCCTGTGCCGGT 

GGCCTCTGGGCTGCCTCCCGTGGTGTGAGGGCGGGGCTGGTGCTCATGGCACTTCCTCCTTG 

CTCCCACCCCTGGCAGCAGGGAAGGGCTTTGCCTGACAACACCCAGCTTTATGTAAATATTC 

TGCAGTTGTTACTTAGGAAGCCTGGGGAGGGCAGGGGTGCCCCATGGCTCCCAGACTCTGTC 

TGTGCCGAGTGTATTATAAAATCGTGGGGGAGATGCCCGGCCTGGGATGCTGTTTGGAGACG 
GAATAAATGTTTTCTCATTCAAAG 
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FIGURE 61 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA48304 
<subunit 1 of 1, 224 aa, 1 stop 
<MW: 24810, pi: 4.75, NX(S/T): 1 

MESGAYGAAKAGGSFDLRRFLTQPQVVARAVCLVFALIVFSCIYGEGYSNAHESKQMYCVFN 
RNEDACRYGSAIGVLAFLASAFFLWDAYFPQISNATDRKYLVIGDLLFSALWTFLWFVGFC 
FLTNQWAVTNPKDVLVGADSVRAAITFSFFSIFSWGVIASLAYQRYKAGVDDFIQNYVDPTP 
DPNTAYASYPGASVDNYQQPPFTQNAETTEGYQPPPVY 

Important features: 

Type II Transmembrane domain: 

amino acids 24-43 

Other transmembrane domains: 

amino acids 74-90, 108-126 and 145-161 

N-glycosylation site, 
amino acids 97-100 
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FIGURE 62 

GAGCCACCTACCCTGCTCCGAGGCCAGGCCTGCAGGGCCTCATCGGCCAGAGGGTGATCAGT 
GAGCAGAAGG&ISCCCGTGGCCGAGGCCCCCCAGGTGGCTGGCGGGCAGGGGGACGGAGGTG 
ATGGCGAGGAAGCGGAGCCAGAGGGGATGTTCAAGGCCTGTGAGGACTCCAAGAGAAAAGCC 
CGGGGCTACCTCCGCCTGGTGCCCCTGTTTGTGCTGCTGGCCCTGCTCGTGCTGGCTTCGGC 
GGGGGTGCTACTCTGGTATTTCCTAGGGTACAAGGCGGAGGTGATGGTCAGCCAGGTGTACT 
CAGGCAGTCTGCGTGTACTCAATCGCCACTTCTCCCAGGATCTTACCCGCCGGGAATCTAGT 
GCCTTCCGCAGTGAAACCGCCAAAGCCCAGAAGATGCTCAAGGAGCTCATCACCAGCACCCG 
CCTGGGAACTTACTACAACTCCAGCTCCGTCTATTCCTTTGGGGAGGGACCCCTCACCTGCT 
TCTTCTGGTTCATTCTCCAAATCCCCGAGCACCGCCGGCTGATGCTGAGCCCCGAGGTGGTG 
CAGGCACTGCTGGTGGAGGAGCTGCTGTCCACAGTCAACAGCTCGGCTGCCGTCCCCTACAG 
GGCCGAGTACGAAGTGGACCCCGAGGGCCTAGTGATCCTGGAAGCCAGTGTGAAAGACATAG 
CTGCATTGAATTCCACGCTGGGTTGTTACCGCTACAGCTACGTGGGCCAGGGCCAGGTCCTC 
CGGCTGAAGGGGCCTGACCACCTGGCCTCCAGCTGCCTGTGGCACCTGCAGGGCCCCAAGGA 
CCTCATGCTCAAACTCCGGCTGGAGTGGACGCTGGCAGAGTGCCGGGACCGACTGGCCATGT 
ATGACGTGGCCGGGCCCCTGGAGAAGAGGCTCATCACCTCGGTGTACGGCTGCAGCCGCCAG 
GAGCCCGTGGTGGAGGTTCTGGCGTCGGGGGCCATCATGGCGGTCGTCTGGAAGAAGGGCCT 
GCACAGCTACTACGACCCCTTCGTGCTCTCCGTGCAGCCGGTGGTCTTCCAGGCCTGTGAAG 
TGAACCTGACGCTGGACAACAGGCTCGACTCCCAGGGCGTCCTCAGCACCCCGTACTTCCCC 
AGCTACTACTCGCCCCAAACCCACTGCTCCTGGCACCTCACGGTGCCCTCTCTGGACTACGG 
CTTGGCCCTCTGGTTTGATGCCTATGCACTGAGGAGGCAGAAGTATGATTTGCCGTGCACCC 
AGGGCCAGTGGACGATCCAGAACAGGAGGCTGTGTGGCTTGCGCATCCTGCAGCCCTACGCC 
GAGAGGATCCCCGTGGTGGCCACGGCCGGGATCACCATCAACTTCACCTCCCAGATCTCCCT 
CACCGGGCCCGGTGTGCGGGTGCACTATGGCTTGTACAACCAGTCGGACCCCTGCCCTGGAG 
AGTTCCTCTGTTCTGTGAATGGACTCTGTGTCCCTGCCTGTGATGGGGTCAAGGACTGCCCC 
AACGGCCTGGATGAGAGAAACTGCGTTTGCAGAGCCACATTCCAGTGCAAAGAGGACAGCAC 
ATGCATCTCACTGCCCAAGGTCTGTGATGGGCAGCCTGATTGTCTCAACGGCAGCGATGAAG 
AGCAGTGCCAGGAAGGGGTGCCATGTGGGACATTCACCTTCCAGTGTGAGGACCGGAGCTGC 
GTGAAGAAGCCCAACCCGCAGTGTGATGGGCGGCCCGACTGCAGGGACGGCTCGGATGAGGA 
GCACTGTGACTGTGGCCTCCAGGGCCCCTCCAGCCGCATTGTTGGTGGAGCTGTGTCCTCCG 
AGGGTGAGTGGCCATGGCAGGCCAGCCTCCAGGTTCGGGGTCGACACATCTGTGGGGGGGCC 
CTCATCGCTGACCGCTGGGTGATAACAGCTGCCCACTGCTTCCAGGAGGACAGCATGGCCTC 
CACGGTGCTGTGGACCGTGTTCCTGGGCAAGGTGTGGCAGAACTCGCGCTGGCCTGGAGAGG 
TGTCCTTCAAGGTGAGCCGCCTGCTCCTGCACCCGTACCACGAAGAGGACAGCCATGACTAC 
GACGTGGCGCTGCTGCAGCTCGACCACCCGGTGGTGCGCTCGGCCGCCGTGCGCCCCGTCTG 
CCTGCCCGCGCGCTCCCACTTCTTCGAGCCCGGCCTGCACTGCTGGATTACGGGCTGGGGCG 
CCTTGCGCGAGGGCGGCCCCATCAGCAACGCTCTGCAGAAAGTGGATGTGCAGTTGATCCCA 
CAGGACCTGTGCAGCGAGGCCTATCGCTACCAGGTGACGCCACGCATGCTGTGTGCCGGCTA 
CCGCAAGGGCAAGAAGGATGCCTGTCAGGGTGACTCAGGTGGTCCGCTGGTGTGCAAGGCAC 
TCAGTGGCCGCTGGTTCCTGGCGGGGCTGGTCAGCTGGGGCCTGGGCTGTGGCCGGCCTAAC 
TACTTCGGCGTCTACACCCGCATCACAGGTGTGATCAGCTGGATCCAGCAAGTGGTGACCIS 
AGGAACTGCCCCCCTGCAAAGCAGGGCCCACCTCCTGGACTCAGAGAGCCCAGGGCAACTGC 
CAAGCAGGGGGACAAGTATTCTGGCGGGGGGTGGGGGAGAGAGCAGGCCCTGTGGTGGCAGG 
AGGTGGCATCTTGTCTCGTCCCTGATGTCTGCTCCAGTGATGGCAGGAGGATGGAGAAGTGC 
CAGCAGCTGGGGGTCAAGACGTCCCCTGAGGACCCAGGCCCACACCCAGCCCTTCTGCCTCC 
CAATTCTCTCTCCTCCGTCCCCTTCCTCCACTGCTGCCTAATGCAAGGCAGTGGCTCAGCAG 
CAAGAATGCTGGTTCTACATCCCGAGGAGTGTCTGAGGTGCGCCCCACTCTGTACAGAGGCT 
GTTTGGGCAGCCTTGCCTCCAGAGAGCAGATTCCAGCTTCGGAAGCCCCTGGTCTAACTTGG 
GATCTGGGAATGGAAGGTGCTCCCATCGGAGGGGACCCTCAGAGCCCTGGAGACTGCCAGGT 
GGGCCTGCTGCCACTGTAAGCCAAAAGGTGGGGAAGTCCTGACTCCAGGGTCCTTGCCCCAC 
CCCTGCCTGCCACCTGGGCCCTCACAGCCCAGACCCTCACTGGGAGGTGAGCTCAGCTGCCC 
TTTGGAATAAAGCTGCCTGATCAAAAAAAAAAAAAAAAAAAAA 
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></usr/seqdb2/sst/DNA/Dnaseqs.rain/ss.DNA49152 
xsubunit 1 of 1, 802 aa, 1 stop 
XMW: 88846, pi: 6.41, NX(S/T): 7 

MPVAEAPQVAGGQGDGGDGEEAEPEGMFKACEDSKRKARGYLRLVPLEVLLALLVLASAGVL 
LWYFLGYKAEVMVSQVYSGSLRVLNRHFSQDLTRRESSAFRSETAKAQKMLKELITSTRLGT 
YYNSS S VYS FGEGPLTC FFWFI LQI PEHRRLMLSPEWQALLVEELLSTVNSSAAVPYRAEY 
EVDPEGLVILEASVKDIAALNSTLGCYRYSYVGQGQVLRLKGPDHLASSCLWHLQGPKDLML 
KLRLEWTLAECRDRLAMYDVAGPLEKRLITSVYGCSRQEPWEVLASGAIMAWWKKGLHSY 
YDPFVLSVQPWFQACEVNLTLDNRLDSQGVLSTPYFPSYYSPQTHCSWHLTVPSLDYGLAL 
WFDAYALRRQKYDLPCTQGQWTIQNRRLCGLRILQPYAERIPWATAGITINFTSQISLTGP 
GVRVHYGLYNQS DPC PGE FLCSVNGLCVPACDGVKDCPNGLDERNCVCRAT FQCKEDSTC I S 
LPKVCDGQPDCLNGSDEEQCQEGVPCGTFTFQCEDRSCVKKPNPQCDGRPDCRDGSDEEHCD 
CGLQG P S S RI VGGAVS SEGE WPWQAS LQVRGRH ICGGAL I ADRWVI TAAHCFQEDSMASTVL 
WTVFLGKVWQNSRWPGEVSFKVSRLLLHPYHEEDSHDYDVALLQLDHPWRSAAVRPVCLPA 
RSHFFEPGLHCW I TGWGALREGGP I SNALQKVDVQLI PQDLCSEAYRYQVTPRMLCAGYRKG 
KKDACQGDSGGPLVCKALSGRWFLAGLVSWGLGCGRPNYFGVYTRITGVISWIQQWT 

Important features: 

Type II transmembrane domain: 

amino acids 46-67 

Serine proteases, trypsin family, histidine active site, 
amino acids 604-609 

N-glycosylation sites. 

amino acids 127-130, 175-178, 207-210, 329-332, 424-427, 444-447 
and 509-512 

Kringle domains. 

amino acids 746-758 and 592-609 

Homologous region to Kallikrein Light Chain: 
amino acids 568-779 

Homologous region to Low-density lipoprotein receptor: 
amino acids 4 51-567 
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GCACCCAGGGCCAGTGGACGATCCAGAACAGGAGGCTGTGTGGCTTGCGCATCCTGCAGCCC 

TACGCCGAGAGGATCCCCGTGGTGGCCACGGCCGGGATCACCATCAACTTCACCTCCCAGAT 

CTCCCTCACCGGGCCCGGTGTGCGGGTGCACTATGGCTTGTACAACCAGTCGGACCCCTGCC 

CTGGAGAGTTCCTCTGTTCTGTGAATGGACTCTGTGTCCCTGCCTGTGATGGGGTCAAGGAC 

TGCCCCAACGGCCTGGATGAGAGAAACTGCGTTTGCAGAGCCACATTCCAGTGCAAAGAGGA 

CAGCACATGCATCTCACTGCCCAAGGTCTGTGATGGGCAGCCTGATTGTCTCAACGGCAGCG 

ATGAAGAGCAGTGCCAGGAAGGGGTGCCATGTGGGACATTCACCTTCCAGTGTGAGGACCGG 

AGCTGCGTGAAGAAGCCCAACCCGCAGTGTGATGGGCGGCCCGACTGCAGGGACGGCTCGGA 

TGAGGAGCACTGTGACTGTGGCCTCCAGGGCCCCTCCAGCCGCATTGTTGGTGGAGCTGTGT 

CCTCCGAGGGTGAGTGGCCATGGCAGGCCAGCCTCCAGGTTCGGGGTCGACACATCTGTGGG 

GGGGCCCTCATCGCTGACCGCTGGGTGATAACAGCTGCCCACTGCTTCCAGGAGGACAGCAT 

GGCCTCCACGGTGCTGTGGACCGTGTTCCTGGGCAAGGTGTGGCAGAACTCGCGCTGGCCTG 

GAGAGGTGTCCTTCAAGGTGAGCCGCCTGCTCCTGCACCCGTACCACGAAGAGGACAGCCAT 

GACTACGACGTGGCGCTGCTGCAGCTCGACCACCCGGTGGTGCGCTCGGCCGCCGTGCGCCC 

CGTCTGCCTGCCCGCGCGCTCCCACTTCTTCGAGCCCGGCCTGCACTGCTGGATTACGGGCT 

GGGGCGCCTTGCGCGAGGGCGGCCCCATCAGCAACGCTCTGCAGAAAGTGGATGTGCAGTTG 

ATCCCACAGGACCTGTGCAGCGAGGCCTATCGCTACCAGGT6ACGCCACGCATGCTGTGTGC 

CGGCTACCGCAAGGGCAAGAAGGATGCCTGTCAGGGTGACTCAGGTGGTCCGCTGGTGTGCA 

AGGCACTCAGTGGCCGCTGGTTCCTGGCGGGGCTGGTCAGCTGGGGCCTGGGCTGTGGCCGG 

CCTAACTACTTCGGCGTCTACACCCGCATCACAGGTGTGATCAGCTGGATCCAGCAAGTGGT 

GACCTGAGGAACTGCCCCCCTGCAAAGCAGGGCCCACCTCCTGGACTCAGAGAGCCCAGGGC 
AACTGCCAAGCAGGGGGACAAGTAT 
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GGACGAGGGCAGATCTCGTTCTGGGGCAAGCCGTTGACACTCGCTCCCTGCCACCGCCCGGG 

CTCCGTGCCGCCAAGTTTTCATTTTCCACCTTCTCTGCCTCCAGTCCCCCAGCCCCTGGCCG 

AGAGAAGGGTCTTACCGGCCGGGATTGCTGGAAACACCAAGAGGTGGTTTTTGTTTTTTAAA 

ACTTCTGTTTCTTGGGAGGGGGTGTGGCGGGGCAGG^MSAGCAACTCCGTTCCTCTGCTCTG 

TTTCTGGAGCCTCTGCTATTGCTTTGCTGCGGGGAGCCCCGTACCTTTTGGTCCAGAGGGAC 

GGCTGGAAGATAAGCTCCACAAACCCAAAGCTACACAGACTGAGGTCAAACCATCTGTGAGG 

TTTAACCTCCGCACCTCCAAGGACCCAGAGCATGAAGGATGCTACCTCTCCGTCGGCCACAG 

CCAGCCCTTAGAAGACTGCAGTTTCAACATGACAGCTAAAACCTTTTTCATCATTCACGGAT 

GGACGATGAGCGGTATCTTTGAAAACTGGCTGCACAAACTCGTGTCAGCCCTGCACACAAGA 

GAGAAAGACGCCAATGTAGTTGTGGTTGACTGGCTCCCCCTGGCCCACCAGCTTTACACGGA 

TGCGGTCAATAATACCAGGGTGGTGGGACACAGCATTGCCAGGATGCTCGACTGGCTGCAGG 

AGAAGGACGATTTTTCTCTCGGGAATGTCCACTTGATCGGCTACAGCCTCGGAGCGCACGTG 

GCCGGGTATGCAGGCAACTTCGTGAAAGGAACGGTGGGCCGAATCACAGGTTTGGATCCTGC 

CGGGCCCATGTTTGAAGGGGCCGACATCCACAAGAGGCTCTCTCCGGACGATGCAGATTTTG 

TGGATGTCCTCCACACCTACACGCGTTCCTTCGGCTTGAGCATTGGTATTCAGATGCCTGTG 

GGCCACATTGACATCTACCCCAATGGGGGTGACTTCCAGCCAGGCTGTGGACTCAACGATGT 

CTTGGGATCAATTGCATATGGAACAATCACAGAGGTGGTAAAATGTGAGCATGAGCGAGCCG 

TCCACCTCTTTGTTGACTCTCTGGTGAATCAGGACAAGCCGAGTTTTGCCTTCCAGTGCACT 

GACTCCAATCGCTTCAAAAAGGGGATCTGTCTGAGCTGCCGCAAGAACCGTTGTAATAGCAT 

TGGCTACAATGCCAAGAAAATGAGGAACAAGAGGAACAGCAAAATGTACCTAAAAACCCGGG 

CAGGCATGCCTTTCAGAGGTAACCTTCAGTCCCTGGAGTGTCCCIS&GGAAGGCCCTTAATA 

CCTCCTTCTTAATACCATGCTGCAGAGCAGGGCACATCCTAGCCCAGGAGAAGTGGCCAGCA 

CAATCCAATCAAATCGTTGCAAATCAGATTACACTGTGCATGTCCTAGGAAAGGGAATCTTT 

ACAAAATAAACAGTGTGGACCCCTAATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 66 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA49646 
xsubunit 1 of 1, 3 54 aa, 1 stop 
xMW: 39362, pi: 8.35, NX(S/T): 2 

MSNSVPLLCFWSLCYCFAAGSPVPFGPEGRLEDKLHKPKATQTEVKPSVRFNLRTSKDPEHE 
GCYLSVGHSQPLEDCSFNMTAKTFFIIHGWTMSGIFENWLHKLVSALHTREKDANWWDWL 
PLAHQLYTDAVNNTRWGHSIARMLDWLQEKDDPSLGNVHLIGYSLGAHVAGYAGNFVKGTV 
GRITGLDPAGPMFEGADIHKRLSPDDADFVDVLHTYTRSFGLSIGIQMPVGHIDIYPNGGDF 
QPGCGLNDVLGSIAYGTITEWKCEHERAVHLFVDSLVNQDKPSFAFQCTDSNRFKKGICLS 
CRKNRCNSIGYNAKKMRNKRNSKMYLKTRAGMPFRGNLQSLECP 

Important features: 
Signal peptide: 

amino acids 1-16 

Lipases, serine active site. 

amino acids 163-172 

N-glycosylation sites. 

amino acids 80-83 and 136-13 9 



FIfiURE 67 

CGGACGCGTGGGCGGACGCGTGGGCCTGGGCAAGGGCCGGGGCGCCGGGCCGAGCCACCTCT 

TCCCCTCCCCCGCTTCCCTGTCGCGCTCCGCTGGCTGGACGCGCTGGAGGAGTGGAGCAGCA 

CCCGGCCGGCCCTGGGGGCTGACAGTCGGCAAAGTTTGGCCCGAAGAGGAAGTGGTCTCAAA 

CCCCGGCAGGTGGCGACCAGGCCAGACCAGGGGCGCTCGCTGCCTGCGGGCGGGCTGTAGGC 

GAGGGCGCGCCCCAGTGCCGAGACCCGGGGCTTCAGGAGCCGGCCCCGGGAGAGAAGAGTGC 

GGCGGCGGACGGAGAAAACAACTCCAAAGTTGGCGAAAGGCACCGCCCCTACTCCCGGGCTG 

CCGCCGCCTCCCCGCCCCCAGCCCTGGCATCCAGAGTACGGGTCGAGCCCGGGCCATGGAGC 

CCCCCTGGGGAGGCGGCACCAGGGAGCCTGGGCGCCCGGGGCTCCGCCGCGACCCCATCGGG 

TAGACCACAGAAGCTCCGGGACCCTTCCGGCACCTCTGGACAGCCCAGGMfiCTGTTGGCCA 

CCCTCCTCCTCCTCCTCCTTGGAGGCGCTCTGGCCCATCCAGACCGGATTATTTTTCCAAAT 

CATGCTTGTGAGGACCCCCCAGCAGTGCTCTTAGAAGTGCAGGGCACCTTACAGAGGCCCCT 

GGTCCGGGACAGCCGCACCTCCCCTGCCAACTGCACCTGGCTCATCCTGGGCAGCAAGGAAC 

AGACTGTCACCATCAGGTTCCAGAAGCTACACCTGGCCTGTGGCTCAGAGCGCTTAACCCTA 

CGCTCCCCTCTCCAGCCACTGATCTCCCTGTGTGAGGCACCTCCCAGCCCTCTGCAGCTGCC 

CGGGGGCAACGTCACCATCACTTACAGCTATGCTGGGGCCAGAGCACCCATGGGCCAGGGCT 

TCCTGCTCTCCTACAGCCAAGATTGGCTGATGTGCCTGCAGGAAGAGTTTCAGTGCCTGAAC 

CACCGCTGTGTATCTGCTGTCCAGCGCTGTGATGGGGTTGATGCCTGTGGC3ATGGCTCTGA 

TGAAGCAGGTTGCAGCTCAGACCCCTTCCCTGGCCTGACCCCAAGACCCGTCCCCTCCCTGC 

CTTGCAATGTCACCTTGGAGGACTTCTATGGGGTCTTCTCCTCTCCTGGATATACACACCTA 

GCCTCAGTCTCCCACCCCCAGTCCTGCCATTGGCTGCTGGACCCCCATGATGGCCGGCGGCT 

GGCCGTGCGCTTCACAGCCCTGGACTTGGGCTTTGGAGATGCAGTGCATGTGTATGACGGCC 

CTGGGCCCCCTGAGAGCTCCCGACTACTGCGTAGTCTCACCCACTTCAGCAATGGCAAGGCT 

GTCACTGTGGAGACACTGTCTGGCCAGGCTGTTGTGTCCTACCACACAGTTGCTTGGAGCAA 

TGGTCGTGGCTTCAATGCCACCTACCATGTGCGGGGCTATTGCTTGCCTTGGGACAGACCCT 

GTGGCTTAGGCTCTGGCCTGGGAGCTGGCGAAGGCCTAGGTGAGCGCTGCTACAGTGAGGCA 

CAGCGCTGTGACGGCTCATGGGACTGTGCTGACGGCACAGATGAGGAGGACTGCCCAGGCTG 

CCCACCTGGACACTTCCCCTGTGGGGCTGCTGGCACCTCTGGTGCCACAGCCTGCTACCTGC 

CTGCTGACCGCTGCAACTACCAGACTTTCTGTGCTGATGGAGCA6ATGAGAGACGCTGTCGG 

CATTGCCAGCCTGGCAATTTCCGATGCCGGGACGAGAAGTGCGTGTATGAGACGTGGGTGTG 

CGATGGGCAGCCAGACTGTGCGGACGGCAGTGATGAGTGGGACTGCTCCTATGTTCTGCCCC 

GCAAGGTCATTACAGCTGCAGTCATTGGCAGCCTAGTGTGCGGCCTGCTCCTGGTCATCGCC 

CTGGGCTGCACCTGCAAGCTCTATGCCATTCGCACCCAGGAGTACAGCATCTTTGCCCCCCT 

CTCCCGGATGGAGGCTGAGATTGTGCAGCAGCAGGCACCCCCTTCCTACGGGCAGCTCATTG 

CCCAGGGTGCCATCCCACCTGTAGAAGACTTTCCTACAGAGAATCCTAATGATAACTCAGTG 

CTGGGCAACCTGCGTTCTCTGCTACAGATCTTACGCCAGGATATGACTCCAGGAGGTGGCCC 

AGGTGCCCGCCGTCGTCAGCGGGGCCGCTTGATGCGACGCCTGGTACGCCGTCTCCGCCGCT 

GGGGCTTGCTCCCTCGAACCAACACCCCGGCTCGGGCCTCTGAGGCCAGATCCCAGGTCACA 

CCTTCTGCTGCTCCCCTTGAGGCCCTAGATGGTGGCACAGGTCCAGCCCGTGAGGGCGGGGC 

AGTGGGTGGGCAAGATGGGGAGCAGGCACCCCCACTGCCCATCAAGGCTCCCCTCCCATCTG 

CTAGCACGTCTCCAGCCCCCACTACTGTCCCTGAAGCCCCAGGGCCACTGCCCTCACTGCCC 

CTAGAGCCATCACTATTGTCTGGAGTGGTGCAGGCCCTGCGAGGCCGCCTGTTGCCCAGCCT 

GGGGCCCCCAGGACCAACCCGGAGCCCCCCTGGACCCCACACAGCAGTCCTGGCCCTGGAAG 

ATGAGGACGATGTGCTACTGGTGCCACTGGCTGAGCCGGGGGTGTGGGTAGCTGAGGCAGAG 

GATGAGCCACTGCTTACClfi&GGGGACCTGGGGGCTCTACTGAGGCCTCTCCCCTGGGGGCT 

CTACTCATAGTGGCACAACCTTTTAGAGGTGGGTCAGCCTCCCCTCCACCACTTCCTTCCCT 

GTCCCTGGATTTCAGGGACTTGGTGGGCCTCCCGTTGACCCTATGTAGCTGCTATAAAGTTA 

AGTGTCCCTCAGGCAGGGAGAGGGCTCACAGAGTCTCCTCTGTACGTGGCCATGGCCAGACA 

CCCCAGTCCCTTCACCACCACCTGCTCCCCACGCCACCACCATTTGGGTGGCTGTTTTTAAA 

AAGTAAAGTTCTTAGAGGATCATAGGTCTGGACACTCCATCCTTGCCAAACCTCTACCCAAA 

Kgtggccttaagcaccggaatgccaattaactagagaccctccagcccccaaggggaggatt 
tgggcagaacctgaggttttgccatccacaatccctcctacagggcctggctcacaaaaaga 
gtgcaacaaatgcttctattccatagctacggcattgctcagtaagttgaggtcaaaaataa 

aggaatcatacatctc 
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FIGURE 68 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA49631 
<subunit 1 of 1, 713 aa, 1 stop 
<MW: 76193, pl4 5.42, NX(S/T): 4 

MLLATLLLLLLGGALAHPDRIIFPNHACEDPPAVLLEVQGTLQRPLVRDSRTSPANCTWLIL 
GSKEQTVTIRFQKLHLACGSERLTLRSPLQPLISLCEAPPSPLQLPGGNVTITYSYAGARAP 
MGQGFLLSYSQDWLMCLQEEFQCLNHRCVSAVQRCDGVDACGDGSDEAGCSSDPFPGLTPRP 
VPSLPCNVTLEDFYGVFSSPGYTHLASVSHPQSCHWLLDPHDGRRLAVRFTALDLGFGDAVH 
VYDGPGPPESSRLLRSLTHFSNGKAVTVETLSGQAWSYHTVAWSNGRGFNATYHVRGYCLP 
WDRPCGLGSGLGAGEGLGERCYSEAQRCDGSWDCADGTDEEDCPGCPPGHFPCGAAGTSGAT 
ACYLPADRCNYQTFCADGADERRCRHCQPGNFRCRDEKCVYETWVCDGQPDCADGSDEWDCS 
YVLPRKVITAAVIGSLVCGLLLVIALGCTCKLYAIRTQEYSIFAPLSRMEAEIVQQQAPPSY 
GQLIAQGAIPPVEDFPTENPNDNSVLGNLRSLLQILRQDMTPGGGPGARRRQRGRLMRRLVR 
RLRRWGLLPRTNTPARASEARSQVTPSAAPLEALDGGTGPAREGGAVGGQDGEQAPPLPIKA 
PLPSASTSPAPTTVPEAPGPLPSLPLEPSLLSGWQALRGRLLPSLGPPGPTRSPPGPHTAV 
LALEDEDDVLLVPLAEPGVWVAEAEDE PLLT 

Important features: 
Signal peptide: 

amino acids 1-16 

Transinenbrane domain: 
amino acids 442-462 

LDL-receptor class A (LDLRA) domain proteins 
amino acids 411-431, 152-171, 331-350 and 374-393 
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CGAGCTGGGCGAGAAGTAGGGGAGGGCGGTGCTCCGCCGCGGTGGCGGTTGCTATCGCTTCG 
CAGAACCTACTCAGGCAGCCAGCTGAGAAGAGTTGAGGGAAAGTGCTGCTGCTGGGTCTGCA 
GACGCGAIfiGATAACGTGCAGCCGAAAATAAAACATCGCCCCTTCTGCTTCAGTGTGAAAGG 
CCACGTGAAGATGCTGCGGCTGGCACTAACTGTGACATCTATGACCTTTTTTATCATCGCAC 
AAGCCCCTGAACCATATATTGTTATCACTGGATTTGAAGTCACCGTTATCTTATTTTTCATA 
CTTTTATATGTACTCAGACTTGATCGATTAATGAAGTGGTTATTTTGGCCTTTGCTTGATAT 
TATCAACTCACTGGTAACAACAGTATTCATGCTCATCGTATCTGTGTTGGCACTGATACCAG 
AAACCACAACATTGACAGTTGGTGGAGGGGTGTTTGCACTTGTGACAGCAGTATGCTGTCTT 
GCCGACGGGGCCCTTATTTACCGGAAGCTTCTGTTCAATCCCAGCGGTCCTTACCAGAAAAA 
GCCTGTGCATGAAAAAAAAGAAGTTTTGIMTTTTATATTACTTTTTAGTTTGATACTAAGT 
ATTAAACATATTTCTGTATTCTTCCAAAAAAAAAAAAAAAAAA 
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FIGURE 70 



></usr/seqdb2/sst/DNA/Dnaseqs.niin/ss.DNA49645 
xsubunit 1 of 1, 152 aa, 1 stop 
XMW: 17170, pi: 9.62, NX(S/T): 1 

MDNVQPKIKHRPFCFSVKGHVKMLRLALTVTSMTFFIIAQAPEPYIVITGFEVTVILFFILL 
YVLRLDRLMKWLFWPLLDIINSLVTTVFMLIVSVLALIPETTTLTVGGGVFALVTAVCCLAD 
GAL I YRKLL FNPSG P YQKKP VHEKKEVL 

Important features : 

Potential type II transmembrane domain: 

amino acids 2 6-42 

Other potential transmembrane domain: 
amino acids 44-65, 81-101 and 109-129 

Leucine zipper pattern 
amino acids 78-99 and 85-106 

N-myristoylation site. 

amino acids 110-115 

Ribonucleotide reductase large subunit protein 
amino acids 116-127 
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FIGURE 71 



GGGCGAGAAGTAGGGGAGGGCGTGTTCCGCCGCGGTGGCGGTTGCTATCGTTTTGCAGAACC 
TACTCAGGCAGCCAGNTGAGAAGAGTTGAGGGAAAGTGCTGCTGCTGGGTCTGCAGACGCGA 
TGGATAACGTGCAGCCGAAAATAAAACATCGCCCCTTCTGCTTCAGTGTGAAAGGCCACGTG 
AAGATGCTGCGGCTGGCACTAACTGNGACATCTATGACCTTTTTTATNATCGCACAAGCCCC 
TGAACCATATATTGTTATCACTGGATTTGAAGTCACCGTTATCTTATTTTTCATACTTTTAT 
ATGTACTCAGACTTGATCGATTAATGAAGTGGTTATTTTGGCCTTTGCTTGATATTATCAAC 
TCACTGGTAACAACAGTATTCATGCTCATCGTATCTGTGTTGGCACTGATACCAGAAACCAC 
AACATTGACAGTTGGTGGAGGGGTGTTTGCACTTGTGAO^GCAGTATGCTGTNTTGCCGAC 
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FIGURE 72 

CAGCCCCGCGCGCCGGCCGAGTCGCTGAGCCGCGGCTGCCGGACGGGACGGGACCGGCTAGG 

CTGGGCGCGCCCCCCGGGCCCCGCCGTGGGCMSGGCGCACTGGCCCGGGCGCTGCTGCTGC 

CTCTGCTGGCCCAGTGGCTCCTGCGCGCCGCCCCGGAGCTGGCCCCCGCGCCCTTCACGCTG 

CCCCTCCGGGTGGCCGCGGCCACGAACCGCGTAGTTGCGCCCACCCCGGGACCCGGGACCCC 

TGCCGAGCGCCACGCCGACGGCTTGGCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGG 

GCGCCGCCAACTTCTTGGCCATGGTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTAC 

CTGGAGATGCTGATCGGGACCCCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAG 

TAACTTTGCCGTGGCAGGAACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGT 

CTAGCACATACCGCTCCAAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACG 

GGCTTCGTTGGGGAAGACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAA 

CATTGCCACTATTTTTGAATCAGAGAATTTCTTTTTGCCTGGGATTAAATGGAATGGAATAC 

TTGGCCTAGCTTATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCC 

CTGGTGACACAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCC 

CGTTGCTGGATCTGGGACCAACGGAGGTAGTCTTGTCTTGGGTGGAATTGAACCAAGTTTGT 

ATAAAGGAGACATCTGGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTG 

AAATTGGAAATTGGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGC 

CATCGTGGACAGTGGCACCACGCTGCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTGG 

AAGCTGTGGCCCGCGCATCTCTGATTCCAGAATTCTCTGATGGTTTCTGGACTGGGTCCCAG 

CTGGCGTGCTGGACGAATTCGGAAACACCTTGGTCTTACTTCCCTAAAATCTCCATCTACCT 

GAGAGACGAGAACTCCAGCAGGTCATTCCGTATCACAATCCTGCCTCAGCTTTACATTCAGC 

CCATGATGGGGGCCGGCCTGAATTATGAATGTTACCGATTCGGCATTTCCCCATCCACAAAT 

GCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTTCTACGTCATCTTCGACAGAGCCCAGAA 

GAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCAGTGTCTGAAATTT 

CCGGGCCTTTCTCAACAGAGGATGTAGCCAGCAACTGTGTCCCCGCTCAGTCTTTGAGCGAG 

CCCATTTTGTGGATTGTGTCCTATGCGCTCATGAGCGTCTGTGGAGCCATCCTCCTTGTCTT 

AATCGTCCTGCTGCTGCTGCCGTTCCGGTGTCAGCGTCGCCCCCGTGACCCTGAGGTCGTCA 

ATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATSAATAGCCAGGCCTGACCTCAAGCAA 

CCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGCAGCAGCCGGGATCGATGGTGGCG 

CTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCTGCTCCCAGATGCCTTCTAGATTCAC 

TGTCTTTTGATTCTTGATTTTCAAGCTTTCAAATCCTCCCTACTTCCAAGAAAAATAATTAA 
AAAAAAAACTTCATTCTAA 
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FIGURE 73 



></usr/seqdb2/sst/DNA/Dnaseqs.min/ss .DNA45493 
xsubunit 1 of 1, 518 aa, 1 step 
XMW: 56180, pi: 5.08, NX(S/T): 2 

MGALARALLLPLLAQWLLRAAPEIAPAPFTLPLRVAAATNRWAPTPGPGTPAERHADGI^ 
ALEPALASPAGAANFLAMVDNLQGDSGRGYYLEMLIGTPPQKLQILVDTGSSNFAVAGTPHS 
YIDTYFDTERSSTYRSKGFDVTVKYTQGSWTGFVGEDLVTIPKGFNTSFLVNIATIFESENF 
FLPGIKWNGILGLAYATLAKPSSSLETFFDSLVTQANIPNVFSMQMCGAGLPVAGSGTNGGS 
LVLGGIEPSLYKGDIWYTPIKEEWYYQIEILKLEIGGQSLNLDCREYNADKAIVDSGTTLLR 
LPQKVFDAWEAVARASLIPEFSDGFWTGSQLACWTNSETPWSYFPKISIYLRDENSSRSFR 
ITILPQLYIQPMMGAGLNYECYRFGISPSTNALVIGATVMEGFYVIFDRAQKRVGFAASPCA 
EIAGAAVSEISGPFSTEDVASNCVPAQSLSEPILWIVSYALMSVCGAILLVLIVLLLLPFRC 
QRRPRDPEWNDESSLVRHRWK 

Important features: 
Signal peptide: 
amino acids 1-20 

Transmembrane domain: 
amino acids 466-494 

N-glycosylation sites. 

amino acids 170-173 and 366-369 

Leucine zipper pattern, 
amino acids 10-31 and 197-118 

Eukaryotic and viral aspartyl proteases 
amino acids 109-118, 252-261 and 298-310 
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FIGURE 74 

CGCCTCCGCCTTCGGAGGCTGACGCGCCCGGGCGCCGTTCCAGGCCTGTGCAGGGCGGATCG 
GCAGCCGCCTGGCGGCGATCCAGGGCGGTGCGGGGCCTGGGCGGGAGCCGGGAGGCGCGGCC 
GGCMSGAGGCGCTGCTGCTGGGCGCGGGGTTGCTGCTGGGCGCTTACGTGCTTGTCTACTA 
CAACCTGGTGAAGGCCCCGCCGTGCGGCGGCATGGGCAACCTGCGGGGCCGCACGGCCGTGG 
TCACGGGCGCCAACAGCGGCATCGGAAAGATGACGGCGCTGGAGCTGGCGCGCCGGGGAGCG 
CGCGTGGTGCTGGCCTGCCGCAGCCAGGAGCGCGGGGAGGCGGCTGCCTTCGACCTCCGCCA 
GGAGAGTGGGAACAATGAGGTCATCTTCATGGCCTTGGACTTGGCCAGTCTGGCCTCGGTGC 
GGGCCTTTGCCACTGCCTTTCTGAGCTCTGAGCCACGGTTGGACATCCTCATCCACAATGCC 
GGTATCAGTTCCTGTGGCCGGACCCGTGAGGCGTTTAACCTGCTGCTTCGGGTGAACCATAT 
CGGTCCCTTTCTGCTGACACATCTGCTGCTGCCTTGCCTGAAGGCATGTGCCCCTAGCCGCG 
TGGTGGTGGTAGCCTCAGCTGCCCACTGTCGGGGACGTCTTGACTTCAAACGCCTGGACCGC 
CCAGTGGTGGGCTGGCGGCAGGAGCTGCGGGCATATGCTGACACTAAGCTGGCTAATGTACT 
GTTTGCCCGGGAGCTCGCCAACCAGCTTGAGGCCACTGGCGTCACCTGCTATGCAGCCCACC 
CAGGGCCTGTGAACTCGGAGCTGTTCCTGCGCCATGTTCCTGGATGGCTGCGCCCACTTTTG 
CGCCCATTGGCTTGGCTGGTGCTCCGGGCACCAAGAGGGGGTGCCCAGACACCCCTGTATTG 
TGCTCTACAAGAGGGCATCGAGCCCCTCAGTGGGAGATATTTTGCCAACTGCCATGTGGAAG 
AGGTGCCTCCAGCTGCCCGAGACGACCGGGCAGCCCATCGGCTATGGGAGGCCAGCAAGAGG 
CTGGCAGGGCTTGGGCCTGGGGAGGATGCTGAACCCGATGAAGACCCCCAGTCTGAGGACTC 
AGAGGCCCCATCTTCTCTAAGCACCCCCCACCCTGAGGAGCCCACAGTTTCTCAACCTTACC 
CCAGCCCTCAGAGCTCACCAGATTTGTCTAAGATGACGCACCGAATTCAGGCTAAAGTTGAG 
CCTGAGATCCAGCTCTCCIMCCCTCAGGCCAGGATGCTTGCCATGGCACTTCATGGTCCTT 
GAAAACCTCGGATGTGTGTGAGGCCATGCCCTGGACACTGACGGGTTTGTGATCTTGACCTC 
CGTGGTTACTTTCTGGGGCCCCAAGCTGTGCCCTGGACATCTCTTTTCCTGGTTGAAGGAAT 
AATGGGTGATTATTTCTTCCTGAGAGTGACAGTAACCCCAGATGGAGAGATAGGGGTATGCT 
AGACACTGTGCTTCTCGGAAATTTGGATGTAGTATTTTCAGGCCCCACCCTTATTGATTCTG 
ATCAGCTCTGGAGCAGAGGCAGGGAGTTTGCAATGTGATGCACTGCCAACATTGAGAATTAG 
TGAACTGATCCCTTTGCAACCGTCTAGCTAGGTAGTTAAATTACCCCCATGTTAATGAAGCG 
GAATTAGGCTCCCGAGCTAAGGGACTCGCCTAGGGTCTCACAGTGAGTAGGAGGAGGGCCTG 
GGATCTGAACCCAAGGGTCTGAGGCCAGGGCCGACTGCCGTAAGATGGGTGCTGAGAAGTGA 
GTCAGGGCAGGGCAGCTGGTATCGAGGTGCCCCATGGGAGTAAGGGGACGCCTTCCGGGCGG 
ATGCAGGGCTGGGGTCATCTGTATCTGAAGCCCCTCGGAATAAAGCGCGTTGACCGCCAAAA 
AAAAAAAAAAAAAAAAA 
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FIGURE 75 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA48227 
<subunit 1 of i, 377 aa, 1 stop 
<MW: 40849. pi: 7,98, NX(S/T): 0 

MEALLLGAGLLLGAYVLVYYNLVKAPPCGGMGNLRGRTAWTGANSGIGKMTALELARRGAR 
WIJICRSQERGEAAAFDLRQESGNNEVIFMALDLASIASVRAFATAFLSSEPRLDILIHNAG 
ISSCGRTREAFNLLLRVNHIGPFLLTHLLLPCLKACAPSRWWASAAHCRGRLDFKRLDRP 
WGWRQELRAYADTKLANVLFARELANQLEATGVTCYAAHPGPVNSELFLRHVPGWLRPLLR 
PLAWLVLRAPRGGAQTPLYCALQEGIEPLSGRYFANCHVEEVPPAARDDRAAHRLWEASKRL 
AGLGPGEDAEPDEDPQSEDSEAPSSLSTPHPEEPTVSQPYPSPQSSPDLSKMTHRIQAKVEP 

EIQLS 

Important features: 
Signal peptide: 
amino acids 1-16 

Glycosaminoglycan attachment site, 
amino acids 46-49 

Short-chain alcohol dehydrogenase family 
amino acids 37-49 and 114-124 
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FIGURE 7fiA 

GGAGGAGACAGCCTCCTGGGGGGCAGGGGTTCCCTGCCTCTGCTGC7CCTGCTCATCA15GG 
AGGCATGGCTCAGGACTCCCCGCCCCAGATCCTAGTCCACCCCCAGGACCAGCTGTTCCAGG 
GCCCTGGCCCTGCCAGGATGAGCTGCCAAGCCTCAGGCCAGCCACCTCCCACCATCCGCTGG 
TTGCTGAATGGGCAGCCCCTGAGCATGGTGCCCCCAGACCCACACCACCTCCTGCCTGATGG 
GACCCTTCTGCTGCTACAGCCCCCTGCCCGGGGACATGCCCACGATGGCCAGGCCCTGTCCA 
CAGACCTGGGTGTCTACACATGTGAGGCCAGCAACCGGCTTGGCACGGCAGTCAGCAGAGGC 
GCTCGGCTGTCTGTGGCTGTCCTCCGGGAGGATTTCCAGATCCAGCCTCGGGACATGGTGGC 
TGTGGTGGGTGAGCAGTTTACTCTGGAATGTGGGCCGCCCTGGGGCCACCCAGAGCCCACAG 
TCTCATGGTGGAAAGATGGGAAACCCCTGCCCCTCCAGCCCGGAAGGCACACAGTGTCCGGG 
GGGTCCCTGCTGATGGCAAGAGCAGAGAAGAGTGACGAAGGGACCTACATGTGTGTGGCCAC 
CAACAGCGCAGGACATAGGGAGAGCCGCGCAGCCCGGGTTTCCATCCAGGAGCCCCAGGACT 
ACACGGAGCCTGTGGAGCTTCTGGCTGTGCGAATTCAGCTGGAAAATGTGACACTGCTGAAC 
CCGGATCCTGCAGAGGGCCCCAAGCCTAGACCGGCGGTGTGGCTCAGCTGGAAGGTCAGTGG 
CCCTGCTGCGCCTGCCCAATCTTACACGGCCTTGTTCAGGACCCAGACTGCCCCGGGAGGCC 
AGGGAGCTCCGTGGGCAGAGGAGCTGCTGGCCGGCTGGCAGAGCGCAGAGCTTGGAGGCCTC 
CACTGGGGCCAAGACTACGAGTTCAAAGTGAGACCATCCTCTGGCCGGGCTCGAGGCCCTGA 
CAGCAACGTGCTGCTCCTGAGGCTGCCGGAAAAAGTGCCCAGTGCCCCACCTCAGGAAGTGA 
CTCTAAAGCCTGGCAATGGCACTGTCTTTGTGAGCTGGGTCCCACCACCTGCTGAAAACCAC 
AATGGCATCATCCGTGGCTACCAGGTCTGGAGCCTGGGCAACACATCACTGCCACCAGCCAA 
CTGGACTGTAGTTGGTGAGCAGACCCAGCTGGAAATCGCCACCCATATGCCAGGCTCCTACT 
GCGTGCAAGTGGCTGCAGTCACTGGTGCTGGAGCTGGGGAGCCCAGTAGACCTGTCTGCCTC 
CTTTTAGAGCAGGCCATGGAGCGAGCCACCCAAGAACCCAGTGAGCATGGTCCCTGGACCCT 
GGAGCAGCTGAGGGCTACCTTGAAGCGGCCTGAGGTCATTGCCACCTGCGGTGTTGCACTCT 
GGCTGCTGCTTCTGGGCACCGCCGTGTGTATCCACCGCCGGCGCCGAGCTAGGGTGCACCTG 
GGCCCAGGTCTGTACAGATATACCAGTGAGGATGCCATCCTAAAACACAGGATGGATCACAG 
TGACTCCCAGTGGTTGGCAGACACTTGGCGTTCCACCTCTGGCTCTCGGGACCTGAGCAGCA 
GCAGCAGCCTCAGCAGTCGGCTGGGGGCGGATGCCCGGGACCCACTAGACTGTCGTCGCTCC 
TTGCTCTCCTGGGACTCCCGAAGCCCCGGCGTGCCCCTGCTTCCAGACACCAGCACTTTTTA 
TGGCTCCCTCATCGCTGAGCTGCCCTCCAGTACCCCAGCCAGGCCAAGTCCCCAGGTCCCAG 
CTGTCAGGCGCCTCCCACCCCAGCTGGCCCAGCTCTCCAGCCCCTGTTCCAGCTCAGACAGC 
CTCTGCAGCCGCAGGGGACTCTCTTCTCCCCGCTTGTCTCTGGCCCCTGCAGAGGCTTGGAA 
GGCCAAAAAGAAGCAGGAGCTGCAGCATGCCAACAGTTCCCCACTGCTCCGGGGCAGCCACT 
CCTTGGAGCTCCGGGCCTGTGAGTTAGGAAATAGAGGTTCCAAGAACCrTTCCCAAAGCCCA 
GGAGCTGTGCCCCAAGCTCTGGTTGCCTGGCGGGCCCTGGGACCGAAACTCCTCAGCTCCTC 
AAATGAGCTGGTTACTCGTCATCTCCCTCCAGCACCCCTCTTTCCTCATGAAACTCCCCCAA 
CTCAGAGTCAACAGACCCAGCCTCCGGTGGCACCACAGGCTCCCTCCTCCATCCTGCTGCCA 
GCAGCCCCCATCCCCATCCTTAGCCCCTGCAGTCCCCCTAGCCCCCAGGCCTCTTCCCTCTC 
TGGCCCCAGCCCAGCTTCCAGTCGCCTGTCCAGCTCCTCACTGTCATCCCTGGGGGAGGATC 
AAGACAGCGTGCTGACCCCTGAGGAGGTAGCCCTGTGCTTGGAACTCAGTGAGGGTGAGGAG 
ACTCCCAGGAACAGCGTCTCTCCCATGCCAAGGGCTCCTTCACCCCCCACCACCTATGGGTA 
CATCAGCGTCCCAACAGCCTCAGAGTTCACGGACATGGGCAGGACTGGAGGAGGGGTGGGGC 
CCAAGGGGGGAGTCTTGCTGTGCCCACCTCGGCCCTGCCTCACCCCCACCCCCAGCGAGGGC 
TCCTTAGCCAATGGTTGGGGCTCAGCCTCTGAGGACAATGCCGCCAGCGCCAGAGCCAGCCT 
TGTCAGCTCCTCCGATGGCTCCTTCCTCGCTGATGCTCACTTTGCCCGGGCCCTGGCAGTGG 
CTGTGGATAGCTTTGGTTTCGGTCTAGAGCCCAGGGAGGCAGACTGCGTCTTCATAGATGCC 
TCATCACCTCCCTCCCCACGGGATGAGATCTTCCTGACCCCCAACCTCTCCCTGCCCCTGTG 
GGAGTGGAGGCCAGACTGGTTGGAAGACATGGAGGTCAGCCACACCCAGCGGCTGGGAAGGG 
GGATGCCTCCCTGGCCCCCTGACTCTCAGATCTCTTCCCAGAGAAGTCAGCTCCACTGTCGT 
ATGCCCAAGGCTGGTGCTTCTCCTGTAGATTACTCClSaACCGTGTCCCTGAGACTTCCCAG 
ACGGGAATCAGAACCACTTCTCCTGTCCACCCACAAGACCTGGGCTGTGGTGTGTGGGTCTT 
GGCCTGTGTTTCTCTGCAGCTGGGGTCCACCTTCCCAAGCCTCCAGAGAGTTCTCCCTCCAC 
GATTGTGAAAACAAATGAAAA.CAA.2iJi.TTAGAGCAAAGCTGACCTGGAGCCCTCAGGGAGCAA 
AACATCATCTCCACCTGACTCCTAGCCACTGCTTTCTCCTCTGTGCCATCCACTCCCACCAC 
CAGGTTGTTTTGGCCTGAGGAGCAGCCCTGCCTGCTGCTCTTCCCCCACCATTTGGATCACA 
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FIGURE 76B 

GGAAGTGGAGGAGCCAGAGGTGCCTTTGTGGAGGACAGCAGTGGCTGCTGGGAGAGGGCTGT 
GGAGGAAGGAGCTTCTCGGAGCCCCCTCTCAGCCTTACCTGGGCCCCTCCTCTAGAGAAGAG 
CTCAACTCTCTCCCAACCTCACCATGGAAAGAAAATAATTATGAATGCCACTGAGGCACTGA 
GGCCCTACCTCATGCCAAACAAAGGGTTCAAGGCTGGGTCTAGCGAGGATGCTGAAGGAAGG 
GAGGTATGAGACCGTAGGTCAAAAGCACCATCCTCGTACTGTTGTCACTATGAGCTTAAGAA 
ATTTGATACCATAAAATGGTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 77 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA41404 
<subunit 1 of 1, 985 aa, 1 stop 
<MW: 105336, pi: 6.55, NX(S/T): 7 

MGGMAQDSPPQILVHPQDQLFQGPGPARMSCQASGQPPPTIRWLLNGQPLSMVPPDPHHLLP 
DGTLLLLQPPARGHAHDGQALSTDLGVYTCEASNRLGTAVSRGARLSVAVLREDFQIQPRDM 
VAWGEQFTLECGPPWGHPEPTVSWWKDGKPLALQPGRHTVSGGSLLMARAEKSDEGTYMCV 
ATNSAGHRESRAARVSIQEPQDYTEPVELLAVRIQLENVTLLNPDPAEGPKPRPAVWLSWKV 
SGPAAPAQSYTALFRTQTAPGGQGAPWAEELLAGWQSAELGGLHWGQDYEETCVRPSSGRARG 
PDSNVLLLRLPEKVPSAPPQEVTLKPGNGTVFVSWVPPPAENHNGIIRGYQVWSLGNTSLPP 
ANWTWGEQTQLEIATHMPGSYCVQVAAVTGAGAGEPSRPVCLLLEQAMERATQEPSEHGPW 
TLEQLRATLKRPEVIATCGVALWLLLLGTAVCIHRRRRARVHLGPGLYRYTSEDAILKHRMD 
HSDSQWLADTWRSTSGSRDLSSSSSLSSRLGADARDPLDCRRSLLSWDSRSPGVPLLPDTST 
FYGSLIAELPSSTPARPSPQVPAVRRLPPQLAQLSSPCSSSDSLCSRRGLSSPRLSLAPAEA 
WKAKKKQELQHANSSPLLRGSHSLELRACELGNRGSKNLSQSPGAVPQALVAWRALGPKLLS 
SSNELVTRHLPPAPLFPHETPPTQSQQTQPPVAPQAPSSILLPAAPIPILSPCSPPSPQASS 
LSGPSPASSRLSSSSLSSLGEDQDSVLTPEEVALCLELSEGEETPRNSVSPMPRAPSPPTTY 
GYISVPTASEFTDMGRTGGGVGPKGGVLLCPPRPCLTPTPSEGSLANGWGSASEDNAASARA 
S LVS S S DGS FLADAH FARALAVAVDS FG FGLE PREADCVFI DAS S PPS PRDE I FLTPNLS LP 

LWEWRPDWLEDMEVSHTQRLGRGMPPWPPDSQISSQRSQLHCRMPKAGASPVDYS 



Important features: 
Transmembrane domain: 
amino acids 448-467 



N-glycosylation sites: 

amino acids 224-227, 338-341, 367-370, 374-377, 658-661 and 926- 
929 

N-myristoylation sites. 

amino acids 47-52, 80-85, 88-93, 99-104, 105-110, 181-186, 272- 
277, 290-295, 355-360, 403-408, 462-467, 561-566, 652-657, 849- 
854 and 876-881 

Phospho tyrosine interaction domain protexns 
amino acids 740-753 
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FIGURE 78 

CTCCCACGGTGTCCAGCGCCCAGMISCGGCTTCTGGTCCTGCTATGGGGTTGCCTGCTGCT 

CCCAGGTTATGAAGCCCTGGAGGGCCCAGAGGAAATCAGCGGGTTCGAAGGGGACACTGTGT 

CCCTGCAGTGCACCTACAGGGAAGAGCTGAGGGACCACCGGAAGTACTGGTGCAGGAAGGGT 

GGGATCCTCTTCTCTCGCTGCTCTGGCACCATCTATGCAGAAGAAGAAGGCCAGGAGACAAT 

GAAGGGCAGGGTGTCCATCCGTGACAGCCGCCAGGAGCTCTCGCTCATTGTGACCCTGTGGA 

ACCTCACCCTGCAAGACGCTGGGGAGTACTGGTGTGGGGTCGAAAAACGGGGCCCCGATGAG 

TCTTTACTGATCTCTCTGTTCGTCTTTCCAGGACCCTGCTGTCCTCCCTCCCCTTCTCCCAC 

CTTCCAGCCTCTGGCTACAACACGCCTGCAGCCCAAGGCAAAAGCTCAGCAAACCCAGCCCC 

CAGGATTGACTTCTCCTGGGCTCTACCCGGCAGCCACCACAGCCAAGCAGGGGAAGACAGGG 

GCTGAGGCCCCTCCATTGCCAGGGACTTCCCAGTACGGGCACGAAAGGACTTCTCAGTACAC 

AGGAACCTCTCCTCACCCAGCGACCTCTCCTCCTGCAGGGAGCTCCCGCCCCCCCATGCAGC 

TGGACTCCACCTCAGCAGAGGACACCAGTCCAGCTCTCAGCAGTGGCAGCTCTAAGCCCAGG 

GTGTCCATCCCGATGGTCCGCATACTGGCCCCAGTCCTGGTGCTGCTGAGCCTTCTGTCAGC 

CGCAGGCCTGATCGCCTTCTGCAGCCACCTGCTCCTGTGGAGAAAGGAAGCTCAACAGGCCA 

CGGAGACACAGAGGAACGAGAAGTTCTGGCTCTCACGCTTGACTGCGGAGGAAAAGGAAGCC 

CCTTCCCAGGCCCCTGAGGGGGACGTGATCTCGATGCCTCCCCTCCACACATCTGAGGAGGA 

GCTGGGCTTCTCGAAGTTTGTCTCAGCGl^GGCAGGAGGCCCTCCTGGCCAGGCCAGCAGT 

GAAGCAGTATGGCTGGCTGGATCAGCACCGATTCCCGAAAGCTTTCCACCTCAGCCTCAGAG 

TCCAGCTGCCCGGACTCCAGGGCTCTCCCCACCCTCCCCAGGCTCTCCTCTTGCATGTTCCA 

GCCTGACCTAGAAGCGTTTGTCAGCCCTGGAGCCCAGAGCGGTGGCCTTGCTCTTCCGGCTG 

GAGACTGGGACATCCCTGATAGGTTCACATCCCTGGGCAGAGTACCAGGCTGCTGACCCTCA 

GCAGGGCCAGACAAGGCTCAGTGGATCTGGTCTGAGTTTCAATCTGCCAGGAACTCCTGGGC 

CTCATGCCCAGTGTCGGACCCTGCCTTCCTCCCACTCCAGACCCCACCTTGTCTTCCCTCCC 

TGGCGTCCTCAGACTTAGTCCCACGGTCTCCTGCATCAGCTGGTGATGAAGAGGAGCATGCT 

GGGGTGAGACTGGGATTCTGGCTTCTCTTTGAACCACCTGCATCCAGCCCTTCAGGAAGCCT 

GTGAAAAACGTGATTCCTGGCCCCACCAAGACCCACCAAAACCATCTCTGGGCTTGGTGCAG 

GACTCTGAATTCTAACAATGCCCAGTGACTGTCGCACTTGAGTTTGAGGGCCAGTGGGCCTG 

ATGAACGCTCACACCCCTTCAGCTTAGAGTCTGCATTTGGGCTGTGACGTCTCCACCTGCCC 

CAATAGATCTGCTCTGTCTGCGACACCAGATCCACGTGGGGACTCCCCTGAGGCCTGCTAAG 

TCCAGGCCTTGGTCAGGTCAGGTGCACATTGCAGGATAAGCCCAGGACCGGCACAGAAGTGG 

TTGCCTTTNCCATTTGCCCTCCCTGGNCCATGCCTTCTTGCCTTTGGAAAAAATGATGAAGA 

AAACCTTGGCTCCTTCCTTGTCTGGAAAGGGTTACTTGCCTATGGGTTCTGGTGGCTAGAGA 

GAAAAGTAGAAAACCAGAGTGCACGTAGGTGTCTAACACAGAGGAGAGTAGGAACAGGGCGG 

ATACCTGAAGGTGACTCCGAGTCCAGCCCCCTGGAGAAGGGGTCGGGGGTGGTGGTAAAGTA 

GCACAACTACTATTTTTTTTCTTTTTCCATTATTATTGTTTTTTAAGACAGAATCTCGTGCT 

GCTGCCCAGGCTGGAGTGCAGTGGCACGATCTGCAAACTCCGCCTCCTGGGTTCAAGTGATT 

CTTCTGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCACGCACCACCACACCTGGCTAATT 

TTTGTACTTTTAGTAGAGATGGGGTTTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTGAC 

CTCAAATGAGCCTCCTGCTTCAGTCTCCCAAATTGCCGGGATTACAGGCATGAGCCACTGTG 

TCTGGCCCTATTTCCTTTAAAAAGTGAAATTAAGAGTTGTTCAGTATGCAAAACTTGGAAAG 

ATGGAGGAGAAAAAGAAAAGGAAGAAAAAAATGTCACCCATAGTCTCACCAGAGACTATCAT 

TATTTCGTTTTGTTGTACTTCCTTCCACTCTTTTCTTCTTCACATAATTTGCCGGTGTTCTT 

TTTACAGAGCAATTATCTTGTATATACAACTTTGTATCCTGCCTTTTCCACCTTATCGTTCC 

ATCACTTTATTCCAGCACTTCTCTGTGTTTTACAGACCTTTTTATAAATAAAATGTTCATCA 

GCTGCATAAAAAAAAAAAT^ 
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FIGURE 79 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA44196 
<subunit 1 of 1, 332 aa, 1 stop 
<MW: 36143, pi: 5,89, NX(S/T): 1 

MRLLVLLWGCLLLPGYEALEGPEEISGFEGDTVSLQCTYREELRDHRKYWCRKGGILFSRCS 
GTIYAEEEGQETMKGRVSIRDSRQELSLIVTLWNLTLQDAGEYWCGVEKRGPDESLLISLFV 
FPGPCCPPSPSPTFQPLATTRLQPKAKAQQTQPPGLTSPGLYPAATTAKQGKTGAEAPPLPG 
TSQYGHERTSQYTGTSPHPATSPPAGSSRPPMQLDSTSAEDTSPALSSGSSKPRVSIPMVRI 

LAPVLVLLSLLSAAGLIAFCSHLLLWRKEAQQATETQRNEKFWLSRLTAEEKEAPSQAPEGD 
VISMPPLHTSEEELGFSKFVSA 

Important features: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 
amino acids 248-269 

N-glycosylation site, 
amino acids 96-99 

Fibrinogen beta and gamma chains C-termmal domain, 
amino acids 104-113 

Ig like V-type domain: 
amino acids 13-128 




BNSDOCID- <WO 99462eiA2.l.> 
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TTGTGACTAAAAGCTGGCCTAGCAGGCCAGGGAGTGCAGCTGCAGGCGTGGGGGTGGCAGGA 

GCCGCAGAGCCAGAGCAGACAGCCGAGAAACAGGTGGACAGTGTGAAAGAACCAGTGGTCTC 

GCTCTGTTGCCCAGGCTAGAGTGTACTGGCGTGATCATAGCTCACTGCAGCCTCAGACTCCT 

GGACTTGAGAAATCCTCCTGCCTTAGCCTCCTGCATATCTGGGACTCCAGGGGTGCACTCAA 

GCCCTGTTTCTTCTCCTTCTGTGAGTGGACCACGGAGGCTGGTGAGCTGCCTGTCATCCCAA 

AGCTCAGCTCTGAGCCAGAGTGGTGGTGGCTCCACCTCTGCCGCCGGCATAGAAGCCAGGAG 

CAGGGCTCTCAGAAGGCGGTGGTGCCCAGCTGGGATCAISTTGTTGGCCCTGGTCTGTCTGC 

TCAGCTGCCTGCTACCCTCCAGTGAGGCCAAGCTCTACGGTCGTTGTGAACTGGCCAGAGTG 

CTACATGACTTCGGGCTGGACGGATACCGGGGATACAGCCTGGCTGACTGGGTCTGCCTTGC 

TTATTTCACAAGCGGTTTCAACGCAGCTGCTTTGGACTACGAGGCTGATGGGAGCACCAACA 

ACGGGATCTTCCAGATCAACAGCCGGAGGTGGTGCAGCAACCTCACCCCGAACGTCCCCAAC 

GTGTGCCGGATGTACTGCTCAGATTTGTTGAATCCTAATCTCAAGGATACCGTTATCTGTGC 

CATGAAGATAACCCAAGAGCCTCAGGGTCTGGGTTACTGGGAGGCCTGGAGGCATCACTGCC 

AGGGAAAAGACCTCACTGAATGGGTGGATGGCTGTGACTTCiaGGATGGACGGAACCATGCA 

CAGCAGGCTGGGAAATGTGGTTTGGTTCCTGACCTAGGCTTGGGAAGACAAGCCAGCGAATA 

AAGGATGGTTGAACGTGAAA 



BNSOOCIO; <W0 994«2eiA2.L> 
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FIGURE 81 

</usr/seqdb2/sst/DNA/Dnaseqs ,min/ss . DNA52187 
<subunit 1 of 1, 146 aa, i stop 
<MW: 16430, pi: 5.05, NX{S/T): 1 

MLIALVCLLSCLLPSSEAKLYGRCELARVLHDFGLDGYRGYSIADm^CIJ\YFTSGFNAAA^ 

YEADGSTNNGIFQINSRRWCSNLTPNVPNVCRMYCSDLLNPNLKDTVICAMKITQEPQGLGY 
WEAWRHHCQGKDLTEWVDGCDF 

Inqjortant features: 
Signal peptide: 
amino acids 1-18 

N-myristoylation site, 
amino acids 67-72 

Homolgous region to Alpha-lactalbumm / lysozyme C proteins, 
amino acids 34-58 (catalytic domain), 111-132 and 66-107 
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FIGURE 82 

AGCCGCTGCCCCGGGCCGGGCGCCCGCGGCGGCACC&lfiAGTCCCCGCTCGTGCCTGCGTTC 

GCTGCGCCTCCTCGTCTTCGCCGTCTTCTCAGCCGCCGCGAGCAACTGGCTGTACCTGGCCA 

AGCTGTCGTCGGTGGGGAGCATCTCAGAGGAGGAGACGTGCGAGAAACTCAAGGGCCTGATC 

CAGAGGCAGGTGCAGATGTGCAAGCGGAACCTGGAAGTCATGGACTCGGTGCGCCGCGGTGC 

CCAGCTGGCCATTGAGGAGTGCCAGTACCAGTTCCGGAACCGGCGCTGGAACTGCTCCACAC 

TCGACTCCTTGCCCGTCTTCGGCAAGGTGGTGACGCAAGGGACTCGGGAGGCGGCCTTCGTG 

TACGCCATCTCTTCGGCAGGTGTGGCCTTTGCAGTGACGCGGGCGTGCAGCAGTGGGGAGCT 

GGAGAAGTGCGGCTGTGACAGGACAGTGCATGGGGTCAGCCCACAGGGCTTCCAGTGGTCAG 

GATGCTCTGACAACATCGCCTACGGTGTGGCCTTCTCACAGTCGTTTGTGGATGTGCGGGAG 

AGAAGCAAGGGGGCCTCGTCCAGCAGAGCCCTCATGAACCTCCACAACAATGAGGCCGGCAG 

GAAGGCCATCCTGACACACATGCGGGTGGAATGCAAGTGCCACGGGGTGTCAGGCTCCTGTG 

AGGTAAAGACGTGCTGGCGAGCCGTGCCGCCCTTCCGCCAGGTGGGTCACGCACTGAAGGAG 

AAGTTTGATGGTGCCACTGAGGTGGAGCCACGCCGCGTGGGCTCCTCCAGGGCACTGGTACC 

ACGCAACGCACAGTTCAAGCCGCACACAGATGAGGACCTGGTGTACTTGGAGCCTAGCCCCG 

ACTTCTGTGAGCAGGACATGCGCAGCGGCGTGCTGGGCACGAGGGGCCGCACATGCAACAAG 

ACGTCCAAGGCCATCGACGGCTGTGAGCTGCTGTGCTGTGGCCGCGGCTTCCACACGGCGCA 

GGTGGAGCTGGCTGAACGCTGCAGCTGCAAATTCCACTGGTGCTGCTTCGTCAAGTGCCGGC 

AGTGCCAGCGGCTCGTGGAGTTGCACACGTGCCGATS&CCGCCTGCCTAGCCCTGCGCCGGC 

AACCACCTAGTGGCCCAGGGAAGGCCGATAATTTAAACAGTCTCCCACCACCTACCCCAAGA 

GATACTGGTTGTATTTTTTGTTCTGGTTTGGTTTTTGGGTCCTCATGTTATTTATTGCCGAA 

ACCAGGCAGGCAACCCCAAGGGCACCAACCAGGGCCTCCCCAAAGCCTGGGCCTTTGTGGCT 

GCCACTGACCAAAGGGACCTTGCTCGTGCCGCTGGCTGCCCGCATGTGGCTGCCACTGACCA 

CTCAGTTGTTATCTGTGTCCGTTTTTCTACTTGCAGACCTAAGGTGGAGTAACAAGGAGTAT 

TACCACCACATGGCTACTGACCGTGTCATCGGGGAAGAGGGGGCCTTATGGCAGGGAAAATA 

GGTACCGACTTGATGGAAGTCACACCCTCTGGAAAAAAGAACTCTTAACTCTCCAGCACACA 

TACACATGGACTCCTGGCAGCTTGAGCCTAGAAGCCATGTCTCTCAAATGCCCTGAGAAAGG 

GAACAAGCAGATACCAGGTCAAGGGCACCAGGTTCATTTCAGCCCTTACATGGACAGCTAGA 

GGTTCGATATCTGTGGGTCCTTCCAGGCAAGAAGAGGGAGATGAGAGCAAGAGACGACTGAA 

GTCCCACCCTAGAACCCAGCCTGCCCCAGCCTGCCCCTGGGAAGAGGAAACTTAACCACTCC 

CCAGACCCACCTAGGCAGGCATATAGGCTGCCATCCTGGACCAGGGATCCCGGCTGTGCCTT 

TGCAGTCATGCCCGAGTCACCTTTCACAGCGCTGTTCCTCCATGAAACTGAAAAACACACAC 

ACACACACACACACACACACACACACACACACACACACGGACACACACACACACCTGCGAGA 

GAGAGGGAGGAAAGGGCTGTGCCTTTGCAGTCATGCCCGAGTCACCTTTCACAGCACTGTTCCTC 



BNStXXID: <W0 9946281 A2J_> 
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FIGURE 83 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA48328 
<subunit 1 of l, 351 aa, 1 stop 
<MW: 39052, pi: 8.97, NX(S/T): 2 

MSPRSCLRSLRLLVFAVFSAAASNWLYLAKLSSVGSISEEETCEKLKGLIQRQVQMCKRNLE 
VMDSVRRGAQLAIEECQYQFRNRRWNCSTLDSLPVFGKWTQGTREAAFVYAISSAGVAFAV 
TRACSSGELEKCGCDRTVHGVSPQGFQWSGCSDNIAYGVAFSQSF7DVRERSKGASSSRALM 
NLHNNEAGRKAILTHMRVECKCHGVSGSCEVKTCWRAVPPFRQVGHALKEKFDGATEVEPRR 

VGSSRALVPRNAQFKPHTDEDLVYLEPSPDFCEQDMRSGVLGTRGRTCNKTSKAIDGCELLC 
CGRGFHTAQVELAERCSCKFHWCCFVKCRQCQRLVELHTCR 

Iinportant features: 
Signal peptide: 
amino acids 1-22 

N-glycosylation sites. 

amino acids 88-91 and 297-300 

Wnt-1 family signature, 
amino acids 206-215 

Homologous region to Wnt-1 family proteins 

amino acids 183-235, 305-350, 97-138, 53-92 and 150 -174 



994628 1A2_L> 
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FIGURE 84 

CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCTGGGTGCCTGCAT 

CGCCAIfiGACACCACCAGGTACAGCAAGTGGGGCGGCAGCTCCGAGGAGGTCCCCGGAGGGC 

CCTGGGGACGCTGGGTGCACTGGAGCAGGAGACCCCTCTTCTTGGCCCTGGCTGTCCTGGTC 

ACCACAGTCCTTTGGGCTGTGATTCTGAGTATCCTATTGTCCAAGGCCTCCACGGAGCGCGC 

GGCGCTGCTTGACGGCCACGACCTGCTGAGGACAAACGCCTCGAAGCAGACGGCGGCGCTGG 

GTGCCCTGAAGGAGGAGGTCGGAGACTGCCACAGCTGCTGCTCGGGGACGCAGGCGCAGCTG 

CAGACCACGCGCGCGGAGCTTGGGGAGGCGCAGGCGAAGCTGATGGAGCAGGAGAGCGCCCT 

GCGGGAACTGCGTGAGCGCGTGACCCAGGGCTTGGCTGAAGCCGGCAGGGGCCGTGAGGACG 

TCCGCACTGAGCTGTTCCGGGCGCTGGAGGCCGTGAGGCTCCAGAACAACTCCTGCGAGCCG 

TGCCCCACGTCGTGGCTGTCCTTCGAGGGCTCCTGCTACTTTTTCTCTGTGCCAAAGACGAC 

GTGGGCGGCGGCGCAGGATCACTGCGCAGATGCCAGCGCGCACCTGGTGATCGTTGGGGGCC 

TGGATGAGCAGGGCTTCCTCACTCGGAACACGCGTGGCCGTGGTTACTGGCTGGGCCTGAGG 

GCTGTGCGCCATCTGGGCAAGGTTCAGGGCTACCAGTGGGTGGACGGAGTCTCTCTCAGCTT 

CAGCCACTGGAACCAGGGAGAGCCCAATGACGCTTGGGGGCGCGAGAACTGTGTCATGATGC 

TGCACACGGGGCTGTGGAACGACGCACCGTGTGACAGCGAGAAGGACGGCTGGATCTGTGAG 

AAAAGGCACAACTGCIS&CCCCGCCCAGTGCCCTGGAGCCGCGCCCATTGCAGCATGTCGTA 

TCCTGGGGGCTGCTCACCTCCCTGGCTCCTGGAGCT6ATTGCCAAAGAGTTTTTTTCTTCCT 

CATCCACCGCTGCTGAGTCTCAGAAACACTTGGCCCAACATAGCCCTGTCCAGCCCAGTGCC 

TGGGCTCTGGGACCTCCATGCCGACCTCATCCTAACTCCACTCACGCAGACCCAACCTAACC 

TCCACTAGCTCCAAAATCCCTGCTCCTGCGTCCCCGTGATATGCCTCCACTTCTCTCCCTAA 

CCAAGGTTAGGTGACTGAGGACTGGAGCTGTTTGGTTTTCTCGCATTTTCCACCAAACTGGA 

AGCTGTTTTTGCAGCCTGAGGAAGCATCAATAAATATTTGAGAAATGAAAAAA 



86/^3? 
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FIGURE SB 

</usr/seqdb2/sst/DNA/Dnaseqs-min/ss.DNA56352 
<subunit 1 of 1, 293 aa, 1 stop 
<MW: 32562, pi: 6.53, NXIS/T): 2 

MDTTRYSKWGGSSEEVPGGPWGRWVHWSRRPLFLMVLVTTVLWAVILSILLSKASTERAA 
LLDGHDLLRTNASKQTAALGALKEEVGDCHSCCSGTQAQLQTTRAELGEAQAKLMEQESALR 
ELRERVTQGLAEAGRGREDVRTELFRALEAVRLQNNSCEPCPTSWLSFEGSCYFFSVPKTTW 

AAAQDHCADASAHLVIVGGLDEQGFLTRNTRGRGYWLGLRAVRHLGKVQGYQWVDGVSLSFS 
HWNQGEPNDAWGRENCVMMLHTGLWNDAPCDSEKDGWICEKRHNC 

Ixiqportant features: 

Type II transmembrane domain: 

amino acids 31-54 

N-glycosylation sites. 

amino acids 73-76 and 159-162 

Leucine zipper pattern, 
amino acids 102-123 

N-myristoylaticn sites, 

amino acids 18-23, 133-138 and 242-247 

C-type lectin domain signature, 
amino acids 264-287 



wo 99/46281 
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GCCAGGGGAAGAGGGTGATCCGACCCGGGGAAGGTCGCTGGGCAGGGCGAGTTGGGAAAGCG 

GCAGCCCCCGCCGCCCCCGCAGCCCCTTCTCCTCCTTTCTCCCACGTCCTATCTGCCTCTCG 

CTGGAGGCCAGGCC.GTGCAGCATCGAAGACAGGAGGAACTGGAGCCTCATTGGCCGGCCCGG 

GGCGCCGGCCTCGGGCTTAAATAGGAGCTCCGGGCTCTGGCTGGGACCCGACCGCTGCCGGC 

CGCGCTCCCGCTGCTCCTGCCGGGTGAISGAAAACCCCAGCCCGGCCGCCGCCCTGGGCAAG 

GCCCTCTGCGCTCTCCTCCTGGCCACTCTCGGCGCCGCCGGCCAGCCTCTTGGGGGAGAGTC 

CATCTGTTCCGCCAGAGCCCCGGCCAAATACAGCATCACCTTCACGGGCAAGTGGAGCCAGA 

CGGCCTTCCCCAAGCAGTACCCCCTGTTCCGCCCCCCTGCGCAGTGGTCTTCGCTGCTGGGG 

GCCGCGCATAGCTCCGACTACAGCATGTGGAGGAAGAACCAGTACGTCAGTAACGGGCTGCG 

CGACTTTGCGGAGCGCGGCGAGGCCTGGGCGCTGATGAAGGAGATCGAGGCGGCGGGGGAGG 

CGCTGCAGAGCGTGCACGAGGTGTTTTCGGCGCCCGCCGTCCCCAGCGGCACCGGGCAGACG 

TCGGCGGAGCTGGAGGTGCAGCGCAGGCACTCGCTGGTCTCGTTTGTGGTGCGCATCGTGCC 

CAGCCCCGACTGGTTCGTGGGCGTGGACAGCCTGGACCTGTGCGACGGGGACCGTTGGCGGG 

AACAGGCGGCGCTGGACCTGTACCCCTACGACGCCGGGACGGACAGCGGCTTCACCTTCTCC 

TCCCCCAACTTCGCCACCATCCCGCAGGACACGGTGACCGAGATAACGTCCTCCTCTCCCAG 

CCACCCGGCCAACTCCTTCTACTACCCGCGGCTGAAGGCCCTGCCTCCCATCGCCAGGGTGA 

CACTGCTGCGGCTGCGACAGAGCCCCAGGGCCTTCATCCCTCCCGCCCCAGTCCTGCCCAGC 

AGGGACAATGAGATTGTAGACAGCGCCTCAGTTCCAGAAACGCCGCTGGACTGCGAGGTCTC 

CCTGTGGTCGTCCTGGGGACTGTGCGGAGGCCACTGTGGGAGGCTCGGGACCAAGAGCAGGA 

CTCGCTACGTCCGGGTCCAGCCCGCCAACAACGGGAGCCCCTGCCCCGAGCTCGAAGAAGAG 

gctgagtgcgtccctgataactgcgtc3:mgaccagagccccgcagcccctggggccccccg 

GAGCCATGGGGTGTCGGGGGCTCCTGTGCAGGCTCATGCTGCAGGCGGCCGAGGGCACAGGG 

ggtttcgcgctgctcctgaccgcggtgaggccgcgccgaccatctctgcactgaagggccct 

CTGGTGGCCGGCACGGGCATTGGGAAACAGCCTCCTCCTTTCCCAACCTTGCTTCTTAGGGG 

cccccgtgtcccgtctgctctcagcctcctcctcctgcaggataaagtcatccccaaggctc 
cagctactctaaattatgtctccttataagttattgctgctccaggagattgtccttcatcg 
tccaggggcctggctcccacgtggttgcagatacctcagacctggtgctctaggctgtgctg 
agcccactctcccgagggcgcatccaagcgggggccacttgagaagtgaataaatggggcgg 
tttcggaagcgtcagtgtttccatgttatggatctctctgcgtttgaataaagactatctct 
gttgctcacaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 
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FIRtJRE 87 



></usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA53971 
xsubunit 1 of 1, 331 aa, 1 stop 
XMW: 35844, pj : 5.45, NX(S/T): 2 

MENPSPAAALGKALCALLLATLGAAGQPLGGESICSARAPAKYSITFTGKWSQTAFPKQYPL 
FRP P AQW S S LLGAAHS S D YSMWRKNQYVSNGLRDFAERGEAWALMKE lEAAGEALQSVHEVF 
SAPAVPSGTGQTSAELEVQRRHSLVSFWRIVPSPDWFVGVDSLDLCDGDRWREQAALDLYP 
YDAGTDSGFTFSSPNFATIPQDTVTEITSSSPSHPANSFYYPRLKALPPIARVTLLRLRQSP 
RAF I P PAPVL P S RDNE I VDS AS VPET PLDCE VS LWSS WGLCGGHCGRLGTKSRTRYVRVQPA 
NNGSPCPELEEEAECVPDNCV 

In\portant features: 
Signal peptide : 
amino acids 1-26 




wo 99/46281 ^ W fCT'<-S«'««» 

FIGURE 88 



GGCGGCGTCCGTGAGGGGCTCCTTTGGGCAGGGGTAGTGTTTGGTGTCCCTGTCTTGCGTGA 

TATTGACAAACTGAAGCTTTCCTGCACCACTGGACTTAAGGAAGAGTGTACTCGTAGGCGGA 

CAGCTTTAGTGGCCGGCCGGCCGCTCTCATCCCCCGTAAGGAGCAGAGTCCTTTGTACTGAC 

CAAGAlfiAGCAACATCTACATCCAGGAGCCTCCCACGAATGGGAAGGTTTTATTGAAAACTA 

CAGCTGGAGATATTGACATAGAGTTGTGGTCCAAAGAAGCTCCTAAAGCTTGCAGAAATTTT 

ATCCAACTTTGTTTGGAAGCTTATTATGACAATACCATTTTTCATAGAGTTGTGCCTGGTTT 

CATAGTCCAAGGCGGAGATCCTACTGGCACAGGGAGTGGTGGAGAGTCTATCTATGGAGCGC 

CATTCAAAGATGAATTTCATTCACGGTTGCGTTTTAATCGGAGAGGACTGGTTGCCATGGCA 

AATGCTGGTTCTCATGATAATGGCAGCCAGTTTTTCTTCACACTGGGTCGAGCAGATGAACT 

TAACAATAAGCATACCATCTTTGGAAAGGTTACAGGGGATACAGTATATAACATGTTGCGAC 

TGTCAGAAGTAGACATTGATGATGACGAAAGACCACATAATCCACACAAAATAAAAAGCTGT 

GAGGTT TTGT T TAATCC TTTTGATGACATCATTCCAAGGGAAATTAAAAGGCTGAAAAAAGA 

GAAACCAGAGGAGGAAGTAAAGAAATTGAAACCCAAAGGCACAAAAAATTTTAGTTTACTTT 

CATTTGGAGAGGAAGCTGAGGAAGAAGAGGAGGAAGTAAATCGAGTTAGTCAGAGCATGAAG 

GGCAAAAGCAAAAGTAGTCATGACTTGCTTAAGGATGATCCACATCTCAGTTCTGTTCCAGT 

TGTAGAAAGTGAAAAAGGTGATGCACCAGATTTAGTTGATGATGGAGAAGATGAAAGTGCAG 

AGCATGATGAATATATTGATGGTGATGAAAAGAACCTGATGAGAGAAAGAATTGCCAAAAAA 

TTAAAAAAGGACACAAGTGCGAATGTTAAATCAGCTGGAGAAGGAGAAGTGGAGAAGAAATC 

'agtcagccgcagtgaagagctcagaaaagaagcaagacaattaaaacgggaactcttagcag 

CAAAACAAAAAAAAGTAGAAAATGCAGCAAAACAAGCAGAAAAAAGAAGTGAAGAGGAAGAA 
GCCCCTCCAGATGGTGCTGTTGCCGAATACAGAAGAGAAAAGCAAAAGTATGAAGCTTTGAG 
GAAGCAACAGTCAAAGAAGGGAACTTCCCGGGAAGATCAGACCCTTGCACTGCTGAACCAGT 
TTT^TCTAAACTCACTCAAGCAATTGCTGAAACACCTGAAAATGACATTCCTGT^CAGAA 

gtagaagatgatgaaggatggatgtcacatgtacttcagtttgaggat7aaagcagaaaagt 
gaaagatgcaagcatgcaagactcagatacatttgaaatctatgatcctcggaatccagtga 
ataaaagaaggagggaagaaagcaaaaagctgatgagagagaaaaaagaaagaagaiaaaat 
gagaataatgataaccagaacttgctggaaatgtgcctacaatggccttgtaacagccattg 
ttcccaacagcatcacttaggggtgtgaaaagaagtatttttgaacctgttgtctggttttg 
aaaaacaattatcttgttttgcaaattgtggaatgatgtaagcaaatgcttttggttactgg 

TACATGTGTTTTTTCCTAGCTGACCTTTTATATTGCTAAATCTGAAATAAAATAACTTTCCT 
TCCACAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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></usr/seqdb2/sst/DNA/Dnaseqs.inin/ss.DNA50919 
xsubunit 1 of 1, 472 aa, 1 stop 
XMW: 53847, pi: 5.75, NX(S/T): 2 

MSNIYIQEPPTNGKVLLKTTAGDIDIELWSKEAPKACRNFIQLCLEAYYDNTIFHRWPGFI 

VQGGDPTGTGSGGESIYGAPFKDEFHSRLRFNRRGLVAMANAGSHDNGSQFFFTLGRADELN 

NKHTIFGKVTGDTVYNMLRLSEVDIDDDERPHNPHKIKSCEVLFNPFDDIIPREIKRLKKEK 

PEEEVKKLKPKGTKNFSLLSFGEEAEEEEEEVNRVSQSMKGKSKSSHDLLKDDPHLSSVPW 

ESEKGDAPDLVDDGEDESAEHDEYIDGDEKNLMRERIAKKLKKDTSANVKSAGEGEVEKKSV 

SRSEELRKEARQLKRELLAAKQKKVENAAKQAEKRSEEEEAPPDGAVAEYRREKQKYEALRK 

QQSKKGTSREDQTLALLNQFKSKLTQAIAETPENDIPETEVEDDEGWMSHVLQFEDKSRKVK 
DASMQDSDTFEIYDPRNPVNKRRREESKKLMREKKERR 

Important features: 
Signal peptide: 
amino acids 1-21 

N-glycosylation sites. 

amino acids 109-112 and 201-204 

Cyclophilin-type pep tidyl -prolyl cis-trans isomerase signature. 

amino acids 4 9-66 

Homologous region to Cyclophilin-type peptidyl -prolyl cis-trans 
isomerase 

amino acids 96-140, 49-89 and 22-51 



BNSDOCID' <W0 9946281 A2_L> 
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FIGURE 90 

CGCCGCCGTTGGGGCTGGAAGTTCCCGCCAGGTCCGTGCCGGGCGAGAGAGATGCTGCCCGG 

CCCGCCTCGGCTTTGAGGCGAGAGAAGTGTCCCAGACCCATTTCGCCTTGCTGACGGCGTCG 

AGCCCTGGCCAGACMfiTCCACAGGGTTCTCCTTCGGGTCCGGGACTCTGGGCTCCACCACC 

GTGGCCGCCGGCGGGACCAGCACAGGCGGCGTTTTCTCCTTCGGAACGGGAACGTCTAGCAA 

CCCTTCTGTGGGGCTCAATTTTGGAAATCTTGGAAGTACTTCAACTCCAGCAACTACATCTG 

CTCCTTCAAGTGGTTTTGGAACCGGGCTCTTTGGATCTAAACCTGCCACTGGGTTCACTCTA 

GGAGGAACAAATACAGGTGCCTTGCACACCAAGAGGCCTCAAGTGGTCACCAAATATGGAAC 

CCTGCAAGGAAAACAGATGCATGTGGGGAAGACACCCATCCAAGTCTTTTTAGGAGTCCCCT 

TCTCCAGACCTCCTCTAGGTATCCTCAGGTTTGCACCTCCAGAACCCCCGGAGCCCTGGAAA 

GGAATCAGAGATGCTACCACCTACCCGCCTGGATGGAGTCTCGCTCTGTCGCCAGGCTGGAG 

TGCAGTGGCACGATCTCGGCTCACTGCAACCTCCGCCTCCCGGGTTCAAGCGAGTCTCCTGC 

CTCAGCCTCTGAGTGTCTGGGGCTACAGGTGCCTGCAGGAGTCCTGGGGCCAGCTGGCCTCG 

ATGTACGTCAGCACGCGGGAACGGTACAAGTGGCTGCGCTTCAGCGAGGACTGTCTGTACCT 

GAACGTGTACGCGCCGGCGCGCGCGCCCGGGGATCCCCAGCTGCCAGTGATGGTCTGGTTCC 

CGGGAGGCGCCTTCATCGTGGGCGCTGCTTCTTCGTACGAGGGCTCTGACTTGGCCGCCCGC 

GAGAAAGTGGTGCTGGTGTTTCTGCAGCACAGGCTCGGCATCTTCGGCTTCCTGAGCACGGA 

CGACAGCCACGCGCGCGGGAACTGGGGGCTGCTGGACCAGATGGCGGCTCTGCGCTGGGTGC 

AGGAGAACATCGCAGCCTTCGGGGGAGACCCAGGAAATGTGACCCTGTTCGGCCAGTCGGCG 

GGGGCCATGAGCATCTCAGGACTGATGATGTCACCCCTAGCCTCGGGTCTCTTCCATCGGGC 

CATTTCCCAGAGTGGCACCGCGTTATTCAGACTTTTCATCACTAGTAACCCACTGAAAGTGG 

CCAAGAAGGTTGCCCACCTGGCTGGATGCAACCACAACAGCACACAGATCCTGGTAAACTGC 

CTGAGGGCACTATCAGGGACCAAGGTGATGCGTGTGTCCAACAAGATGAGATTCCTCCAACT 

GAACTTCCAGAGAGACCCGGAAGAGATTATCTGGTCCATGAGCCCTGTGGTGGATGGTGTGG 

TGATCCCAGATGACCCTTTGGTGCTCCTGACCCAGGGGAAGGTTTCATCTGTGCCCTACCTT 

CTAGGTGTCAACAACCTGGAATTCAATTGGCTCTTGCCTTATAATATCACCAAGGAGCAGGT 

ACCACTTGTGGTGGAGGAGTACCTGGACAATGTCAATGAGCATGACTGGAAGATGCTACGAA 

ACCGTATGATGGACATAGTTCAAGATGCCACTTTCGTGTATGCCACACTGCAGACTGCTCAC 

TACCACCGAGAAACCCCAATGATGGGAATCTGCCCTGCTGGCCACGCTACAACAAGGATGAA 

AAGTACCTGCAGCTGGATTTTACCACAAGAGTGGGCA2£SMGCTCAAGGAGAAGAAGATGGC 

TTTTTGGATGAGTCTGTACCAGTCTCAAAGACCTGAGAAGCAGAGGCAATTCTAAGGGTGGC 

TATGCAGGAAGGAGCCAAAGAGGGGTTTGCCCCCACCATCCAGGCCCTGGGGAGACTAGCCA 

TGGACATACCTGGGGACAAGAGTTCTACCCACCCCAGTTTAGAACTGCAGGAGCTCCCTGCT 

GCCTCCAGGCCAAAGCTAGAGCTTTTGCCTGTTGTGTGGGACCTGCACTGCCCTTTCCAGCC 

TGACATCCCATGATGCCCCTCTACTTCACTGTTGACATCCAGTTAGGCCAGGCCCTGTCAAC 

ACCACACTGTGCTCAGCTCTCCAGCCTCAGGACAACCTCTTTTTTTCCCTTCTTCAAATCCT 

CCCACCCTTCAATGTCTCCTTGTGACTCCTTCTTATGGGAGGTCGACCCAGACTGCCACTGC 

CCCTGTCACTGCACCCAGCTTGGCATTTACCATCCATCCTGCTCAACCTTGTTCCTGTCTGT 

TCACATTGGCCTGGAGGCCTAGGGCAGGTTGTGACATGGAGCAAACTTTTGGTAGTTTGGGA 

TCTTCTCTCCCACCCACACTTATCTCCCCCAGGGCCACTCCAAAGTCTATACACAGGGGTGG 

TCTCTTCAATAAAGAAGTGTT6ATTAGAAAAAAAAAAA 
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FIGURE 91 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA44179 
<subunit 1 of 1, 545 aa, 1 stop 
<MW: 58934, pi: 9.45, NX(S/T): 4 

MSTGFSFGSGTLGSTTVAAGGTSTGGVFSFGTGTSSNPSVGLNFGNLGSTSTPATTSAPSSG 

FGTGLFGSKPATGFTLGGTNTGALHTKRPQWTKYGTLQGKQMHVGKTPIQVFLGVPFSRPP 

LGILRFAPPEPPEPWKGIRDATTYPPGWSLALSPGWSAVARSRLTATSASRVQASLLPQPLS 

VWGYRCLQESWGQLASMYVSTRERYKWLRFSEDCLYLNVYAPARAPGDPQLPVMVWFPGGAF 

IVGAAS S YEGSDLAAREKWLVFLQHRLGI FGFLSTDDSHARGNWGLLDQMAALRWVQENIA 

AFGGDPGNVTLFGQSAGAMSISGLMMSPLASGLFHRAISQSGTALFRLFITSNPLKVAKKVA 

HLAGCNHNSTQILVNCLRALSGTKVMRVSNKMRFLQLNFQRDPEEIIWSMSPWDGWIPDD 

PLVLLTQGKVSSVPYLLGVNNLEFNWLLPYNITKEQVPLWEEYLDNVNEHDWKMLRNRMMD 
I VQDAT FVYATLQTAHYHRET PMMG ICPAGHAT TRMKS TCS W I LPQEWA 

Important features: 
Signal peptide: 
amino acids 1-29 

Carboxylesterases type-B serine active site, 
amino acids 312-327 

Carboxylesterases type-B signature 2, 
amino acids 218-228 

N-glycosylation sites. 

amino acids 318-321, 380-383 and 465-468 



BNSDCXID- <W0 9«4«2eiA2J_> 



33/23? 



wo W46281 




PCT/US99/05028 



GAGAACAGGCCTGTCTCAGGCAGGCCCTGCGCCTCCTATGCGGAG&ISCTACTGCCACTGCT 

GCTGTCCTCGCTGCTGGGCGGGTCCCAGGCTATGGATGGGAGATTCTGGATACGAGTGCAGG 

AGTCAGTGATGGTGCCGGAGGGCCTGTGCATCTCTGTGCCCTGCTCTTTCTCCTACCCCCGA 

CAAGACTGGACAGGGTCTACCCCAGCTTATGGCTACTGGTTCAAAGCAGTGACTGAGACAAC 

CAAGGGTGCTCCTGTGGCCACAAACCACCAGAGTCGAGAGGTGGAAATGAGCACCCGGGGCC 

GATTCCAGCTCACTGGGGATCCCGCCAAGGGGAACTGCTCCTTGGTGATCAGAGACGCGCAG 

ATGCAGGATGAGTCACAGTACTTCTTTCGGGTGGAGAGAGGAAGCTATGTGACATATAATTT 

CATGAACGATGGGTTCTTTCTAAAAGTAACAGTGCTCAGCTTCACGCCCAGACCCCAGGACC 

ACAACACCGACCTCACCTGCCATGTGGACTTCTCCAGAAAGGGTGTGAGCGCACAGAGGACC 

GTCCGACTCCGTGTGGCCTATGCCCCCAGAGACCTTGTTATCAGCATTTCACGTGACAACAC 

GCCAGCCCTGGAGCCCCAGCCCCAGGGAAATGTCCCATACCTGGAAGCCCAAAAAGGCCAGT 

TCCTGCGGCTCCTCTGTGCTGCTGACAGCCAGCCCCCTGCCACACTGAGCTGGGTCCTGOIG 

AACAGAGTCCTCTCCTCGTCCCATCCCTGGGGCCCTAGACCCCTGGGGCTGGAGCTGCCCGG 

GGTGAAGGCTGGGGATTCAGGGCGCTACACCTGCCGAGCGGAGAACAGGCTTGGCTCCCAGC 

AGCGAGCCCTGGACCTCTCTGTGCAGTATCCTCCAGAGAACCTGAGAGTGATGGTTTCCCAA 

GCAAACAGGACAGTCCTGGAAA?^CCTTGGGAACGGCACGTCTCTCCCAGTACTGGAGGGCCA 

AAGCCTGTGCCTGGTCTGTGTCACACACAGCAGCCCCCCAGCCAGGCTGAGCTGGACCCAGA 

GGGGACAGGTTCTGAGCCCCTCCCAGCCCTCAGACCCCGGGGTCCTGGAGCTGCCTCGGGTT 

CAAGTGGAGCACGAAGGAGAGTTCACCTGCCACGCTCGGCACCCACTGGGCTCCCAGCACGT 

CTCTCTCAGCCTCTCCGTGCACTATAAGAAGGGACTCATCTCAACGGCATTCTCCAACGGAG 

CGTTTCTGGGAATCGGCATCACGGCTCTTCTTTTCCTCTGCCTGGCCCTGATCATCATGAAG 

ATTCTACCGAAGAGACGGACTCAGACAGAAACCCCGAGGCCCAGGTTCTCCCGGCACAGCAC 

GATCCTGGATTACATCAATGTGGTCCCGACGGCTGGCCCCCTGGCTCAGAAGCGGAATCAGA 

AAGCCACACCAAACAGTCCTCGGACCCCTCCTCCACCAGGTGCTCCCTCCCCAGAATCAAAG 

AAGAACCAGAAAAAGCAGTATCAGTTGCCCAGTTTCCCAGAACCCAAATCATCCACTCAAGC 

CCCAGAATCCCAGGAGAGCCAAGAGGAGCTCCATTATGCCACGCTCAACTTCCCAGGCGTCA 

GACCCAGGCCTGAGGCCCGGATGCCCAAGGGCACCCAGGCGGATTATGCAGAAGTCAAGTTC 

CAATSS&GGGTCTCTTAGGCTTTAGGACTGGGACTTCGGCTAGGGAGGAAGGTAGAGTAAGAG 

GTTGAAGATAACAGAGTGCAAAGTTTCCTTCTCTCCCTCTCTCTCTCTCTTTCTCTCTCTCT 

CTCTCTTTCTCTCTCTTTTAAAAAAACATCTGGCCAGGGCACAGTGGCTCACGCCTGTAATC 

CCAGCACTTTGGGAGGTTGAGGTGGGCAGATCGCCTGAGGTCGGGAGTTCGAGACCAGCCTG 

GCCAACTTGGTGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCTGGGCATGGTGGCAGG 

CGCCTGTAATCCTACCTACTTGGGAAGCTGAGGCAGGAGAATCACTTGAACCTGGGAGACGG 

AGGTTGCAGTGAGCCAAGATCACACCATTGCACGCCAGCCTGGGCAACAAAGCGAGACTCCA 

TCTCT^AAAAAAAAATCCTCCAAATGGGTTGGGTGTCTGTAATCCCAGCACTTTGGGAGGCTA 

AGGTGGGTGGATTGCTTGAGCCCAGGAGTTCGAGACCAGCCTGGGCAACATGGTGAAACCCC 

ATCTCTACAAAAAATACAAAACATAGCTGGGCTTGGTGGTGTGTGCCTGTAGTCCCAGCTGT 

CAGACATTTAAACCAGAGCAACTCCATCTGGAATAGGAGCTGAATAAAATGAGGCTGAGACC 

TACTGGGCTGCATTCTCAGACAGTGGAGGCATTCTAAGTCACAGGATGAGACAGGAGGTCCG 

TACAAGATACAGGTCATAAAGACTTTGCTGATAAAACAGATTGCAGTAAAGAAGCCAACCAA 

ATCCCACCAAAACCAAGTTGGCCACGAGAGTGACCTCTGGTCGTCCTCACTGCTACACTCCT 

GACAGCACCATGACAGTTTACAAATGCCATGGCAACATCAGGAAGTTACCCGATATGTCCCA 

AAAGGGGGAGGAATGAATAATCCACCCCTTGTTTAGCAAATAAGCAAGAAATAACCATAAAA 

GTGGGCAACCAGCAGCTCTAGGCGCTGCTCTTGTCTATGGAGTAGCCATTCTTTTGTTCCTT 

TACTTTCTTAATAAACTTGCTTTCACCTTAAAAAAA 
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FIGURE 93 

></usr/seqcib2/sst/DNA/Dnaseqs.min/ss.DNA54002 
xsubunit i of 1, 544 aa, 1 stop 
XMW: 60268, pi: 9.53, NX(S/T): 3 

MLLPLLLSSLLGGSQAMDGRFWIRVQESVMVPEGLCISVPCSFSYPRQDWTGSTPAYGYWFK 
AVTETTKGAPVATNHQSREVEMSTRGRFQLTGDPAKGNCSLVIRDAQMQDESQYFFRVERGS 
YVTYNFMNDGFFLKVTVLSFTPRPQDHNTDLTCHVDFSRKGVSAQRTVRLRVAYAPRDLVIS 
ISRDNTPALEPQPQGNVPYLEAQKGQFLRLLCAADSQPPATLSWVLQ^fRVLSSSHPWGPRPL 
GLELPGVKAGDSGRYTCRAENRLGSQQRALDLSVQYPPENLRVMVSQANRTVLENLGNGTSL 
PVLEGQS LC LVCVTHS S P PARLS WTQRGQVLS PS QPSDPGVLELPRVQVEHEGEFTCHARHP 
LGSQHVSLSLSVHYKKGLISTAFSNGAFLGIGITALLFLCLALIIMKILPKRRTQTETPRPR 
FSRHSTILDYINWPTAGPLAQKRNQKATPNSPRTPPPPGAPSPESKKNQKKQYQLPSFPEP 
KSSTQAPESQESQEELHYATLNFPGVRPRPEARMPKGTQADYAEVKFQ 

Important features: 
Signal peptide: 
amino acids 1-15 

Transmembrane domain: 

amino acids 399-418 

N-glycosylation site. 

amino acids 100-103, 297-300 and 306-309 

Immunoglobulins and major histocon^atibility conplex proteins 

signature. 

amino acids 365-371 
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FIQURE 94 

TGAAGAGTAATAGTTGGAATCAAAAGAGTCAACGCMIfiAACTGTTATTTACTGCTGCGTTT 

TATGTTGGGAATTCCTCTCCTATGGCCTTGTCTTGGAGCAACAGAAAACTCTCAAACAAAGA 

AAGTCAAGCAGCCAGTGCGATCTCATTTGAGAGTGAAGCGTGGCTGGGTGTGGAACCAATTT 

TTTGTACCAGAGGAAATGAATACGACTAGTCATCACATCGGCCAGCTAAGATCTGATTTAGA 

CAATGGAAACAATTCTTTCCAGTACAAGCTTTTGGGAGCTGGAGCTGGAAGTACTTTTATCA 

TTGATGAAAGAACAGGTGACATATATGCCATACAGAAGCTTGATAGAGAGGAGCGATCCCTC 

TACATCTTAAGAGCCCAGGTAATAGACATCGCTACTGGAAGGGCTGTGGAACCTGAGTCTGA 

GTTTGTCATCAAAGTTTCGGATATCAATGACAATGAACCAAAATTCCTAGATGAACCTTATG 

AGGCCATTGTACCAGAGATGTCTCCAGAAGGAACATTAGTTATCCAGGTGACAGCAAGTGAT 

GCTGACGATCCCTCAAGTGGTAATAATGCTCGTCTCCTCTACAGCTTACTTCAAGGCCAGCC 

ATATTTTTCTGTTGAACCAACAACAGGAGTCATAAGAATATCTTCTAAAATGGATAGAGAAC 

TGCAAGATGAGTATTGGGTAATCATTCAAGCCAAGGACATGATTGGTCAGCCAGGAGCGTTG 

TCTGGAACAACAAGTGTATTAATTAAACTTTCAGATGTTAATGACAATAAGCCTATATTTAA 

AGAAAGTTTATACCGCTTGACTGTCTCTGAATCTGCACCCACTGGGACTTCTATAGGAACAA 

TCATGGCATATGATAATGACATAGGAGAGAATGCAGAAATGGATTACAGCATTGAAGAGGAT 

GATTCGCAAACATTTGACATTATTACTAATCATGAAACTCAAGAAGGAATAGTTATATTAAA 

AAAGAAAGTGGATTTTGAGCACCAGAACCACTACGGTATTAGAGCAAAAGTTAAAAACCATC 

ATGTTCCTGAGCAGCTCATGAAGTACCACACTGAGGCTTCCACCACTTTCATTAAGATCCAG 

GTGGAAGATGTTGATGAGCCTCCTCTTTTCCTCCTTCCATATTATGTATTTGAAGTTTTTGA 

AGAAACCCCACAGGGATCATTTGTAGGCGTGGTGTCTGCCACAGACCCAGACAATAGGAAAT 

CTCCTATCAGGTATTCTATTACTAGGAGCAAAGTGTTCAATATCAATGATAATGGTACAATC 

ACTACAAGTAACTCACTGGATCGTGAAATCAGTGCTTGGTACAACCTAAGTATTACAGCCAC 

AGAAAAATXCAATATAGAACAGATCTCTTCGATCCCACTGTATGTGCAAGTTCTTAACATCA 

ATGATCATGCTCCTGAGTTCTCTCAATACTATGAGACTTATGTTTGTGAAAATGCAGGCTCT 

ggtcaggtaattcagactatcagtgcagtggatagagatgaatccatagaagagcaccattt 
ttactttaatctatctgtagaagacactaacaattcaagttttacaatcatagataatcaag 
ataacacagctgtcattttgactaatagaactggttttaaccttcaagaagaacctgtcttc 
tacatctccatcttaattgccgacaatggaatcccgtcacttacaagtacaaacacccttac 
catccatgtctgtgactgtggtgacagtgggagcacacagacctgccagtaccaggagcttg 

TGCTTTCCATGGGATTCAAGACAGAAGTTATCATTGCTATTCTCATTTGCATTATGATCATA 

TTTGGGTTTATTTTTTTGACTTTGGGTTTAAAACAACGGAGAAAACAGATTCTATTTCCTGA 

GAAAAGTGAAGATTTCAGAGAGAATATATTCCAATATGATGATGAAGGGGGTGGAGAAGAAG 

ATACAGAGGCCTTTGATATAGCAGAGCTGAGGAGTAGTACCATAATGCGGGAACGCAAGACT 

CGGAAAACCACAAGCGCTGAGATCAGGAGCCTATACAGGCAGTCTTTGCAAGTTGGCCCCGA 

CAGTGCCATATTCAGGAAATTCATTCTGGAAAAGCTCGAAGAAGCTAATACTGATCCGTGTG 

CCCCTCCTTTTGATTCCCTCCAGACCTACGCTTTTGAGGGAACAGGGTCATTAGCTGGATCC 

CTGAGCTCCTTAGAATCAGCAGTCTCTGATCAGGATGAAAGCTATGATTACCTTAATGAGTT 

GGGACCTCGCTTTAAAAGATTAGCATGCATGTTTGGTTCTGCAGTGCAGTCAAATAATiaJiG 

GCTTTTTACCATCAAAATTTTTAAAAGTGCTAATGTGTATTCGAACCCAATGGTAGTCTTAA 

AGAGTTTTGTGCCCTGGCTCTATGGCGGGGAAAGCCCTAGTCTATGGAGTTTTCTGATTTCC 

CTGGAGTAAATACTCCATGGTTATTTTAAGCTACCTACATGCTGTCATTGAACAGAGATGTG 

GGGAGAAATGTAAACAATCAGCTCACAGGCATCAATACAACCAGATTTGAAGTAAAATAATG 

TAGGAAGATATTAAAAGTAGATGAGAGGACACAAGATGTAGTCGATCCTTATGCGATTATAT 

CATTATTTACTTAGGAAAGAGTAAAAATACCAAACGAGAAAATTTAAAGGAGCAAAAATTTG 

CAAGTCAAATAGAAATGTACAAATCGAGATAACATTTACATTTCTATCATATTGACATGAAA 

ATTGAAAATGTATAGTCAGAGAAATTTTCATGAATTATTCCATGAAGTATTGTTTCCTTTAT 

TTAAA 
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FIGURE 9R 

></usr/seqcib2/sst/DNA/Dnaseqs.min/ss.DNA53906 
xsubunit 1 of l, 772 aa, 1 stop 
XMW: 87002, pi: 4.64, NX(S/T): 8 

MNCYLLLRFMLGIPLLWPCLGATENSQTKKVKQPVRSHLRVKRGWVWNQFFVPEEMNTTSHH 
IGQLRSDLDNGNNSFQYKLLGAGAGSTFIIDERTGDIYAIQKLDREERSLYILRAQVIDIAT 
GRAVEPESEFVIKVSDINDNEPKFLDEPYEAIVPEMSPEGTLVIQVTASDADDPSSGNNARL 
LYSLLQGQPYFSVEPTTGVIRISSKMDRELQDEYWVIIQAKDMIGQPGALSGTTSVLIKLSD 
VNDNKPIFKESLYRLTVSESAPTGTSIGTIMAYDNDIGENAEMDYSIEEDDSQTFDIITNHE 
TQEGIVILKKKVDFEHQNHYGIRAKVKNHHVPEQLMKYHTEASTTFIKIQVEDVDEPPLFLL 
PYYVFEVFEETPQGSFVGWSATDPDNRKSPIRYSITRSKVFNINDNGTITTSNSLDREISA 
WYNLSITATEKYNIEQISSIPLYVQVLNINDHAPEFSQYYETYVCENAGSGQVIQTISAVDR 
DESIEEHHFYFNLSVEDTNNSSFTIIDNQDNTAVILTNRTGFNLQEEPVFYISILIADNGIP 
SLTSTNTLTIHVCDCGDSGSTQTCQYQELVLSMGFKTEVI lAILICIMI I FGFI FLTLGLKQ 

RRKQILFPEKSEDFRENIFQYDDEGGGEEDTEAFDIAELRSSTIMRERKTRKTTSAEIRSLY 

RQSLQVGPDSAIFRKFILEKLEEANTDPCAPPFDSLQTYAFEGTGSLAGSLSSLESAVSDOD 
ESYDYLNELGPRFKRLACMFGSAVQSNN 

Important features: 
Signal peptide: 
amino acids 1-21 

Transmonbrane domain: 

amino acids 597-617 

N-glycosylation sites. 

amino acids 57-60, 74-77, 419-423, 437-440, 508-511, 515-518, 
516-519 and 534-537 

Cadherins extracellular repeated domain signature, 
amino acids 136-146 and 244-254 
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FIGURE 96 

ATTTCAAGGCCAGCCATATTTTTNTGTTGAACCAACAACAGGAGTCATAAGAATATTTTNTA 
AAATGGATAGAGAACTGCAAGATGAGTATTGGGTAATCATTCAAGCCAAGGACATGATTGGT 
CAGCCAGGAGCGTT.GTNTGGAACAACAAGTGTATTAATTAAACTTTCAGATGTTAATGACAA 
TAAGCCTATATTTAAAGAAAGTTTATACCGCTTGACTGTNTNTGAATCTGCACCCACTGGGA 
NTTNTATAGGAACAATCATGGCATATGATAATGACATAGGAGAGAATGCAGAAATGGATTAC 
AGCATTGAAGAGGATGATTCGCAAACATTTGACATTATT 
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FIGURE 97 

GCAACCTCAGCTTCTAGTATCCAGACTCCAGCGCCGCCCCGGGCGCGGACCCCAACCCCGAC 

CCAGAGCTTCTCCAGCGGCGGCGCAGCGAGCAGGGCTCCCCGCCTTAACTTCCTCCGCGGGG 

CCCAGCCACCTTCGGGAGTCCGGGTTGCCCACCTGCAAACTCTCCGCCTTCTGCACCTGCCA 

CCCCTGAGCCAGCGCGGGCCCCCGAGCGAGTCAlfiGCCAACGCGGGGCTGCAGCTGTTGGGC 

TTCATTCTCGCCTTCCTGGGATGGATCGGCGCCATCGTCAGCACTGCCCTGCCCCAGTGGAG 

GATTTACTCCTATGCCGGCGACAACATCGTGACCGCCCAGGCCATGTACGAGGGGCTGTGGA 

TGTCCTGCGTGTCGCAGAGCACCGGGCAGATCCAGTGCAAAGTCTTTGACTCCTTGCTGAAT 

CTGAGCAGCACATTGCAAGCAACCCGTGCCTTGATGGTGGTTGGCATCCTCCTGGGAGTGAT 

AGCAATCTTTGTGGCCACCGTTGGCATGAAGTGTATGAAGTGCTTGGAAGACGATGAGGTGC 

AGAAGATGAGGATGGCTGTCATTGGGGGTGCGATATTTCTTCTTGCAGGTCTGGCTATTTTA 

GTTGCCACAGCATGGTATGGCAATAGAATCGTTCAAGAATTCTATGACCCTATGACCCCAGT 

CAATGCCAGGTACGAATTTGGTCAGGCTCTCTTCACTGGCTGGGCTGCTGCTTCTCTCTGCC 

TTCTGGGAGGTGCCCTACTTTGCTGTTCCTGTCCCCGAAAAACAACCTCTTACCCAACACCA 

AGGCCCTATCCAAAACCTGCACCTTCCAGCGGGAAAGACTACGTGIfi&CACAGAGGCAAAAG 

GAGAAAATCATGTTGAAACAAACCGAAAATGGACATTGAGATACTATCATTAACATTAGGAC 

CTTAGAATTTTGGGTATTGTAATCTGAAGTATGGTATTACAAAACAAACAAACAAACAAAAA 

ACCCATGTGTTAAAATACTCAGTGCTAAACATGGCTTAATCTTATTTTATCTTCTTTCCTCA 

ATATAGGAGGGAAGATTTTTCCATTTGTATTACTGCTTCCCATTGAGTAATCATACTCAAAT 

GGGGGAAGGGGTGCTCCTTAAATATATATAGATATGTATATATACATGTTTTTCTATTAAAA 

ATAGACAGTAAAATACTATTCTCATTATGTTGATACTAGCATACTTAAAATATCTCTAAAAT 

AGGTAAATGTATTTAATTCCATATTGATGAAGATGTTTATTGGTATATTTTCTTTTTCGTCC 

TTATATACATATGTAACAGTCAAATATCATTTACTCTTCTTCATTAGCTTTGGGTGCCTTTG 

CCACAAGACCTAGCCTAATTTACCAAGGATGAATTCTTTCAATTCTTCATGCGTGCCCTTTT 

CATATACTTATTTTATTTTTTACCATAATCTTATAGCACTTGCATCGTTATTAAGCCCTTAT 

TTGTTTTGTGTTTCATTGGTCTCTATCTCCTGAATCTAACACATTTCATAGCCTACATTTTA 

GTTTCTAAAGCCAAGAAGAATTTATTACAAATCAGAACTTTGGAGGCAAATCTTTCTGCATG 

ACCAAAGTGATAAATTCCTGTTGACCTTCCCACACAATCCCTGTACTCTGACCCATAGCACT 

CTTGTTTGCTTTGAAAATATTTGTCCAATTGAGTAGCTGCATGCTGTTCCCCCAGGTGTTGT 

AACACAACTTTATTGATTGAATTTTTAAGCTACTTATTCATAGTTTTATATCCCCCTAAACT 

ACCTTTTTGTTCCCCATTCCTTAATTGTATTGTTTTCCCAAGTGTAATTATCATGCGTTTTA 

TATCTTCCTAATAAGGTGTGGTCTGTTTGTCTGAACAAAGTGCTAGACTTTCTGGAGTGATA 

ATCTGGTGACAAATATTCTCTCTGTAGCTGTAAGCAAGTCACTTAATCTTTCTACCTCTTTT 

TTCTATCTGCCAAATTGAGATAATGATACTTAACCAGTTAGAAGAGGTAGTGTGAATATTAA 

TTAGTTTATATTACTCTTATTCTTTGAACATGAACTATGCCTATGTAGTGTCTTTATTTGCT 

CAGCTGGCTGAGACACTGAAGAAGTCACTGAACAAAACCTACACACGTACCTTCATGTGATT 

CACTGCCTTCCTCTCTCTACCAGTCTATTTCCACTGAACAAAACCTACACACATACCTTCAT 

GT6GTTCAGTGCCTTCCTCTCTCTACCAGTCTATTTCCACTGAACAAAACCTACGCACATAC 

CTTCATGTGGCTCAGTGCCTTCCTCTCTCTACCAGTCTATTTCCATTCTTTCAGCTGTGTCT 

GACATGTTTGTGCTCTGTTCCATTTTAACAACTGCTCTTACTTTTCCAGTCTGTACAGAATG 

CTATTTCACTTGAGCAAGATGATGTAATGGAAAGGGTGTTGGCACTGGTGTCTGGAGACCTG 

GATTTGAGTCTTGGTGCTATCAATCACCGTCTGTGTTTGAGCAAGGCATTTGGCTGCTGTAA 

GCTTATTGCTTCATCTGTAAGCGGTGGTTTGTAATTCCTGATCTTCCCACCTCACAGTGATG 

TTGTGGGGATCCAGTGAGATAGAATACATGTAAGTGTGGTTTTGTAATTTAAAAAGTGCTAT 

ACTAAGGGAAAGAATTGAGGAATTAACTGCATACGTTTTGGTGTTGCTTTTCAAATGTTTGA 

AAATAAAAAAAATGTTAAG 



BNSDOCID- <W0 fl946281A2J.> 



wo 99/46281 




PCT/US99/05028 



FIfiURE 98 



></usr/seqclb2/sst/DNA/Dnaseqs .min/ss . DNA52185 
xsubunit 1 of I, 211 aa, 1 stop 
XMW: 22744, pi: 8.51, NX(S/T): 1 

MANAGLQLLGFILAFLGWIGAIVSTALPQWRIYSYAGDNIVTAQAMYEGLWMSCVSQSTGQI 
QCKVFDSLLNLSSTLQATRALMVVGILLGVIAIFVATVGMKCMKCLEDDEVQKMRMAVIGGA 
IFLLAGLAILVATAWYGNRIVQEFYDPMTPVNARYEFGQALFTGWAAASLCLLGGALLCCSC 
PRKTTSYPTPRPYPKPAPSSGKDYV 

Important features: 
Signal peptide: 
amino acids 1-21 

Transmembrane domains: 

amino acids 82-102, 118-142 and 161-187 

N-glycosylation site, 

amino acids 72-75 

PMP-22 / EMP / MP20 family proteins 
amino acids 70-111 

ABC-2 type transport system integral membrane protein 
amino acids 119-133 
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TTCTGGCCAAACCCGGGGCTNCAGCTGTTGGGCTTCATCTCGCCTTCCTGGGATGGATCGGC 
GCCATCNTCACACTGCCCTTCCCCAGTGGAGGATTTTACTCCCTATGCTGGCGACAACATCG 
TGACCGCCCAGCCCATGTACGAGGGGCTGTGGATGTCCNGCGTGTCGCAGAGCACCGGGCAG 
ATCCAGTGCAAAGTCTTTGACTCCTTGCTGAATCTGAGCAGCACATTGCAAGCAACCCGTGC 
CTTGATGGTGGTTGGCATCCTCCTGGGAGTGATAGCAATCTTTGTGGCCACCGTTGGCATGA 
AGTGTATGAAGTGCTTGGAAGACGATGAGGTGCAGAAGATGAGGATGGCTGTCATTGGGGGC 
GCGATATTTCTTCTTGCAGGTCTGGCTATTTTAGTTGCCACAGCATGGTATGGCAATAGAAN 
CNTTCAACANTTCTATGACCCTATGACCCCAGTCAATGCCAGGTACGAATTTGGTCA 

GGCTCTCTTCACTGGCTGGGCTGCTGCTTCTCTCTGCCTTCTGGGAGGTGCCCTACTTTGCT 
GTTCCTGTCCC 
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FIGURE 100 



ACCCTTGACCCAACGCGGCCCCCCGACCGNTTCATGGCCAAACGCGGGNCTCCAGCTGTTGG 
GCTTCATTCTCCCCTTCCTGGGATGGACCGGCGCCCATCNTCAGCACTGCCCTGCCCCAGTG 
GAGGATTTACTCCTATNCCGGCNACAACATCGTGACCGCCCAGGCCNTGTACGAGGGGCTGT 
GGATGTCCTGCGTGTCGCAGAGCACCGGGCAGATCCAGTGCAAAGTCTTTGACTCCCTTGCT 
GAATCTGAGCAGCACATTGCAAGCAACCCGTGCCTTGATGGTGGTTGGCATCCTCCTGGGAG 
TGATAGCAATCTTNNTGGCCACCGTTGTNNNTGAAGTGTATGAAGTGCTTG6AAGACGATGA 
GGTGCAGAAGATGAGGATGGCTGTCATTGGGGGCGCGATATTTCTTCTTGCAGGTCTGGCTA 
TTTTAGTTGCCACAGCATGGTATGGCAATAGAATCGTTCAAGAATTCTATGACCCTATGACCGA 
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FIGURE 101 

GGGCCCGACCATTATCCAACCGGGNTCACTGTTGGCTCATCTCCCTCCTGGATGAANCGCGC 
CATCNTCAGACTCCCTGCCCCATGGAGATTTNNCCTATGCTGGCGACAACATCNTGACCCCC 
AGCCATGTACGAGGGGCTTTGAACGTCNGCGTGTCGCAGANCACCGGGCAGATCCAGTGCAA 
AGTCTTTGACTCCTTGCTGAATCTGNGCAGCACATTGCAGCAACCCNTGCCCTGATGGTGGT 
TGGCATCCTCCTGGGAGTGATAGCAATCTTTGTGGCCACCGTTGGCATGAAGTGTATGAAGT 
GCTTGGAAGACGATGAGGTGCAGAAGATGAGGATGGCTGTCATTGGGGGCGCGATATTTCTT 
CTTGCAGGTCTGGCTATTTNNNGTTGCCACAGCATGGTATGGCAATAGAATCGTTCAAGAAT 
TCTATGACCCTATGACCCCAGTCAATGCCAGGTACGAATTTGGTCAGGCTCTCTTCACTGGC 
TGGGCTGCTGCTTCTCTCTGCCTTCTGGGAGGTGCCCTACTTTGCTGTTCCTGCGA 
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FIGURE 102 

ATTCTCCCCTCCTGGATGGATCGCNCCACCGTCACATTGCCTTCCCCCANTGGAGGATTNAC 
TCCTATGCTGGCGACAACATCGTGACCCCCCAGGCCATTTACCGAGGGGCTTTGGATGTCNT 
GCNTGTCGCAGAGCACCGGGCAGATCCCAGTGCAAAGTCTTTGACTCCTTGCTGAATCTGAG 
CAGCACATTGCAAGCAACCCGTGCCTTGATGGGGTTGGCATCCTCCTGGGAGTGATAGCAAC 
CTTTGTGGCCACCGTTGGCATGAAGTGTATGAAGTGCTTGGAAGACGATGAGGTGCCAGAAG 
ATGAGGATGGCTGTCATTGGGGGCGCGATATTTCTTGTTGCAGGTCTGGCTATTTTAGTNGC 
CACAGCATGGTATGGCAATAGANTNNTTCNNGNNNTCTATGACCCTATGACCCCAGTCAATG 
CCAGGTACGAATTTGGTCAGGCTCTCTTCACTGGCTGGGCTGCTGCTTCTCTCTGCCTTCTG 
GGAGGTGCCCTACTTTGCTGTTCCTGTCCC 
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FIGURE 103 

AGAGCACCGGCAGATCCCAGTNCAAAGTCTTTGACCCTTGCTGAATCTGAGCAGCACATTNC 

AAGCAACCCCTTGCCTTGAAGGTGGTTGNCATCCCCCCTGGGAGTGAATAGCAATCTTTGTG 

GCCACCGTTGGCATGAAGTNTATGAAGTGCTTGGAAGACGATGAGGTGCAGAAGATGAGGAT 

GGCTGTCATTGGGGGCGCGATATTTCTTCTTGCAGGTCTGGCTATTTTAGTNNCCACAGCAT 

GGTATGGCAATAGNATNNTTCGNGGNTTCTATGACCCTATGACCCCAGTCAATGCCAGGTAC 

GAATTTGGTCAGGCTCTCTTCACTGGCTGGGCTGCTGCTTCTCTCTGCCTTCTGGGAGGTGC 
CCTACTTTGCTGTTCCTGTCCCCGAA 
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FIGURE 104 

AGCAATGCCCTGCCCCCAGTGGAGGATTAATTCCTATGNTGGGGACAACATTGTGACNGCCC 
AGGCCATGTACGGGGGGCTGTGGATGTCCTGCGTGTCGCAGAGCACCGGGCAGATCCAGTGC 
AAAGTNTTTGACTCCTTGCTGAATTTGAGCAGCACATTGCAAGCAACCCGTGCCTTGATGGT 
GGTTGGCATCTTCCTGGGAGTGATAGCAATCTTTGTGGCCACCGTGGNAATGAAGTGTATGA 
AGTGCTTGGAAGACGATGAGGTGCAGAAGATGAGGATGGCTGTCATTGGGGGCGCGATATTT 
CTTNTTGCAGGTCTGGCTATTTTAGTTGCCACAGCATGGTATGGCAATAGAATNGTTCAAGA 
ATTTTATGACCCTATGACCCCAGTCAATGCCAGGTACGAATTTGGTCAGGCTTTNTTCACTG 
GCTGGGCTGCTGCTTNTTTCTGCCTTNTGGGAGGTGCCCTANTTTGCTGTTCCTGCGAACC 
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FIGURE 1QR 

TCATAGGGGGGCGCGATATTTTTTCTTGCAGGTNTGGTTATTTTAGTTGCCACAGCATGGTA 

TGGCAATAGAATCGTTCAAGAATTNTATGACCCTATGACCCCAGTCAATGCCAGGTACGAAT 

TTGGTCAGGCTCTNTTCACTGGNTGGGCTGCTGCTTCTNTNNGCCTTNTGGGAGGTGCCCTA 
CTTTGCTGTTCCTG 
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FIGURE 106 



TTCCTGGGATGGATCCGCCCCCATCNTCACATGCCCTGCCCCNTGGAGATTTACNCCTATGC 
TGGCGAACAACATCNTGACCGCCCAGGCCATGTACGAGGGGCTGTGGAATGTCCTGCGTGTC 
CCAGAGCACCGGGCAGATCCAGTGCAAAGTCTTTGACTCCTTGCTGAATCTGAGCAGCACAT 



CCACCGTTGGCATGAAAGTGTATGAAGTGCTTGGAAGACGATGAGGTGCAGAAGATGAGGAT 
GGCTGTCATTGGGGGCGCGATATTTCTTCTTGCAGGTCTGGCTATTTTAGNNGCCACAGCAT 
GGTATGGCAATCAGACCCNNTCANAAACTCTATGACCCTATGACCCCAGTCAATGCCAGGTA 
CGAATTTGGTCAGGCTCTCTTCACTGGCTGGGCTGCTGCTTCTCTCTGCCTTCTGGGAGGTG 
CCCTACTTTGCTGTTCCTGTCCCCGAT^AAACAACCTCTTACCCACG 



TGCAAGCAACCNTGCCTTGATGGTGGTTGGCATCCTCCTGGGAGTGATAGCAATCTTTGTGG 
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FIGURE 1 07 



CGGGGCTGCAGCTGTTGGGCTTCATCTCGCTTCCTGGGATGGAATCGGCGCCATCGTCAGCA 
CTGCCCTGCCCCATGGAGGATTTACTCNTATGCTGGCGACAACATCGTGACCNCCCAGGCCA 
TGTACGAGGGGCTGTGGATGTCNGCGTGTCGCAGAGCACCGGGCAGATCCAGTGCAAAGTCT 
TTGACTCCTTGCTGAATCTGAGCAGCACATTGCAAGCAACCNTGCCTTGATGGTGGTTGGCA 
TCCTCCTGGGAGTGATAGCAATCTTTGTGGCCACCGTTGGCATGAAGTGTATGAAGTGCTTG 
GAAGACGATGAGGTGCAGAAGATGAGGATGGCTGTCATTGGGGGCGCGATATTTCTTCTTGC 
AGGTCTGGCTATTTNTAGTTGCCACAGCATGGTATGGCAATAGAATCGTTCAAGAATTCTAT 
GACCCTATGACCCCAGTCAATGCCAGGTACGAATTTGGTCAGGCTCTCTTCACTGGCTGGGC 
TGCTGCTTCTCTCTGCCTTCTGGGAGGTGCCCTACTTTGCTGTTCCTGCGAA 
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FIGURE 108 

GCGTGCCGTCAGCTCGCCGGGCACCGCGGCCTCGCCCTCGCCCTCCGCCCCTGCGCCTGCAC 

CGCGTAGACCGACCCCCCCCTCCAGCGCGCCCACCCGGTAGAGGACCCCCGCCCGTGCCCCG 

ACCGGTCCCCGCCTTTTTGTAAAACTTAT^GCGGGCGCAGCATTAACGCTTCCCGCCCCGGT 

GACCTCTCAGGGGTCTCCCCGCCAAAGGTGCTCCGCCGCTAAGGAACATGGCGAAGGTGGAG 

CAGGTCCTGAGCCTCGAGCCGCAGCACGAGCTCAAATTCCGAGGTCCCTTCACCGATGTTGT 

CACCACCAACCTAAAGCTTGGCAACCCGACAGACCGAAATGTGTGTTTTAAGGTGAAGACTA 

CAGCACCACGTAGGTACTGTGTGAGGCCCAACAGCGGAATCATCGATGCAGGGGCCTCAATT 

AATGTATCTGTGATGTTACAGCCTTTCGATTATGATCCCAATGAGAAAAGTAAACACAAGTT 

TATGGTTCAGTCTATGTTTGCTCCAACTGACACTTCAGATATGGAAGCAGTATGGAAGGAGG 

CAAAACCGGAAGACCTTATGGATTCAAAACTTAGATGTGTGTTTGAATTGCCAGCAGAGAAT 

GAT AAAC C AC AT G ATGT AGAAAT AAAT AAAAT TAT AT CC ACAAC TGC ATCAAAGACAGAAAC 

ACCAATAGTGTCTAAGTCTCTGAGTTCTTCTTTGGATGACACCGAAGTTAAGAAGGTTATGG 

AAGAATGTAAGAGGCTGCAAGGTGAAGTTCAGAGGCTACGGGAGGAGAACAAGCAGTTCAAG 

GAAGAAGATGGACTGCGGATGAGGAAGACAGTGCAGAGCAACAGCCCCATTTCAGCATTAGC 

CCCAACTGGGAAGGAAGAAGGCCTTAGCACCCGGCTCTTGGCTCTGGTGGTTTTGTTCTTTA 

TCGTTGGTGTAATTATTGGGAAGATTGCCTTGiaSAGGTAGCATGCACAGGATGGTAAATTG 

GATTGGTGGATCCACCATATCATGGGATTTAAATTTATCATAACCATGTGTAAAAAGAAATT 

AATGTATGATGACATCTCACAGGTCTTGCCTTTAAATTACCCCTCCCTGCACACACATACAC 

AGATACACACACACAAATATAATGTAACGATCTTTTAGAAAGTTAAAAATGTATAGTAACTG 

ATTGAGGGGGAAAAAGAATGATCTTTATTAATGACAAGGGAAACCATGAGTAATGCCACAAT 

GGCATATTGTAAATGTCATTTTAAACATTGGTAGGCCTTGGTACATGATGCTGGATTACCTC 

TCTTAAAATGACACCCTTCCTCGCCTGTTGGTGCTGGCCCTTGGGGAGCTGGAGCCCAGCAT 

GCTGGGGAGTGCGGTCAGCTCCACACAGTAGTCCCCACGTGGCCCACTCCCGGCCCAGGCTG 

CTTTCCGTGTCTTCAGTTCTGTCCAAGCCATCAGCTCCTTGGGACTGATGAACAGAGTCAGA 

AGCCCAAAGGAATTGCACTGTGGCAGCATCAGACGTACTCGTCATAAGTGAGAGGCGTGTGT 

TGACTGATTGACCCAGCGCTTTGGAAATAAATGGCAGTGCTTTGTTCACTTAAAGGGACCAA 

GCTAAATTTGTATTGGTTCATGTAGTGAAGTCAAACTGTTATTCAGAGATGTTTAATGCATA 

TTTAACTTATTTAATGTATTTCATCTCATGTTTTCTTATTGTCACAAGAGTACAGTTAATGC 

TGCGTGCTGCTGAACTCTGTTGGGTGAACTGGTATTGCTGCTGGAGGGCTGTGGGCTCCTCT 

GTCTCTGGAGAGTCTGGTCATGTGGAGGTGGGGTTTATTGGGATGCTGGAGAAGAGCTGCCA 

GGAAGTGTTTTTTCTGGGTCAGTAAATAACAACTGTCATAGGGAGGGAAATTCTCAGTAGTG 

ACAGTCAACTCTAGGTTACCTTTTTTAATGAAGAGTAGTCAGTCTTCTAGATTGTTCTTATA 

CCACCTCTCAACCATTACTCACACTTCCAGCGCCCAGGTCCAAGTCTGAGCCTGACCTCCCC 

TTGGGGACCTAGCCTGGAGTCAGGACAAATGGATCGGGCTGCAGAGGGTTAGAAGCGAGGGC 

ACCAGCAGTTGTGGGTGGGGAGCAAGGGAAGAGAGAAACTCTTCAGCGAATCCTTCTAGTAC 

TAGTTGAGAGTTTGACTGTGAATTAATTTTATGCCATAAAAGACCAACCCAGTTCTGTTTGA 

CTATGTAGCATCTTGAAAAGAAAAATTATAATAAAGCCCCAAAATTAAGAAAA 



WO 99/46281 




BNSOOCtD- <W0 9946281 A2J_> 



wo 99/46281 




PCT/US99/05028 



FIGURE 109 



</usr/seqcib2/sst/DNA/Dnaseqs.min/ss.DNA53977 
<subunit 1 of 1, 24 3 aa, 1 stop 
<MW: 27228, pi: 7.43, NX(S/T): 2 

MAKVEQVLSLEPOHELKFRGPFTDWTTNLKLGNPTDRNVCFKVKTTAPRRYCVRPNSGIID 
AGAS 1 NVS VMLQP FDYDPNEKSKHKFMVQSMFAPTDTS DMEAVWKEAKPEDLMDSKLRCVFE 
LPAENDKPHDVEINKIISTTASKTETPIVSKSLSSSLDDTEVKKVMEECKRLQGEVQRLREE 
NKQFKEEDGLRMRKTVQSNSPISALAPTGKEEGLSTRLLALWLFFIVGVIIGKIAL 

Important features: 

Putative transmembrane domain: 

amino acids 224-239 

N-glycosylation site. 

amino acids 68-71 

N-royristoylation site. 

amino acids 59-64, 64-69 and 235-240 
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FIGURE no 



GTCAGTCTTCTAGATTGTCCTTATCCCACCTTTCAACCANTACTCACATTTCNAGCGCCCAG 
GTCCANGTCTGAGCCTGACTTCCCCTTGGGGACCTAGCCTGGAGTCAGGACAATGGNTCGGG 
CTGCAGAGGNTTAGAAGCGAGGGCACCAGCAGTTTTGGGTGGGGAGCAAGGGNNGAGAGT^ 
CTCTTCAGCGAATCCTTCTAGTACTAGTTGAGAGTTTGACTGTGAATTAATTTTATGCCATA 
A7VAGACNAACCCAGTTCTGTTTGACTATGTAGCATCTTGAAAAGAAAAATTATAATAAAGCC 
CCAAAATTAAGAATTCTTTTGTCATTTTGTCACATTTGCTCTATGGGGGGAATTATTATTTT 
ATCATTTTTATTATTTTGCCATTGGAAGGTTAACTTTAAAATGAGC 
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FIGURE in 

TATTGTAAAGGCCATTTTAAACCATTGGTAGGCCTTGGTACATGATGCTGGATTACCTCCTT 
AAATGACACCNTTCCTCGCCTGTTGGTGCTGGCCNTTGGGGAGCTGGAGCCCCAGCATGCTG 
GGGAGTGCGGTCAGCTCCACACAGTAGTCCCCACGTGGCCCACTCCCGGCCCAGGCTGCTTT 
CCGTGTCTTCAGTTCTGTCCAAGCCATCAGCTCCTTGGGACTGATGAACAGAGTCAGAAGCC 
CAAAGGAATTGCCACTGTGGCAGCATCAGACGTACTCGTCATAAGTGAGAGGCGTGTGTTGA 
CTGATTGACCCAGCGCTTTGGAAATAAATGGCAGTGCTTTGTTCACTTAAAGGGACCAAGCT 
AAATTGTATTGGTTCATGTAGTGAAGTCAAACTGTTATTCAGAGATGTTTAATGCATATTTA 
ACTTATTTAATGTATTTCATCTCATGTTTTCTTATTGTCACAAGAGTACAGTTAATGCTGCG 
TGCTGCTGAACTCTGTTGGGTGAACTGGTATTGCTGCTGGAGGGCTG 
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FIRURE 112 

CCCTGGTGGTTTTGTTCTTTAATTCGTTGGTGTAATTNTTGGGAAGATTGCTTGTAGAGGTA 
GNATGCACCNGGCTGGTAAATTGGATTGGTGGATCCACCATATCCATGGGATTTAAATTTAT 
CATAACCATGTGTAAAAAGAAATTAATGTATGATGACATNTCACAGGTATTGCCTTTAAATT 
ACCCATCCCTGNANACACATACACAGATACACANANACAAATNTAATGTT^CGATNTTTTAG 
AAAGTTAAAAATGTATAGTAAC 
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FIGURE 113 

GGTGGCCCATTCCCGGCCCAGGCTGCTTTCCGGTNTTCAGTTCTGTCCAAGCCATCAGCTCC 
TTGGGACTGATGAACAGAGTCAGAAGCCCAAAGGAATTGCAGTGTGGCAGCATNAGACGTAC 
TTGTNATAAGTGAGAGGCGTGTGTTGACTGATTGACCCAGCGCTTTGGAAATAAATGGCAGT 
GCTTTGTTCANTTAAAGGGACCAAGCTAAATTTGTATTGGTTCATGTAGTGAAGTCAAACTG 
TTATTCAGAGATGTTTAATGCATATTTAANTTATTTAATGTATTTNATNTCATGTTTTCTTA 
TTGTCACAAGAGTACAGTTAATGCTGCGTGCTGCTGAANTNTGTTGGGTGAACTGGTATTGC 
TGCTGGAGGGCTGTGGGCTCCTCTGTCTTTGGAGAGTCTGGTCATGTGGAGGTGGG 
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FIGURE 114 

TGCTTTCCGTGTCTTCAGTTCTGTCCAAGCCATCAGCTCCTTGGGACTTGATGAACAGAGTC 
AGAAGCCCAAAGGAATTGCACTGTGGCAGCATCAGACGTACTCGTCATAAGTGAGAGGCGTG 
TGTTGACTGATTGACCCAGCGCTTTGGAAATAAATGGCAGTGCTTTGTTCACTTAAAGGGAC 
CAAGCTAAATTTGTATTGGTTCATGTAGTGAAGTCAAACTGTTATTCAGAGATGTTTAATGC 
ATATTTAACTTATTTAATGTATTTCATCTCATGTTTTCTTATTGTCACAAGAGTACAGTTAA 
TGCTGCGTGC 
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FIGURE lis 

AAACCTTTAAAAGTTGAGGGGAAAAGAATGATCCTTTATTAATGACAAGGGAAACCNTGNGT 
AATGCCACAATGGCATATTGTAAATGTCATTTTAAACATTGGTAGGCCTTGGTACATGATGC 
TGGATTACCTCTCTTAAAATGACACCCTTCCTCGCCTGTTGGTGCTGGCCCTTGGGGAGCTN 
GAGCCCAGCATGCTGGGGAGTGCGGTCTGCTCCACACAGTAGTCCCCANGTGGCCCANTCCC 
GGCCCAGGCTGCTTTCCGTGTCTTCAGTTCTGTCCAAGCCATCAGCTCCTTGGGANTGATGA 
ACAGAGTCAGAAGCCCAAAGGAATTGCANTGTGGCAGCATCAGANGTANTNGTCATAAGTGA 
GAGGCGTGTGTTGANTGATTGACCCAGCGCTTTGGAAATAAATGGCAGTGCTTTGTTCANTT 
AAAGGGNCCAAGNTAAATTTGTATTGGTTCATGTAGTGAAGTCAAANTGTTATTCAGAGATG 
TTTAATGCATATTTAAWTTATTTAATGTATTTCATNTCATGTTTTCTTATTGTCACAAGGGT 
ACAGTTAATGCTGCGTGCTGCTGAANTCTGTTGGGTGAANTGGTATTGCTG 
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FinURE 116 

GGCCCTTGGGGAGCTGGAGCCCAGCATGCTGGGGAGTGCGGTCAGCTCCACACAGTAGTCCC 
CACGTGGCCCACTCCCGGCCCAGGCTGCTTTCCGTGTCTTCAGTTCTGTCCAAGCCATCAGC 
TCCTTGGGACTGATGAACAGAGTCAGAAGCCCAAAGGAATTGCACTGTGGCAGCATCAGACG 
TACTCGTCATAAGTGAGAGGCGTGTGTTGACTGATTGACCCAGCGCTTTGGT^TAAATGGC 
AGTGCTTTGTTCACTTTVAAGGGACCAAGCTAAATTTGTATTGGTTCATGTAGTGAAGTCAAA 
CTGTTATTCAGAGATGTTTAATGCATATTTAACTTATTTAATGTATTTCATCTCATGTTTTC 
TTATTGTCACAAGAGTACAGTTAATGCTGCGTGCTGCTGAACTCTGTTGGGTGAACTGGTAT 
TGCTGCTGGAGGGCTGTGGGCTCCTCTGTCTCTGGAGAGTCTGGTCATGTGGAGGTGGG 
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FIGURE 117 

GCGAGCTCCGGGTGCTGTGGCCCGGCCTTGGCGGGGCGGCCTCCGGCTCAGGCTGGCTGAGA 
GGCTCCCAGCTGCAGCGTCCCCGCCCGCCTCCTCGGGAGC7CTGATCTCAGCTGACAGTGCC 
CTCGGGGACCAAACAAGCCTGGCAGGGTCTCACTTTGTTGCCCAGGCTGGAGTTCAGTGCCA 
TGATCATGGTTTACTGCAGCCTTGACCTCCTGGGTTCAAGCGATCCTGCTGAGTAGCTGGGA 
CTACAGGACAAAATTAGAAGATCAAAMSGAAAATATGCTGCTTTGGTTGATATTTTTCACC 
CCTGGGTGGACCCTCATTGATGGATCTGAAATGGAATGGGATTTTATGTGGCACTTGAGAAA 
GGTACCCCGGATTGTCAGTGAAAGGACTTTCCATCTCACCAGCCCCGCATTTGAGGCAGATG 
CTAAGATGATGGTAAATACAGTGTGTGGCATCGAATGCCAGAAAGAACTCCCAACTCCCAGC 
CTTTCTGAATTGGAGGATTATCTTTCCTATGAGACTGTCTTTGAGAATGGCACCCGAACCTT 
AACCAGGGTGAAAGTTCAAGATTTGGTTCTTGAGCCGACTCAAAATATCACCACAAAGGGAG 
TATCTGTTAGGAGAAAGAGACAGGTGTATGGCACCGACAGCAGGTTCAGCATCTTGGACAAA 
AGGTTCTTAACCAATTTCCCTTTCAGCACAGCTGTGAAGCTTTCCACGGGCTGTAGTGGCAT 
TCTCATTTCCCCTCAGCATGTTCTAACTGCTGCCCACTGTGTTCATGATGGAAAGGACTATG 
TCAAAGGGAGTAAAAAGCTAAGGGTAGGGTTGTTGAAGATGAGGAATAAAAGTGGAGGCAAG 
AAACGTCGAGGTTCTAAGAGGAGCAGGAGAGAAGCTAGTGGTGGTGACCAAAGAGAGGGTAC 
CAGAGAGCATCTGCAGGAGAGAGCGAAGGGTGGGAGAAGAAGAAAAAAATCTGGCCGGGGTC 
AGAGGATTGCCGAAGGGAGGCCTTCCTTTCAGTGGACCCGGGTCAAGAATACCCACATTCCG 
AAGGGCTGGGCACGAGGAGGCATGGGGGACGCTACCTTGGACTATGACTATGCTCTTCTGGA 
GCTGAAGCGTGCTCACAAAAAGAAATACATGGAACTTGGAATCAGCCCAACGATCAAGAAAA 
TGCCTGGTGGAATGATCCACTTCTCAGGATTTGATAACGATAGGGCTGATCAGTTGGTCTAT 
CGGTTTTGCAGTGTGTCCGACGAATCCAATGATCTCCTTTACCAATACTGCGATGCTGAGTC 
GGGCTCCACCGGTTCGGGGGTCTATCTGCGTCTGAAAGATCCAGACAAAAAGAATTGGAAGC 
GCAAAATCATTGCGGTCTACTCAGGGCACCAGTGGGTGGATGTCCACGGGGTTCAGAAGGAC 
TACAACGTTGCTGTTCGCATCACTCCCCTAAAATACGCCCAGATTTGCCTCTGGATTCACGG 
GAACGATGCCAATTGTGCTTACGGC1&&CAGAGACCTGAAACAGGGCGGTGTATCATCTAAA 
TCACAGAGAAAACCAGCTCTGCTTACCGTAGTGAGATCACTTCATAGGTTATGCCTGGACTT 
GAACTCTGTCAATAGCATTTCAACATTTTTCAAAATCAGGAGATTTTCGTCCATTTAAAAAA 
TGTATAGGTGCAGATATTGAAACTAGGTGGGCACTTCAATGCCAAGTATATACTCTTCTTTA 
CATGGTGATGAGTTTCATTTGTAGAAAAATTTTGTTGCCTTCTTAAAAATTAGACACACTTT 
AAACCTTCAAACAGGTATTATAAATAACATGTGACTCCTTAATGGACTTATTCTCAGGGTCC 
TACTCTAAGAAGAATCTAATAGGATGCTGGTTGTGTATTAAATGTGAAATTGCATAGATAAA 
GGTAGATGGTAAAGCAATTAGTATCAGAATAGAGACAGAAAGTTACAACACAGTTTGTACTA 
CTCTGAGATGGATCCATTCAGCTCATGCCCTCAATGTTTATATTGTGTTATCTGTTGGGTCT 
GGGACATTTAGTTTAGTTTTTTTGAAGAATTACAAATCAGAAGAAAAAGCAAGCATTATAAA 
CAAAACTAATAACTGTTTTACTGCTTTAAGAAATAACAATTACAATGTGTATTATTTAAAAA 
TGGGAGAAATAGTTTGTTCTATGAAATAAACCTAGTTTAGAAATAGGGAAGCTGAGACATTT 
TAAGATCTCAAGTTTTTATTTAACTAATACTCAAAATATGGACTTTTCATGTATGCATAGGG 
AAGACACTTCACAAATTATGAATGATCATGTGTTGAAAGCCACATTATTTTATGCTATACAT 
TCTATGTATGAGGTGCTACATTTTTAGGACAAAGAATTCTGTAATCTTTTTCAAGAAAGAGT 
CTTTTTCTCCTTGACAAAATCCAGCTTTTGTATGAGGACTATAGGGTGAATTCTCTGATTAG 
TAATTTTAGATATGTCCTTTCCTAAAAATGAATAAAATTTATGAATATGA 
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FIGURE 118 



</usr/seqdb2/sst/DNA/Dnaseqs,min/ss .DNA57253 
<subunit 1 of 1, 413 aa, 1 stop 
<MW: 47070, pi: 9.92, NX(S/T): 3 

MENMLLWLIFFTPGWTLIDGSEMEWDE^WHLRKVPRIVSERTFHLTSPAFEADAKMM^ 

GIECQKELPTPSLSELEDYLSYETVFENGTRTLTRVIO/QDLVLEPTQNITTKGVSVRRKRQV 

YGTDSRFSILDKRFLTNFPFSTAVKLSTGCSGILISPQHVLTAAHCVHDGKDYVKGSKKLRV 

GLLKMRNKSGGKKRRGSKRSRREASGGDQREGTREHLQERAKGGRRRKKSGRGQRIAEGRPS 

FQWTRVKNTHIPKGWARGGMGDATLDYDYALLELKRAHKKKYMELGISPTIKKMPGGMIHFS 

GFDNDRADQLVYRFCSVSDESNDLLYQYCDAESGSTGSGVYLRLKDPDKKNWKRKIIAVYSG 

HQWVDVHGVQKDYNVAVR I T PLK YAQ I CLW I HGNDANCAYG 

Important features: 
Signal peptide: 
amino acids 1-16 

H-glycosylation sites. 

amino acids 90-93, 110-113 and 193-196 

Glycosaminoglycan attachment site, 
amino acids 236-239 

Serine proteases, trypsin family, histidine active site, 
amino acids 165-170 
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FIGURE 119 



AATGTGAGAGGGGCTGATGGAAGCTGATAGGCAGGACTGGAGTGTTAGCACCAGTACTGGAT 

GTGACAGCAGGCAGAGGAGCACTTAGCAGCTTATTCAGTGTCCGATTCTGATTCCGGCAAGG 

ATCCAAGC^GAATGCTGCCGTCGGGCAACTCCTGGCACACTGCTCCTCTTTCTGGCTTTC 

CTGCTCCTGAGTTCCAGGACCGCACGCTCCGAGGAGGACCGGGACGGCCTATGGGATGCCTG 

GGGCCCATGGAGTGAATGCTCACGCACC7GCGGGGGAGGGGCCTCCTACTCTCTGAGGCGCT 

GCCTGAGCAGC7VAGAGCTGTGAAGGAAGAAATATCCGATACAGAACATGCAGTAATGTGGAC 

TGCCCACCAGAAGCAGGTGATTTCCGAGCTCAGCAATGCTCAGCTCATAATGATGTCAAGCA 

CCATGGCCAGTTTTATGAATGGCTTCCTGTGTCTAATGACCCTGACAACCCATGTTCACTCA 

AGTGCCAAGCCAAAGGAACAACCCTGGTTGTTGAACTAGCACCTAAGGTCTTAGATGGTACG 

CGTTGCTATACAGAATCTTTGGATATGTGCATCAGTGGTTTATGCCAAATTGTTGGCTGCGA 

TCACCAGCTGGGAAGCACCGTCAAGGAAGATAACTGTGGGGTCTGCAACGGAGATGGGTCCA 

CCTGCCGGCTGGTCCGAGGGCAGTATAAATCCCAGCTCTCCGCAACCAAATCGGATGATACT 

GTGGTTGCACTTCCCTATGGAAGTAGACATATTCGCCTTGTCTTAAAAGGTCCTGATCACTT 

ATATCTGGAAACCAAAACCCTCCAGGGGACTAAAGGTGAAAACAGTCTCAGCTCCACAGGAA 

CTTTCCTTGTGGACAATTCTAGTGTGGACTTCCAGAAATTTCCAGACAAAGAGATACTGAGA 

ATGGCTGGACCACTCACAGCAGATTTCATTGTCAAGATTCGTAACTCGGGCTCCGCTGACAG 

TACAGTCCAGTTCATCTTCTATCAACCCATCATCCACCGATGGAGGGAGACGGATTTCTTTC 

CTTGCTCAGCAACCTGTGGAGGAGGTTATCAGCTGACATCGGCTGAGTGCTACGATCTGAGG 

AGCAACCGTGTGGTTGCTGACCAATACTGTCACTATTACCCAGAGAACATCAAACCCAAACC 

CAAGCTTCAGGAGTGCAACTTGGATCCTTGTCCAGCCAGTGACGGATACAAGCAGATCATGC 

CTTATGACCTCTACCATCCCCTTCCTCGGTGGGAGGCCACCCCATGGACCGCGTGCTCCTCC 

TCGTGTGGGGGGGGCATCCAGAGCCGGGCAGTTTCCTGTGTGGAGGAGGACATCCAGGGGCA 

TGTCACTTCAGTGGAAGAGTGGAAATGCATGTACACCCCTAAGATGCCCATCGCGCAGCCCT 

GCAACATTTTTGACTGCCCTAAATGGCTGGCACAGGAGTGGTCTCCGTGCACAGTGACATGT 

GGCCAGGGCCTCAGATACCGTGTGGTCCTCTGCATCGACCATCGAGGAATGCACACAGGAGG 

CTGTAGCCCAAAAACAAAGCCCCACATAAAAGAGGAATGCATCGTACCCACTCCCTGCTATA 

AACCCAAAGAGAAACTTCCAGTCGAGGCCAAGTTGCCATGGTTCAAACAAGCTCAAGAGCTA 

GAAGAAGGAGCTGCTGTGTCAGAGGAGCCCTCG1&&GTTGTAAAAGCACAGACTGTTCTATA 

TTTGAAACTGTTTTGTTTAAAGAAAGCAGTGTCTCACTGGTTGTAGCTTTCATGGGTTCTGA 

ACTAAGTGTAATCATCTCACCAAAGCTTTTTGGCTCTCAAATTAAAGATTGATTAGTTTCAA 
AAAAAAAAA 
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</usr/seqdb2/sst/DNA/Dnaseqs,min/ss.DNA58847 
<subunit 1 of 1, 525 aa, 1 stop 
<MW: 58416, pi: 6.62, NX(S/T): 1 

MECCRRATPGTLLLFLAFLLLSSRTARSEEDRDGLWDAWGPWSECSRTCGGGASYSLRRCLS 
SKSCEGRNIRYRTCSNVDCPPEAGDFRAQQCSAHNDVKHHGQFYEWLPVSMDPDNPCSLKCQ 
AKGTTLWELAPKVLDGTRCYTESLDMCISGLCQIVGCDHQLGSTVKEDNCGVCNGDGSTCR 
LVRGQYKSQLSATKSDDTWALPYGSRHIRLVLKGPDHLYLETKTLQGTKGENSLSSTGTFL 
VDNSSVDFQKFPDKEILRMAGPLTADFIVKIRNSGSADSTVQFIFYQPIIHRWRETDFFPCS 
ATCGGGYQLTSAECYDLRSNRWTUDQYCHYYPENIKPKPKLQECNLDPCPASDGYKQIMPYD 
LYHPLPRWEATPWTACSSSCGGGIQSRAVSCVEEDIQGHVTSVEEWKCMYTPKMPIAQPCNI 
FDCPKWLAQEWSPCTVTCGQGLRYRWLCIDHRGMHTGGCSPKTKPHIKEECIVPTPCYKPK 
EKLPVEAKLPWFKQAQELEEGAAVSEEPS 

Important features: 
Signal peptide: 

amino acids 1-25 

N-glycosylation site. 

amino acids 251-254 

Thrombospcndin 1 
amino acids 385-399 

von Willebrand factor type C domain proteins 
amino acids 385-399, 445-459 and 42-56 
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F IGURE 12 1 

CGGACGCGTGGGCGGCGGCTGCGGAACTCCCGTGGAGGGGCCGGTGGGCCCTCGGGCCTGAC 
AGMfiGCAGTGGCCACTGCGGCGGCAGTACTGGCCGCTCTGGGCGGGGCGCTGTGGCTGGCG 
GCCCGCCGGTTCGTGGGGCCCAGGGTCCAGCGGCTGCGCAGAGGCGGGGACCCCGGCCTCAT 
GCACGGGAAGACTGTGCTGATCACCGGGGCGAACAGCGGCCTGGGCCGCGCCACGGCCGCCG 
AGCTACTGCGCCTGGGAGCGCGGGTGATCATGGGCTGCCGGGACCGCGCGCGCGCCGAGGAG 
GCGGCGGGTCAGCTCCGCCGCGAGCTCCGCCAGGCCGCGGAGTGCGGCCCAGAGCCTGGCGT 
CAGCGGGGTGGGCGAGCTCATAGTCCGGGAGCTGGACCTCGCCTCGCTGCGCTCGGTGCGCG 
CCTTCTGCCAGGAAATGCTCCAGGAAGAGCCTAGGCTGGATGTCTTGATCAATAACGCAGGG 
ATCTTCCAGTGCCCTTACATGAAGACTGAAGATGGGTTTGAGATGCAGTTCGGAGTGAACCA 
TCTGGGGCACTTTCTACTCACCAATCTTCTCCTTGGACTCCTCAAAAGTTCAGCTCCCAGCA 
GGATTGTGGTAGTTTCTTCCAAACTTTATAAATACGGAGACATCAATTTTGATGACTTGAAC 
AGTGAACAAAGCTATAATAAAAGCTTTTGTTATAGCCGGAGCAAACTGGCTAACATTCTTTT 
TACCAGGGAACTAGCCCGCCGCTTAGAAGGCACAAATGTCACCGTCAATGTGTTGCATCCTG 
GTATTGTACGGACAAATCTGGGGAGGCACATACACATTCCACTGTTGGTCAAACCACTCTTC 
AATTTGGTGTCATGGGCTTTTTTCAAAACTCCAGTAGAAGGTGCCCAGACTTCCATTTATTT 
GGCCTCTTCACCTGAGGTAGAAGGAGTGTCAGGAAGATACTTTGGGGATTGTAAAGAGGAAG 
AACTGTTGCCCAAAGCTATGGATGAATCTGTTGCAAGAAAACTCTGGGATATCAGTGAAGTG 
ATGGTTGGCCTGCTAAAAX&fiGAACAAGGAGTAAAAGAGCTGTTTATAAAACTGCATATCAG 
TTATATCTGTGATCAGGAATGGTGTGGATTGAGAACTTGTTACTTGAAGAAAAAGAATTTTG 
ATATTGGAATAGCCTGCTAAGAGGTACATGTGGGTATTTTGGAGTTACTGAAAAATTATTTT 
TGGGATAAGAGAATTTCAGCAAAGATGTTTTAAATATATATAGTAAGTATAATGAATAATAA 
GTACAATGAAAAATACAATTATATTGTAAAATTATAACTGGGCAAGCATGGATGACATATTA 
ATATTTGTCAGAATTAAGTGACTCAAAGTGCTATCGAGAGGTTTTTCAAGTATCTTTGAGTT 
TCATGGCCAAAGTGTTAACTAGTTTTACTACAATGTTTGGTGTTTGTGTGGAAATTATCTGC 
CTGGTGTGTGCACACAAGTCTTACTTGGAATAAATTTACTGGTAC 
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FIGURE 122 

</usr/seqclb2/sst/DNA/Dnaseqs.min/ss.DNA58747 
<subunit 1 of 1, 336 aa, 1 stop 
<MW: 36865, pi: 9.15, NX(S/T): 2 

MAVATAAAVLAALGGALWLAARRFVGPRVQRLRRGGDPGLMHGKTVLITGANSGLGRATAAE 
LLRLGARVIMGCRDRARAEEAAGQLRRELRQAAECGPEPGVSGVGELIVRELDLASLRSVRA 
FCQEMLQEEPRLDVLINNAGIFQCPYMKTEDGFEMQFGVNHLGHFLLTNLLLGLLKSSAPSR 
IVWSSKLYKYGDINFDDLNSEQSYNKSFCYSRSKLANILFTRELARRLEGTNVTVNVLHPG 
IVRTNLGRHIHIPLLVKPLFNLVSWAFFKTPVEGAQTSIYLASSFEVEGVSGRYFGDCKEEE 
LLPKAMDESVARKLWDI SEVMVGLLK 

Important features: 
Signal peptide: 
amino acids 1-21 

Short-chain alcohol dehydrogenase family protein 
amino acids 134-144, 44-56 and 239-248 

M-glycosylation site. 

amino acids 212-215 and 239-242 
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FIGURE 123 

GGGGATTGTAAAGAGGAAGNACTGTGCCCAAAGNTATGGATGAATCTGTTGCAAGAAAATTN 

TGGGATATCAGTGAAGTGATGGTTNGCCTGCTAAAATAGGAACAAGGAGTAAAAGAGCTGTT 

TATAAAACTGCATATCAGTTATATCTGTGATCAGGAATGGTGTGGATTGAGAACTTGTTACT 

TGAAGAAAAAGAATTTTGATATTGGAATAGCCTGNTAAGAGGNACATGTGGGTATTTTGGAG 

TTACTGAAAAATTATTTTTGGGATAAGAGAATTTCAGCAAAGATGTTTTAAATATATATAGT 

AAGTATAATGAATAATAAGTACAATGAAAAATACAATTATATTGTAAAATTATAACTGGGCA 

AGCATGGATGACATATTAATATTTGTCAGAATTAAGTGACTCAAAGTGCTATCGAGAGGTTT 

TTCAAGTATCTTTGAGTTTCATGGCCAAAGTGTTAACTAGTTTTACTACAATGTTTGGTGTT 
TGTGTGGAAATTATCTGCCTGGCTT 
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FIGURE 124 

GAGAGGACGAGGTGCCGCTGCCTGGAGAATCCTCCGCTGCCGTCGGCTCCCGGAGCCCAGCC 

CTTTCCTAACCCAACCCAACCTAGCCCAGTCCCAGCCGCCAGCGCCTGTCCCTGTCACGGAC 

CCCAGCGTTACCMiSCATCCTGCCGTCTTCCTATCCTTACCCGACCTCAGATGCTCCCTTCT 

GCTCCTGGTAACTTGGGTTTTTACTCCTGTAACAACTGAAATAACAAGTCTTGCTACAGAGA 

ATATAGATGAAATTTTAAACAATGCTGATGTTGCTTTAGTAAATTTTTATGCTGACTGGTGT 

CGTTTCAGTCAGATGTTGCATCCAATTTTTGAGGAAGCTTCCGATGTCATTAAGGAAGAATT 

TCCAAATGAAAATCAAGTAGTGTTTGCCAGAGTTGATTGTGATCAGCACTCTGACATAGCCC 

AGAGATACAGGATAAGCAAATACCCAACCCTCAAATTGTTTCGTAATGGGATGATGATGAAG 

AGAGAATACAGGGGTCAGCGATCAGTGAAAGCATTGGCAGATTACATCAGGCAACAAAAAAG 

TGACCCCATTCAAGAAATTCGGGACTTAGCAGAAATCACCACTCTTGATCGCAGCAAAAGAA 

ATATCATTGGATATTTTGAGCAAAAGGACTCGGACAACTATAGAGTTTTTGAACGAGTAGCG 

AATATTTTGCATGATGACTGTGCCTTTCTTTCTGCATTTGGGGATGTTTCAAAACCGGAAAG 

ATATAGTGGCGACAACATAATCTACAAACCACCAGGGCATTCTGCTCCGGATATGGTGTACT 

TGGGAGCTATGACAAATTTTGATGTGACTTACAATTGGATTCAAGATAAATGTGTTCCTCTT 

GTCCGAGAAATAACATTTGAAAATGGAGAGGAATTGACAGAAGAAGGACTGCCTTTTCTCAT 

ACTCTTTCACATGAAAGAAGATACAGAAAGTTTAGAAATATTCCAGAATGAAGTAGCTCGGC 

AATTAATAAGTGAAAAAGGTACAATAAACTTTTTACATGCCGATTGTGACAAATTTAGACAT 

CCTCTTCTGCACATACAGAAAACTCCAGCAGATTGTCCTGTAATCGCTATTGACAGCTTTAG 

GCATATGTATGTGTTTGGAGACTTCAAAGATGTATTAATTCCTGGAAAACTCAAGCAATTCG 

TATTTGACTTACATTCTGGAAAACTGCACAGAGAATTCCATCATGGACCTGACCCAACTGAT 

ACAGCCCCAGGAGAGCAAGCCCAAGATGTAGCAAGCAGTCCACCTGAGAGCTCCTTCCAGAA 

ACTAGCACCCAGTGAATATAGGTATACTCTATTGAGGGATCGAGATGAGCTT1&&A7ACTTG 

AAAAACAGTTTGTAAGCCTTTCAACAGCAGCATCAACCTACGTGGTGGAAATAGTAAACCTA 

TATTTTCATAATTCTATGTGTATTTTTATTTTGAATAAACAGAAAGAAATTTAAAAAAAAAA 

aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa;^ 
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FIGURE 12B 

</usr/seqclb2/sst/DNA/Dnaseqs.inin/ss-DNA57 689 
<subunit 1 of 1, 406 aa, 1 stop 
<MW: 46927, pi: 5.21, NX(S/T): 0 

MHPAVFLSLPDLRCSLLLLVTWVFTPVTTEITSLATENIDEILNNADVALVNFYADWCRFSQ 
MLHPIFEEASDVIKEEFPNENQWFARVDCDQKSDIAQRYRISKYPTLKLFRNGMMMKREYR 
GQRSVKALADYIRQQKSDPIQEIRDLAEITTLDRSKRNIIGYFEQKDSDNYRVFERVANILH 
DDCAFLSAFGDVSKPERYSGDNIIYKPPGHSAPDMVYLGAMTNFDVTYNWIQDKCVPLVREI 
TFENGEELTEEGLP FL I L FHMKEDTESLE I FQNE VARQLI SEKGT INFLHADCDKFRHPLLH 

IQKTPADCPVIAIDSFRHMYVFGDFKDVLIPGKLKQFVFDLHSGKLHREFHHGPDPTDTAPG 
EQAQDVASSPPESSFQKLAPSEYRYTLLRDRDEL 

Important features: 
Signal peptide: 
amino acids 1-29 

Endoplasmic reticulum targeting sequence. 

amino acids 403-406 

Tyrosine kinase phosphorylation site. 

amino acids 203-211 

Thioredoxin family proteins 
amino acids 50-66 
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ATTAAGGAAGAATTTCCAAATGAAAATCAAGTAGTNTTTGCCAGAGTNGATTGTGATCAGCA 
CTCTGACATAGCCCAGAGATACAGGATAAGCAAATACCCAACCCTCAAATTGTTTCGTAATG 
GGATGATGATGAAGAGAGAATACAGGGGTCAGCGATCAGTGAAAGCATTGGCAGATTA 



BNSOOCID' <W0 M46281A2J^ 
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FIGURE 127 



AGAGGCCTCTCTGGAAGTTGTCCCGGGTGTTCGCCGCNGGAGCCCGGGTCGAGAGGACNAGG 
TGCCGCTGCCTGGAGAATCCTCCGCTGCCGTCGGCTCCCGGAGCCCAGCCCTTTCCTAACCC 
AACCCAACCTAGCCCNGTCCCAGCCGCCAGCGCCTGTCCCTGTCNCGGANCCCAGCGTNACC 
ATGCATCCTGCCGTCTTCCTATCCTTACCCGACCTCAGATGCTCCCTTCTGCTCCTGGTAAC 
TTGGGTTTTTACTCCTGTAACAACTGAAATAACNNGTCTTGATACNNAGAATATAGATGAAA 
TTTTAAACNATGCTGATGTGGCTTTAGTCAATTTTTATGCTGACTGGTGTCGTTTCAGTCAG 
ATGTGGCATCCAATTTTTGAGGANGCTTCCGATGTCATTAAGGAAGAATTTCCAAATGAAAA 
TCAAGTAGTGTTTGCCAGAGTTGATTGTGATCAGCACTCTGACATAGCCCAGAGATACAGGA 
TAAGCAAATACCCAACCCTCAAATTGTTTCGTAATGGGATGATGATGAAGAGAGAATACAGG 
GGTCAGCGATCAGTGATWVGCATTGGCAGATTACATCAGGC 
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GCCCACGCGTCCGMSGCGTTCACGTTCGCGGCCTTCTGCTACATGCTGGCGCTGCTGCTCA 
CTGCCGCGCTCATCTTCTTCGCCATTTGGCACATTATAGCATTTGATGAGCTGAAGACTGAT 
TACAAGAATCCTATAGACCAGTGTAATACCCTGAATCCCCTTGTACTCCCAGAGTACCTCAT 
CCACGCTTTCTTCTGTGTCATGTTTCTTTGTGCAGCAGAGTGGCTTACACTGGGTCTCAATA 
TGCCCCTCTTGGCATATCATATTTGGAGGTATATGAGTAGACCAGTGATGAGTGGCCCAGGA 
CTCTATGACCCTACAACCATCATGAATGCAGATATTCTAGCATATTGTCAGAAGGAAGGATG 
GTGCAAATTAGCTTTTTATCTTCTAGCATTTTTTTACTACCTATATGGCATGATCTATGTTT 
TGGTGAGCTCTTASAACAACACACAGAAGAATTGGTCCAGTTAAGTGCATGCAAAAAGCCAC 
CAAATGAAGGGATTCTATCCAGCAAGATCCTGTCCAAGAGTAGCCTGTGGAATCTGATCAGT 
TACTTTAAAAAATGACTCCTTATTTTTTAAATGTTTCCACATTTTTGCTTGTGGAAAGACTG 
TTTTCATATGTTATACTCAGATAAAGATTTTAAATGGTATTACGTATAAATTAATATAAAAT 
GATTACCTCTGGTGTTGACAGGTTTGAACTTGCACTTCTTAAGGAACAGCCATAATCCTCTG 
AATGATGCATTAATTACTGACTGTCCTAGTACATTGGAAGCTTTTGTTTATAGGAACTTGTA 
GGGCTCATTTTGGTTTCATTGAAACAGTATCTAATTATAAATTAGCTGTAGATATCAGGTGC 
TTCTGATGAAGTGAAAATGTATATCTGACTAGTGGGAAACTTCATGGGTTTCCTCATCTGTC 
ATGTCGATGATTATATATGGATACATTTACA/^AAATAATVAAGCGGGAATTTTCCCTTCGCTT 
GAATATTATCCCTGTATATTGCATGAATGAGAGATTTCCCATATTTCCATCAGAGTAATAAA 
TATACTTGCTTTAATTCTTAAGCATAAGTAAACATGATATAAAAATATATGCTGAATTACTT 
GTGAAGAATGCATTTAAAGCTATTTTAAATGTGTTTTTATTTGTAAGACATTACTTATTAAG 
AAATTGGTTATTATGCTTACTGTTCTAATCTGGTGGTAAAGGTATTCTTAAGAATTTGCAGG 
TACTACAGATTTTCAAAACTGAATGAGAGAAAATTGTATAACCATCCTGCTGTTCCTTTAGT 
GCAATACAATAAAACTCTGAAATTAAGACTC 
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FIGURE 129 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss,DNA23330 
<subunit 1 of 1, 144 aa, i stop 
<MW: 16699, pi: 5,60, NX(S/T); 0 

MAFTFAAFCYMLALLLTAALIFFAIWKIIAFDELKTDYKNPIDQCNTLNPLVLPEYLIHAFF 

CVMFLCAAEWLTLGLNMPLLAYHIWRYMSRPVMSGPGLYDPTTI^INADIIAYCQKEGWCKLA 
FYLLAFFYYLYGMIYVLVSS 

Important features: 
Signal peptide: 

amino acids 1-20 

Type II transmembrane domain: 
amino acids 11-31 

Other transmembrane domain: 
amino acids 57-77 and 123-143 
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FIGURE 130 

ATTATAGCATTTGATGAGCTGAAGACTGATTACAAGATCCTATAGACCAGTGTAATACCCTG 
AATCCCCTTGTACTCCCAGAGTACCTCATCCACGCTTTCTTCTGTGTCATGTTTCTTTGTGC 
AGCAGAGTGGCTTACACTGGGTCTCAATATGCCCCTCTTGGCATATCATATTTGGAGGTATA 
TGAGTAGACCAGTGATGAGTGGCCCAGGACTCTATGACCCTACAACCATCATGAATGCAGAT 
ATTCTAGCATATTGTCAGAAGGAAGGATGGTGCAAATTAGCTTTTTATCTTCTAGCATTTTT 
TTACTACCTATATGGCATGATCTATGTTTTGGTGAGCTCTTAGAACAACACACAGAAGAATT 
GGTCCAGTTAAGTGCATGCAAAAAGCCACCAAATGAAGGGATTCTATCCAGCAAGATCCTGT 
CCAAGAGTAGCCTGTGGAATCTGATCAGTTACTTTAAAAAATG 
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FIGURE 131 



CGGACGCGTGGGGGAAACCCTTCCGAGAAAACAGCAACAAGCTGAGCTGCTGTGACAGAGGG 
GAACAAGMSGCGGCGCCGAAGGGGAGCCTCTGGGTGAGGACCCAACTGGGGCTCCCGCCGC 
TGCTGCTGCTGACCATGGCCTTGGCCGGAGGTTCGGGGACCGCTTCGGC7GAAGCATTTGAC 
TCGGTCTTGGGTGATACGGCGTCTTGCCACCGGGCCTGTCAGTTGACCTACCCCTTGCACAC 
CTACCCTAAGGAAGAGGAGTTGTACGCATGTCAGAGAGGTTGCAGGCTGTTTTCAATTTGTC 
AGTTTGTGGATGATGGAATTGACTTAAATCGAACTAAATTGGAATGTGAATCTGCATGTACA 
GAAGCATATTCCCAATCTGATGAGCAATATGCTTGCCATCTTGGTTGCCAGAATCAGCTGCC 
ATTCGCTGAACTGAGACAAGAACAACTTATGTCCCTGATGCCAAAAATGCACCTACTCTTTC 
CTCTAACTCTGGTGAGGTCATTCTGGAGTGACATGATGGACTCCGCACAGAGCTTCATAACC 
TCTTCATGGACTTTTTATCTTCAAGCCGATGACGGAAAAATAGTTATATTCCAGTCTAAGCC 
AGAAATCCAGTACGCACCACATTTGGAGCAGGAGCCTACAAATTTGAGAGAATCATCTCTAA 
GCAAAATGTCCTATCTGCAAATGAGAAATTCACAAGCGCACAGGAATTTTCTTGAAGATGGA 
GAAAGTGATGGCTTTTTAAGATGCCTCTCTCTTAACTCTGGGTGGATTTTAACTACAACTCT 
TGTCCTCTCGGTGATGGTATTGCTTTGGATTTGTTGTGCAACTGTTGCTACAGCTGTGGAGC 
AGTATGTTCCCTCTGAGAAGCTGAGTATCTATGGTGACTTGGAGTTTATGAATGAACAAAAG 
CTAAACAGATATCCAGCTTCTTCTCTTGTGGTTGTTAGATCTAAAACTGAAGATCATGAAGA 
AGCAGGGCCTCTACCTACAAAAGTGAATCTTGCTCATTCTGAAATliaAGCATTTTTCTTTT 
AAAAGACAAGTGTAATAGACATCTAAAATTCCACTCCTCATAGAGCTTTTAAAATGGTTTCA 
TTGGATATAGGCCTTAAGAAATCACTATAAAATGCAAATAAAGTTACTCAAATCTGTG 
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FIGURE 132 



</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA26847 
<subunit 1 of 1, 323 aa, 1 stop 
<MW: 36223, pi: 5.06, NX(S/T): 1 

MAAPKGSLWVRTQLGLPPLLLLTMALAGGSGTASAEAFDSVLGDTASCHRACQLTYPLHTYP 
KEEELYACQRGCRL FS I CQFVDDGI DLNRTKLECESACTEAYSQSDEQYACHLGCQNQLPFA 
ELRQEQLMSLMPKMHLLFPLTLVRSFWSDMMDSAQSFITSSWTFYLQADDGKIVIFQSKPEI 
QYAPHLEQEPTNLRESSLSKMSYLQMRNSQAHRNFLEDGESDGFLRCLSLNSGWILTTTLVL 
SVMVLLWI CCATVATAVEQYVPSEKLS I YGDLEFMNEQKLNRYPASSLVWRSKTEDHEEAG 
PLPTKVNLAHSEI 

Important features : 
Signal peptide: 
amino acids 1-31 

Transmeinbrane domain: 

amino acids 241-260 

N-glycosylation site. 

amino acids 90-93 
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FIGURE 133 

TTGGGTGATACGGCGTCTTGCCACCGGGCCTGTCAGTTGACCTACCCCTTGCACACCTACCC 
TAAGGAAGAGGAGTTGTACGCATGTCAGAGAGGTTGCAGGCTGTTTTCAATTTGTCAGTTTG 
TGGATGATGGAATTGACTTAAATCGAACTAAATTGGAATGTGAATCTGCATGTACAGAAGCA 
TATTCCCAATCTGATGAGCAATATGCTTGCCATCTTGGTTGCCAGAATCAGCTGCCATTCGC 
TGAACTGAGACAAGAACAACTTATGTCCCTGATGCCAAAAATGCACCTACTCTTTCCTCTAA 
CTCTGGTGAGGTCATTCTGGAGTGACATGATGGACTCCGC 
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CACACTGGCCGGATCTTTTAGAGTCCTTTGACCTTGACCAAGGGTCNGGAAAACAGCAACAA 
GCTGAGCTGCTGTGACAGAGGGAACAAGATGGCGGCGCCGAAGGGAGCCTTTGGGTGAGGAC 
CCAACTGGGGCTCCCGCCGCTGCTGCTGCTGACCATGGCCTTGGCCGGAGGTTCGGGGACCG 
CTTCGGCTGAAGCATTTGACTCGGTCTTGGGTGATACGGCGTCTTGCCACCGGGCCTGTCAG 
TTGACCTACCCCTTGCACACCTACCCTAAGGAAGAGGAGTTGTACGCATGTCAGAGAGGTTG 
CAGGCTGTTTTCAATTTGTCAGTTTGTGGATGATGGAATTGACTTAAATCGAACTAAATTGG 
AATGTGAATCTGCATGTACAGAAGCATATTCCCAATCTGATGAGCAATATGCTTGCCATCTT 
GGTTGCCAGAATCAGCTGCCATTCGCTGAACTGAGACAAGAACAACTTATGTCCCTGATGCC 
AAAAATGCACCTACTCTTTCCTCTAACTCTGGTGAGGTCATTCTGGAGTGACATGATGGACT 

CCGC 
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FIGURE 135 



GCGAGGTGGCGATCGCTGAGAGGCAGGAGGGCCGAGGCGGGCCTGGGAGGCGGCCCGGAGGT 

GGGGCGCCGCTGGGGCCGGCCCGCACGGGCTTCATCTGAGGGCGCACGGCCCGCGACCGAGC 

GTGCGGACTGGCCTCCCAAGCGTGGGGCGACAAGCTGCCGGAGCTGCAJ^GGCCGCGGCTG 

GGGATTCTTGTTTGGCCTCCTGGGCGCCGTGTGGCTGCTCAGCTCGGGCCACGGAGAGGAGC 

AGCCCCCGGAGACAGCGGCACAGAGGTGCTTCTGCCAGGTTAGTGGTTACTTGGATGATTGT 

ACCTGTGATGTTGAAACCATTGATAGATTTAATAACTACAGGCTTTTCCCAAGACTACAAAA 

ACTTCTTGAAAGTGACTACTTTAGGTATTACAAGGTAAACCTGAAGAGGCCGTGTCCTTTCT 

GGAATGACATCAGCCAGTGTGGAAGAAGGGACTGTGCTGTCAAACCATGTCAATCTGATGAA 

GTTCCTGATGGAATTAAATCTGCGAGCTACAAGTATTCTGAAGAAGCCAATAATCTCATTGA 

AGAATGTGAACAAGCTGAACGACTTGGAGCAGTGGATGAATCTCTGAGTGAGGAAACACAGA 

AGGCTGTTCTTCAGTGGACCAAGCATGATGATTCTTCAGATAACTTCTGTGAAGCTGATGAC 

ATTCAGTCCCCTGAAGCTGAATATGTAGATTTGCTTCTTAATCCTGAGCGCTACACTGGTTA 

CAAGGGACCAGATGCTTGGAAAATATGGAATGTCATCTACGAAGAAAACTGTTTTAAGCCAC 

AGACAATTAAAAGACCTTTAAATCCTTTGGCTTCTGGTCAAGGGACAAGTGAAGAGAACACT 

TTTTACAGTTGGCTAGAAGGTCTCTGTGTAGAAAAAAGAGCATTCTACAGACTTATATCTGG 

CCTACATGCAAGCATTAATGTGCATTTGAGTGCAAGATATCTTTTACAAGAGACCTGGTTAG 

AAAAGAAATGGGGACACAACATTACAGAATTTCAACAGCGATTTGATGGAATTTTGACTGAA 

GGAGAAGGTCCAAGAAGGCTTAAGAACTTGTATTTTCTCTACTTAATAGAACTAAGGGCTTT 

ATCCAAAGTGTTACCATTCTTCGAGCGCCCAGATTTTCAACTCTTTACTGGAAATAAAATTC 

AGGATGAGGAAAACAAAATGTTACTTCTGGAAATACTTCATGAAATCAAGTCATTTCCTTTG 

CATTTTGATGAGAATTCATTTTTTGCTGGGGATAAAAAAGAAGCACACAAACTAAAGGAGGA 

CTTTCGACTGCATTTTAGAAATATTTCAAGAATTATGGATTGTGTTGGTTGTTTTAAATGTC 

GTCTGTGGGGAAAGCTTCAGACTCAGGGTTTGGGCACTGCTCTGAAGATCTTATTTTCTGAG 

AAATTGATAGCAAATATGCCAGAAAGTGGACCTAGTTATGAATTCCATCTAACCAGACAAGA 

AATAGTATCATTATTCAACGCATTTGGAAGAATTTCTACAAGTGTGAAAGAATTAGAAAACT 

TCAGGAACTTGTTACAGAATATTCATiaAAGAAAACAAGCTGATATGTGCCTGTTTCTGGAC 

AATGGAGGCGAAAGAGTGGAATTTCATTCAAAGGCATAATAGCAATGACAGTCTTAAGCCAA 

ACATTTTATATAAAGTTGCTTTTGTAAAGGAGAATTATATTGTTTTAAGTAAACACATTTTT 

AAAAATTGTGTTAAGTCTATGTATAATACTACTGTGAGTAAAAGTAATACTTTAATAATGTG 

GTACAAATTTTAAAGTTTAATATTGAATAAAAGGAGGATTATCAAATTAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 136 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA53974 
<subunit 1 of 1, 468 aa, 1 stop 
<MW: 54393, pi: 5.63, NX(S/T): 2 

MGRGWGFLFGLLGAVWLLSSGHGEEQPPETAAQRCFCQVSGYLDDCTCDVETIDRFNNYRLF 
PRLQKLLESDYFRYYKVNLKRPCPFWNDISQCGRRDCAVKPCQSDEVPDGIKSASYKYSEEA 
NNLIEECEQAERLGAVDESLSEETQKAVLQWTKHDDSSDNFCEADDIQSPEAEYVDLLLNPE 
RYTGYKGPDAWKIWNVIYEENCFKPQTIKRPLNPLASGQGTSEENTFYSWLEGLCVEKRAFY 
RLISGLHASINVHLSARYLLOETWLEKKWGHNITEFQQRFDGILTEGEGPRRLKNLYFLYLI 
ELRALSKVLPFFERPDFQLFTGNKIQDEENKMLLLEILHEIKSFPLHFDENSFFAGDKKEM 
KLKEDFRLHFRNISRIMDCVGCFKCRLWGKLQTQGLGTALKILFSEKLIANMPESGPSYEFH 
LTRQEIVSLFNAFGRISTSVKELENFRNLLQNIH 

Important features: 
Signal peptide: 

amino acids 1-23 

N-glycosylation site. 

amino acids 280-283 and 384-387 

Amidation site. 

amino acids 94-97 

Glycosaminoglycan attachment site. 

amino acids 20-23 and 223-226 

Aminotransferases class-V pyridoxal -phosphate 
amino acids 216-222 

Interleukin-7 proteins 
amino acids 338-343 
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FIGURE 137 

GCTGGAAATATGGATGTCATCTACGAGAAACTGTTTTAAGCCACAGACAATTAAAAGACCTT 

TAAATCCTTTGGCTTCTGGTCAAGGGACAAGTGAAGAGNACACTTTTTACAGTTGGCTAGAA 

GGTCTCTGTGTAGAAAAAAGAGCATTCTACAGACTTATATCTGGCCTACATGCAAGCATTAA 

TGTGCATTTGAGTGCAAGATATCTTTTACAAGAGACCTGGTTAGAAAAGAAATGGGGACACA 

ACATTACAGAATTTNAACAGCGATTTGATGGAATTTTGACTGAAGGAGAAGGTCCAAGAAGG 

CTTAAGAACTTGTATTTTCTCTACTTAATAGAACTAAGGGCTTTATCCAAAGTGTTACCATT 

CTTNGAGCGCCCAGATTTTCAACTNTTTACTGGAAATAAAATTCAGGATGAGGNAAACAAAA 

TGTTACTTTTGGAAATACTTCATGAAATCAAGTCATTTCCTTTGCATTTTGATGAGAATTCA 
TTTTTTTGCTG 
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cggacgcgtgggcggacgcgtgggcggacgcgtgggttgggagggggcaggatgggagggaa 
agtgaagaaaacagaaaaggagagggacagaggccagaggacttctcatactggacagaaac 
cgatcaggcmfigaactccccttcgtcactcacctgttcttgcccctggtgttcctgacagg 
tctctgctccccctttaacctggatgaacatcacccacgcctattcccagggccaccagaag 
ctgaatttggatacagtgtcttacaacatgttgggggtggacagcgatggatgctggtgggc 
gccccctgggatgggccttcaggcgaccggaggggggacgtttatcgctgccctgtaggggg 
ggcccacaatgccccatgtgccaagggccacttaggtgactaccaactggga;\attcatctc 

ATCCTGCTGTGAATATGCACCTGGGGATGTCTCTGTTAGAGACAGATGGTGATGGGGGATTC 

atggtgagcimggagagggtggtggcagtgtctctgaaggtccataaaagaaaaaagagaa 

GTGTGGTAAGGGAAAATGGTCTGTGTGGAGGGGTCAAGGAGTTAAAAACCCTAGAAAGCAAA 

aggtaggtaatgtcagggagtagtcttcatgcctccttcaactgggagcatgttctgagggt 

GCCCTCCCAAGCCTGGGAGTAACTATTTCCCCCATCCCCAGGCCTGTGCCCCTCTCTGGTCT 

cgtgcttgtggcagctctgtcttcagttctgggatatgtgcccgtgtggatgcttcattcca 
gcctcagggaagcctggcacccactgcccaacgtgagccagaggaaggctgagtacttggtt 
cccagaaggagatactgggtgggaaaaagatggggcaaagcggtatgatgcctggcatuvggg 

CCTGCATGGCTATCCTCATTGCTACCTAATGTGCTTGCAAAAGCTCCATGTTTCCTAACAGA 

ttcagactcctggccaggtgtggtggcccacacctgtaattctagcactttgggaggccaag 
gtgggcagatcacttgaggtcaggagttcaagaccagcctggccaacatggtgaaactccat 
ctctactaaaaaaaaaaaaatacaaaaattagctgggtgcgctagtgcatgcctgtaatctc 

ATCTACTCGGGAGGCTAAGACAGGAGACTCTCACTTCAACCCAGGAGGTGGAGGTTGCGGTG 

agccaagattgtgcctctgcactctagcgtgggtgacagagtaagcgagactccatctcaaa 

AATAATAATAATAATAATTCAGACTCCTTATCAGGAGTCCATGATCTGGCCTGGCACAGTAA 

ctcatgcctgtaatcccaacattttgggaggccaacgcaggaggattgcttgaggtctggag 

GTTTGAGACCAGCCTGGGCAACATAGAAAGACCCCATCTCTAAATAAATGTTTTAAAAAT 
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FIGURE 139 



></usr/seqcib2/sst/DNA/Dnaseqs.min/ss.DNA57039 
xsubunit 1 of 1, 124 aa, 1 stop 
XMW: 13352; pi: 5.99, NX(S/T): 1 

MELPFVTHLFLPLVFLTGLCSPFNLDEHHPRLFPGPPEAEFGYSVLQHVGGGQRWMLVGAPW 
DGPSGDRRGDVYRCPVGGAHNAFCAKGHLGDYQLGNSSHPAVNMHLGMSLLETDGDGGFMVS 

Important features: 
Signal peptide: 

amino acids 1-22 

Cell attachment secfuence. 

amino acids 70-73 

N-glycosylation site. 

amino acids 98-101 

Integrins alpha chain proteins 
amino acids 67-81 




8NS0OCID- <W0 9946281 A2.L> 



wo 99/46281 




PCT/tS99/050:8 



FIGURE 140 

CACAGTTCCCCACCATCACTCNTCCCATTCCTTCCAACTTTATTTTTAGCTTGCCATTGGGA 
GGGGGCAGGATGGGAGGGAAAGTGAAGAAAACAGAAAAGGAGAGGGACAGAGGCCAGAGGAC 
TTCTCATACTGGACAGAAACCGATCAGGCATGGAACTCCCCTTCGTCACTCACCTGTTCTTG 
CCCCTGGTGTTCCTGACAGGTCTCTGCTCCCCCTTTAACCTGGATGAACATCACCCACGCCT 
ATTCCCAGGGCCACCAGAAGCTGAATTTGGATACAGTGTCTTACAACATGTTGGGGGTGGAC 
AGCGATGGATGCTGGTGGGCGCCCCCTGGGATGGGCCTTCAGGCGACCGGAGGGGGGACGTT 
TATCGCTGCCCTGTAGGGGGGGCCCACAATGCCCCATGTGCCAAGGGCCACTTAGGTGACTA 
CCAACTGGGAAATTCATCTCATCCTGCTGTGAATATGCACCTGGGGATGTCTCTGTTAGAGA 
CAGATGGTGATGG 
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FIGURE 141 



AAAGTTACATTTTCTCTGGAACTCTCCTAGGCCACTCCCTGCTGATGCAACATCTGGGTTTG 

GGCAGAAAGGAGGGTGCTTCGGAGCCCGCCCTTTCTGAGCTTCCTGGGCCGGCTCTAGAACA 

ATTCAGGCTTCGCTGCGACTCAGACCTCAGCTCCAACATATGCATTCTGAAGAAAGATGGCT 

GAGATGGACAGAATGCTTTATTTTGGAAAGAAACAATGTTCTAGGTCAAACTGAGTCTACCA 

AMSCAGACTTTCACAATGGTTCTAGAAGAAATCTGGACAAGTCTTTTCATGTGGTTTTTCT 

ACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGCCTGCCCCTCAGAACCTC 

TCTGTACTCTCAACCAACATGAAGCATCTCTTGATGTGGAGCCCAGTGATCGCGCCTGGAGA 

AACAGTGTACTATTCTGTCGAATACCAGGGGGAGTACGAGAGCCTGTACACGAGCCACATCT 

GGATCCCCAGCAGCTGGTGCTCACTCACTGAAGGTCCTGAGTGTGATGTCACTGATGACATC 

ACGGCCACTGTGCCATACAACCTTCGTGTCAGGGCCACATTGGGCTCACAGACCTCAGCCTG 

GAGCATCCTGAAGCATCCCTTTAATAGAAACTCAACCATCCTTACCCGACCTGGGATGGAGA 

TCACCAAAGATGGCTTCCACCTGGTTATTGAGCTGGAGGACCTGGGGCCCCAGTTTGAGTTC 

CTTGTGGCCTACTGGAGGAGGGAGCCTGGTGCCGAGGAACATGTCAAAATGGTGAGGAGTGG 

GGGTATTCCAGTGCACCTAGAAACCATGGAGCCAGGGGCTGCATACTGTGTGAAGGCCCAGA 

CATTCGTGAAGGCCATTGGGAGGTACAGCGCCTTCAGCCAGACAGAATGTGTGGAGGTGCTIA 

GGAGAGGCCATTCCCCTGGTACTGGCCCTGTTTGCCTTTGTTGGCTTCATGCTGATCCTTGT 

GGTCGTGCCACTGTTCGTCTGGAAAATGGGCCGGCTGCTCCAGTACTCCTGTTGCCCCGTGG 

TGGTCCTCCCAGACACCTTGAAAATAACCAATTCACCCCAGAAGTTAATCAGCTGCAGAAGG 

GAGGAGGTGGATGCCTGTGCCACGGCTGTGATGTCTCCTGAGGAACTCCTCAGGGCCTGGAT 

CTCAX^GTTTGCGGAAGGGCCCAGGTGAAGCCGAGAACCTGGTCTGCATGACATGGAAACC 

ATGAGGGGACAAGTTGTGTTTCTGTTTTCCGCCACGGACAAGGGATGAGAGAAGTAGGAAGA 

GCCTGTTGTCTACAAGTCTAGAAGCAACCATCAGAGGCAGGGTGGTTTGTCTAACAGAACAC 

TGACTGAGGCTTAGGGGATGTGACCTCTAGACTGGGGGCTGCCACTTGCTGGCTGAGCAACC 

CTGGGAAAAGTGACTTCATCCCTTCGGTCCTAAGTTTTCTCATCTGTAATGGGGGAATTACC 

TACACACCTGCTAAACACACACACACAGAGTCTCTCTCTATATATACACACGTACACATAAA 

TACACCCAGCACTTGCAAGGCTAGAGGGAAACTGGTGACACTCTACAGTCTGACTGATTCAG 

TGTTTCTGGAGAGCAGGAGATAAATGTATGATGAGAATGATCAAGGACTCTACACACTGGGT 

GGCTTGGAGAGCCCACTTTCCCAGAATAATCCTTGAGAGAAAAGGAATCATGGGAGCAATGG 

TGTTGAGTTCACTTCAAGCCCAATGCCGGTGCAGAGGGGAATGGCTTAGCGAGCTCTACAGT 

AGGTGACCTGGAGGAAGGTCACAGCCACACTGAAAATGGGATGTGCATGAACACGGAGGATC 

CATGAACTACTGTAAAGTGTTGACAGTGTGTGCACACTGCAGACAGCAGGTGAAATGTATGT 

GTGCAATGCGACGAGAATGCAGAAGTCAGTAACATGTGCATGTTTGTTGTGCTCCTTTTTTC 

TGTTGGTAAAGTACAGAATTCAGCAAATAAAAAGGGCCACCCTGGCCAAAAGCGGTAAAAAA 
AAAAAAAAAA 




wo 99/46281 




PCT/US99. 05028 



FIQURE 142 



</usr/seqcib2/sst/DNA/Dnaseqs.min/ss.DNA57033 
<subunit 1 of 1, 311 aa, 1 stop 
<MW: 35076, pi: 5,04, NX(S/T): 2 

MQTFT^rVLEEIWTSLFMWFFYALIPCLLTDEVAILPAPQNLSVLSTNMKHLmwSPVIAPGE 
TVYYSVEYQGEYESLYTSHIWIPSSWCSLTEGPECDVTDDITATVPYNLRVRATLGSQTSAW 
SILKHPFNRNSTILTRPGMEITKDGFHLVIELEDLGPQFEFLVAYWRREP6AEEHVKMVRSG 
GIPVHLETMEPGAAYCVKAQTFVKAIGRYSAFSQTECVEVQGEAIPLVLALFAFVGET^LILV 
WPLFVWKMGRLLQYSCCPWVLPDTLKITNSPQKLISCRREEVDACATAVMSPEELLRAWIS 

Important features: 
Signal peptide: 
amino acids 1-29 

Transmembrane domain: 
amino acids 230-255 

N-glycosylation site. 

amino acids 40-43 and 134-137 

Tissue factor proteins . 

amino acids 92-119 

Integrins alpha chain proteins 
amino acids 232-262 
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FIGURE 143 

TCCTGCTGATGCACATCTGGGTTTGGCAAAAGGAGGTTGCTTCGAGCCGCCCTTTCTAGCTT 
CCTGGCCGGCTCTAGAACAATTCAGGCTTCGCTGCGACTAGACCTCAGCTCCAACATATGCA 
TTCTGAAGAAAGATGGCTGAGATGACAGAATGCTTTATTTTGGAAAGAAACAATGTTCTAGG 
TCAAACTGAGTCTACCAAATGCAGACTTTCACAATGGTTCTAGAAGAAATCTGGACAAGTCT 
TTTCATGTGGTTTTTCTACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGC 
CTGCCCCTCAGAACCTCTCTGTACTCTCAACCAACATGAAGCATCTCTTGATGTGGAGCCCA 
GTGATCGCGCCTGGAGAAACAGTGTACTATTCTGTCGAATACCAGGGGGAGTACGAGAGCCT 
GTACACGAGCCACATCTGGATCCCCAGCAGCTGGTGCTCACTCACTGAAGGTCCTGAGTGTG 
ATGTCACTGATGACATCACGGCCACTGTGCCATACAACCTTTGTGTCAGGGCCACATTGGGC 
TCACAGACCTCAGCCTGGAGCATCCTGAAGCATCCCTTTAATAGAAACTCAACCATCCTTAC 
CCGACCTGGGATGGAGATCACCAAAGATGGCTTNCACCTGGTTATTGAGCTGGAGGACCTGG 
GGCCCCAGTTTGAGTTCCTTGTGGCCTANTGGAGGAGGGGCGAACCCCTTGCGGCGCAAGGG 
GTTNGCGAACCCCTTGCGGCCGCTGGGGTATCTCTCGAGAAAAGAGAGGCCCAATATGACCC 
ACATACTCAATATGGACGAANTGCTATTGTCCACCTGTTTGAGTGGCGCTGGGTTGAT 
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FIGURE 144 

CCCACGCGTCCGCCCACGCGTCCGAGGGACAAGAGAGAAGAGAGACTGAAACAGGGAGAAGA 

GGCAGGAGAGGAGGAGGTGGGGAGAGCACGAAGCTGGAGGCCGACACTGAGGGAGGGCGGGA 

GGAGGTGAAGAAGGAGAGAGGGGAGAAGAGGCAGGAGCTGGAAAGGAGAGAGGGAGGAGGAG 

GAGGAGATGCGGGATGGAGACCTGGAGTTAGGTGGCTTGGGAGAGCTTAATGAAAAGAGAAC 

GGAGAGGAGGTGTGGGTTAG6AACCAAGAGGTAGCCCTGTGGGCAGCAGAAGGCTGAGAGGA 

GTAGGAAGATCAGGAGCTAGAGGGAGACTGGAGGGTTCCGGGAAAAGAGCAGAGGAAAGAGG 

AAAGACACAGAGAGACGGGAGAGAGAAGAAGAGTGGGTTTGAAGGGCGGATCTCAGTCCCTG 

GCTGCTTTGGCATTTGGGGAACTGGGACTCCCTGTGGGGAGGAGAGGAAAGCTGGAAGTCCT 

GGAGGGACAGGGTCCCAGAAGGAGGGGACAGAGGAGCTGAGAGAGGGGGGCAGGGCGTTGGG 

CAGGGGTCCCTCGGAGGCCTCCTGGGGAISGGGGCTGCAGCTCGTCTGAGCGCCCCTCGAGC 

GCTGGTACTCTGGGCTGCACTGGGGGCAGCAGCTCACATCGGACCAGCACCTGACCCCGAGG 

ACTGGTGGAGCTACAAGGATAATCTCCAGGGAAACTTCGTGCCAGGGCCTCCTTTCTGGGGC 

CTGGTGAATGCAGCGTGGAGTCTGTGTGCTGTGGGGAAGCGGCAGAGCCCCGTGGATGTGGA 

GCTGAAGAGGGTTCTTTATGACCCCTTTCTGCCCCCATTAAGGCTCAGCACTGGAGGAGAGA 

AGCTCCGGGGAACCTTGTACAACACCGGCCGACATGTCTCCTTCCTGCCTGCACCCCGACCT 

GTGGTCAATGTGTCTGGAGGTCCCCTCCTTTACAGCCACCGACTCAGTGAACTGCGGCTGCT 

GTTTGGAGCTCGCGACGGAGCCGGCTCGGAACATCAGATCAACCACCAGGGCTTCTCTGCTG 

AGGTGCAGCTCATTCACTTCAACCAGGAACTCTACGGGAATTTCAGCGCTGCCTCCCGCGGC 

CCCAATGGCCTGGCCATTCTCAGCCTCTTTGTCAACGTTGCCAGTACCTCTAACCCATTCCT 

CAGTCGCCTCCTTAACCGCGACACCATCACTCGCATCTCCTACAAGAATGATGCCTACTTTC 

TTCAAGACCTGAGCCrGGAGCTCCTGTTCCCTGAATCCTTCGGCTTCATCACCTATCAGGGC 

TCTCTCAGCACCCCGCCCTGCTCCGAGACTGTCACCTGGATCCTCATTGACCGGGCCCTCAA 

TATCACCTCCCTTCAGATGCACTCCCTGAGACTCCTGAGCCAGAATCCTCCATCTCAGATCT 

TCCAGAGCCTCAGCGGTAACAGCCGGCCCCTGCAGCCCTTGGCCCACAGGGCACTGAGGGGC 

AACAGGGACCCCCGGCACCCCGAGAGGCGCTGCCGAGGCCCCAACTACCGCCTGCATGTGGA 

TGGTGTCCCCCATGGTCGClfiaGACTCCCCTTCGAGGATTGCACCCGCCCGTCCTAAGCCTC 

CCCACAAGGCGAGGGGAGTTACCCCTAAAACAAAGCTATTAAAGGGACAGAATACTTA 
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FIGURE 14S 



</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA34353 
<subunit 1 of 1, 328 aa, 1 stop 
<MW: 36238, pi: 9.90, NX{S/T): 3 

MGAAARLSAPRALVLWAALGAAAHIGPAPDPEDWWSYKDNLQGNFVPGPPFWGLVNAAWSLC 

AVGKRQSPVDVELKRVLYDPFLPPLRLSTGGEKLRGTLYNTGRHVSFLPAPRPWNVSGGPL 

LYSHRLSELRLLFGARDGAGSEHQINHQGFSAEVQLIHFNQELYGNFSAASRGPNGLAILSL 

FVNVASTSNPFLSRLLNRDTITRISYKNDAYFLQDLSLELLFPESFGFITYQGSLSTPPCSE 

TVTWILIDRALNITSLQMKSLRLLSQNPPSQIFQSLSGNSRPLQPLAHRALRGNRDPRHPER 
RCRGPNYRLHVDGVPHGR 

In^ortant features: 
Signal peptide: 

amino acids 1-23 

Transmembrane domain: 
amino acids 177-199 

N-glycosylaticn site. 

amino acids 118-121, 170-173 and 260-263 

Eukaryotic-type carbonic anhydrases proteins 
amino acids 222-270, 128-164 and 45-92 
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FIGURE 146 

GGCGCCTGGTTCTGCGCGTACTGGCTGTACGGAGCAGGAGCAAGAGGTCGCCGCCAGCCTCC 

GCCGCCGAGCCTCGTTCGTGTCCCCGCCCCTCGCTCCTGCAGCTACTGCTCAGAAACGCTGG 

GGCGCCCACCCTGGCAGACTAACGAAGCAGCTCCCTTCCCACCCCAACTGCAGGTCTAATTT 

TGGACGCTTTGCCTGCCATTTCTTCCAGGTTGAGGGAGCCGCAGAGGCGGAGGCTCGCGTAT 

TCCTGCAGTCAGCACCCACGTCGCCCCCGGACGCTCGGTGCTCAGGCCCTTCGCGAGCGGGG 

CTCTCCGTCTGCGGTCCCTTGTGAAGGCTCTGGGCGGCTGCAGAGGCCGGCCGTCCGGTTTG 

GCTCACCTCTCCCAGGAAACTTCACACTGGAGAGCCAAAAGGAGTGGAAGAGCCTGTCTTGG 

AGATTTTCCTGGGGAAATCCTGAGGTCATTCATTMS3AAGTGTACCGCGCGGGAGTGGCTCA 

GAGTAACCACAGTGCTGTTCATGGCTAGAGCAATTCCAGCCATGGTGGTTCCCAATGCCACT 

TTATTGGAGAAACTTTTGGAAAAATACATGGATGAGGATGGTGAGTGGTGGATAGCCAAACA 

ACGAGGGAAAAGGGCCATCACAGACAATGACATGCAGAGTATTTTGGACCTTCATAATAAAT 

TACGAAGTCAGGTGTATCCAACAGCCTCTAATATGGAGTATATGACATGGGATGTAGAGCTG 

GAAAGATCTGCAGAATCCTGGGCTGAAAGTTGCTTGTGGGAACATGGACCTGCAAGCTTGCT 

TCCATCAATTGGACAGAATTTGGGAGCACACTGGGGAAGATATAGGCCCCCGACGTTTCATG 

TACAATCGTGGTATGATGAAGTGAAAGACTTTAGCTACCCATATGAACATGAATGCAACCCA 

TATTGTCCATTCAGGTGTTCTGGCCCTGTATGTACACATTATACACAGGTCGTGTGGGCAAC 

TAGTAACAGAATCGGTTGTGCCATTAATTTGTGTCATAACATGAACATCTGGGGGCAGATAT 

GGCCCAAAGCTGTCTACCTGGTGTGCAATTACTCCCCAAAGGGAAACTGGTGGGGCCATGCC 

CCTTACAAACATGGGCGGCCCTGTTCTGCTTGCCCACCTAGTTTTGGAGGGGGCTGTAGAGA 

AAATCTGTGCTACAAAGAAGGGTCAGACAGGTATTATCCCCCTCGAGAAGAGGAAACAAATG 

AAATAGAACGACAGCAGTCACAAGTCCATGACACCCATGTCCGGACAAGATCAGATGATAGT 

AGCAGAAATGAAGTCATAAGCGCACAGCAAATGTCCCAAATTGTTTCTTGTGAAGTAAGATT 

AAGAGATCAGTGCAAAGGAACAACCTGCAATAGGTACGAATGTCCTGCTGGCTGTTTGGATA 

GTAAAGCTAAAGTTATTGGCAGTGTACATTATGAAATGCAATCCAGCATCTGTAGAGCTGCA 

ATTCATTATGGTATAATAGACAATGATGGTGGCTGGGTAGATATCACTAGAC7AGGAAGAAA 

GCATTATTTCATCAAGTCCAATAGAAATGGTATTCAAACAATTGGCAAATATCAGTCTGCTA 

ATTCCTTCACAGTCTCTAAAGTAACAGTTCAGGCTGTGACTTGTGAAACAACTGTGGAACAG 

CTCTGTCCATTTCATAAGCCTGCTTCACATTGCCCAAGAGTATACTGTCCTCGTAACTGTAT 

GCAAGCAAATCCACATTATGCTCGTGTAATTGGAACTCGAGTTTATTCTGATCTGTCCAGTA 

TCTGCAGAGCAGCAGTACATGCTGGAGTGGTTCGAAATCACGGTGGTTATGTTGATGTAATG 

CCTGTGGACAAAAGAAAGACCTACATTGCTTCTTTTCAGAATGGAATCTTCTCAGAAAGTTT 

ACAGAATCCTCCAGGAGGAAAGGCATTCAGAGTGTTTGCTGTTGTG3S&AACTGAATACTTG 

GAAGAGGACCATAAAGACTATTCCAAATGCAATATTTCTGAATTTTGTATAAAACTGTAACA 

TTACTGTACAGAGTACATCAACTATTTTCAGCCCAAAAAGGTGCCAAATGCATATAAATCTT 

GATAAACAAAGTCTATAAAATAAAACATGGGACATTAGCTTTGGGAAAAGTAATGAAAATAT 

AATGGTTTTAGAAATCCTGTGTTAAATATTGCTATATTTTCTTAGCAGTTATTTCTACAGTT 

AATTACATAGTCATGATTGTTCTACGTTTCATATATTATATGGTGCTTTGTATATGCCACTA 

ATAAAATGAATCTAAACATTGAATGTGAATGGCCCTCAGAAAATCATCTAGTGCATTTAAAA 

ATAATCGACTCTAAAACTGAAAGAAACCTTATCACATTTTCCCCAGTTCAATGCTATGCCAT 

TACCAACTCCAAATAATCTCAAATAATTTTCCACTTAATAACTGTAAAGTTTTTTTCTGTTA 

ATTTAGGCATATAGAATATTAAATTCTGATATTGCACTTCTTATTTTATATAAAATAATCCT 

TTAATATCCAAATGAATCTGTTAAAATGTTTGATTCCTTGGGAATGGCCTTAAAAATAAATG 

TAATAAAGTCAGAGTGGTGGTATGAAAACATTCCTAGTGATCATGTAGTAAATGTAGGGTTA 

AGCATGGACAGCCAGAGCTTTCTATGTACTGTTAAAATTGAGGTCACATATTTTCTTTTGTA 

TCCTGGCAAATACTCCTGCAGGCCAGGAAGTATAATAGCAAAAAGTTGAACAAAGATGAACT 

AATGTATTACATTACCATTGCCACTGATTTTTTTTAAATGGTAAATGACCTTGTATATAAAT 

ATTGCCATATCATGGTACCTATAATGGTGATATATTTGTTTCTATGAAAAATGTATTGTGCT 

TTGATACTAAAAATCTGTAAAATGTTAGTTTTGGTAATTTTTTTTCTGCTGGTGGATTTACA 

TATTAAATTTTTTCTGCTGGTGGATAAACATTAAAATTAATCATGTTTCAAAAAAAAAAAAA 
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FIGURE 147 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA45417 
<subunit 1 of 1, 500 aa, 1 stop 
<MW: 56888, pi: 8.53, NX(S/T): 2 

MKCTAREWLRVTTVLFMARA.IPAMWPNATLLEKLLEKYMDEDGEWWIAKQRGKRAITDNDM 

QSILDLHNKLRSQVYPTASNMEYMTWDVELERSAESWAESCLWEHGPASLLPSIGQNLGAHW 

GRYRPPTFHVQSWYDEVKDFSYPYEHECNPYCPFRCSGPVCTHYTQWWATSNRIGCAINLC 

HNMNIWGQIWPKAVYLVCNYSPKGNWWGHAPYKHGRPCSACPPSFGGGCRENLCYKEGSDRY 

YPPREEETNEIERQQSQVHDTHVRTRSDDSSRNEVISAQQMSQIVSCEVRLRDQCKGTTCNR 

YECPAGCLDSKAKVIGSVHYEMQSSICRAAIHYGIIDNDGGWVDITRQGRKHYFIKSNRNGI 

QTIGKYQSANSFTVSKVTVQAVTCETTVEQLCPFHKPASHCPRVYCPRNCMQANPHYARVIG 

TRVYSDLSSICRAAVHAGWRNHGGYVDVMPVDKRKTYIASFQNGIFSESLQNPPGGKAFRV 
FAW 

In^rtant features: 
Signal peptide: 
amino acids 1-20 

Extracellular proteins SCP/Tpx-l/AgS/PR-l/Sc7 protein 
amino acids 165-186, 196-218, 134-146, 96-108 and 58-77 

N-glycosylation site 
amino acids 28-31 



'WO 9»465fl1A2_L> 



wo 99/46281 




PCT/US99/05028 



FIGURE 148 

GCGGAGACAAGCGCAGAGCGCAGCGCACGGCCACAGACAGCCCTGGGCATCCACCGACGGCG 

CAGCCGGAGCCAGCAGAGCCGGAAGGCGCGCCCCGGGCAGAGAAAGCCGAGCAGAGCTGGGT 

GGCGTCTCCGGGCCGCCGCTCCGACGGGCCAGCGCCCTCCCC&ISTCCCTGCTCCCACGCCG 

CGCCCCTCCGGTCAGCATGAGGCTCCTGGCGGCCGCGCTGCTCCTGCTGCTGCTGGCGCTGT 

ACACCGCGCGTGTGGACGGGTCCAAATGCAAGTGCTCCCGGAAGGGACCCAAGATCCGCTAC 

AGCGACGTGAAGAAGCTGGAAATGAAGCCAAAGTACCCGCACTGCGAGGAGAAGATGGTTAT 

CATCACCACCAAGAGCGTGTCCAGGTACCGAGGTCAGGAGCACTGCCTGCACCCCAAGCTGC 

AGAGCACCAAGC GC T TCATCAAGTGGTACAACGCCT GGAACGAGAAGCGCAGGGTCTACGAA 

GAAIASGGTGAAAAACCTCAGAAGGGAAAACTCCAAACCAGTTGGGAGACTTGTGCAAAGGA 

CTTTGCAGATTAAAAAAAAAAAAAAAAAAAAAAAAAT^AAAAAAAAAAAAAAAAAAGCCTTTC 

TTTCTCACAGGCATAAGACACAAATTATATATTGTTATGAAGCACTTTTTACCAACGGTCAG 

TTTTTACATTTTATAGCTGCGTGCGAAAGGCTTCCAGATGGGAGACCCATCTCTCTTGTGCT 

CCAGACTTCATCACAGGCTGCTTTTTATCAAAAAGGGGAAAACTCATGCCTTTCCTTTTTAA 

AAAATGCTTTTTTGTATTTGTCCATACGTCACTATACATCTGAGCTTTATAAGCGCCCGGGA 

GGAACAATGAGCTTGGTGGACACATTTCATTGCAGTGTTGCTCCATTCCTAGCTTGGGAAGC 

TTCCGCTTAGAGGTCCTGGCGCCTCGGCACAGCTGCCACGGGCTCTCCTGGGCTTATGGCCG 

GTCACAGCCTCAGTGTGACTCCACAGTGGCCCCTGTAGCCGGGCAAGCAGGAGCAGGTCTCT 

CTGCATCTGTTCTCTGAGGAACTCAAGTTTGGTTGCCAGAAAAATGTGCTTCATTCCCCCCT 

GGTTAATTTTTACACACCCTAGGAAACATTTCCAAGATCCTGTGATGGCGAGACAAATGATC 

CTTAAAGAAGGTGTGGGGTCTTTCCCAACCTGAGGATTTCTGAAAGGTTCACAGGTTCAATA 

TTTAATGCTTCAGAAGCATGTGAGGTTCCCAACACTGTCAGCAAAAACCTTAGGAGAAAACT 

TAAAAATATATGAATACATGCGCAATACACAGCTACAGACACACATTCTGTTGACAAGGGAA 

AACCTTCAAAGCATGTTTCTTTCCCTCACCACAACAGAACATGCAGTACTAAAGCAATATAT 

TTGTGATTCCCCATGTAATTCTTCAATGTTAAACAGTGCAGTCCTCTTTCGAAAGCTAAGAT 

GACCATGCGCCCTTTCCTCTGTACATATACCCTTAAGAACGCCCCCTCCACACACTGCCCCC 

CAGTATATGCCGCATTGTACTGCTGTGTTATATGCTATGTACATGTCAGAAACCATTAGCAT 

TGCATGCAGGTTTCATATTCTTTCTAAGATGGAAAGTAATAAAATATATTTGAAATGTAAAA 

AAAAAAAAAAA 
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FIGURE 149 

MSLLPRRAPPVSMRLLAAALLLLLLALYTARVDGSKCKCSRKGPKIRYSDVKKLEMKPKYPH 
CEEKMVIITTKSVSRYRGQEHCLHPKLQSTKRFIKWYNAWNEKRRVYEE 
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FIGURE 150 

GCCCCAGGGACTGCTATGGCTTCCTTTGTTGTTCACCCCGGTCTGCGTC&ISTTAAACTCCA 

ATGTCCTCCTGTGGTTAACTGCTCTTGCCATCAAGTTCACCCTCATTGACAGCCAAGCACAG 

TATCCAGTTGTCAACACAAATTATGGCAAAATCCGGGGCCTAAGAACACCGTTACCCAATGA 

GATCTTGGGTCCAGTGGAGCAGTACTTAGGGGTCCCCTATGCCTCACCCCCCACTGGAGAGA 

GGCGGTTTCAGCCCCCAGAACCCCCGTCCTCCTGGACTGGCATCCGAAATACTACTCAGTTT 

GCTGCTGTGTGCCCCCAGCACCTGGATGAGAGATCCTTACTGCATGACATGCTGCCCATCTG 

GTTTACCGCCAATTTGGATACTTTGATGACCTATGTTCAAGATCAAAATGAAGACTGCCTTT 

ACTTAAACATCTACGTGCCCACGGAAGATGGAGCCAACACT^GATU^CGCAGATGATATA 

ACGAGTAATGACCGTGGTGAAGACGAAGATATTCATGATCAGAACAGTAAGAAGCCCGTCAT 

GGTCTATATCCATGGGGGATCTTACATGGAGGGCACCGGCAACATGATTGACGGCAGCATTT 

TGGCAAGCTACGGAAACGTCATCGTGATCACCATTAACTACCGTCTGGGAATACTAGGGTTT 

TTAAGTACCGGTGACCAGGCAGCAAAAGGCAACTATGGGCTCCTGGATCAGATTCAAGCACT 

GCGGTGGATTGAGGAGAATGTGGGAGCCTTTGGCGGGGACCCCAAGAGAGTGACCATCTTTG 

GCTCGGGGGCTGGGGCCTCCTGTGTCAGCCTGTTGACCCTGTCCCACTACTCAGAAGGTCTC 

TTCCAGAAGGCCATCATTCAGAGCGGCACCGCCCTGTCCAGCTGGGCAGTGAACTACCAGCC 

GGCCAAGTACACTCGGATATTGGCAGACAAGGTCGGCTGCAACATGCTGGACACCACGGACA 

TGGTAGAATGCCTGCGGAACAAGAACTACAAGGAGCTCATCCAGCAGACCATCACCCCGGCC 

ACCTACCACATAGCCTTCGGGCCGGTGATCGACGGCGACGTCATCCCAGACGACCCCCAGAT 

CCTGATGGAGCAAGGCGAGTTCCTCAACTACGACATCATGCTGGGCGTCAACCAAGGGGAAG 

GCCTGAAGTTCGTGGACGGCATCGTGGATAACGAGGACGGTGTGACGCCCAACGACTTTGAC 

TTCTCCGTGTCCAACTTCGTGGACAACCTTTACGGCTACCCTGAAGGGAAAGACACTTTGCG 

GGAGACTATCAAGTTCATGTACACAGACTGGGCCGATAAGGAAAACCCGGAGACGCGGCGGA 

AAACCCTGGTGGCTCTCTTTACTGACCACCAGTGGGTGGCCCCCGCCGTGGCCGCCGACCTG 

CACGCGCAGTACGGCTCCCCCACCTACTTCTATGCCTTCTATCATCACTGCCAAAGCGAAAT 

GAAGCCCAGCTGGGCAGATTCGGCCCATGGTGATGAGGTCCCCTATGTCTTCGGCATCCCCA 

TGATCGGTCCCACCGAGCTCTTCAGTTGTAACTTTTCCAAGAACGACGTCATGCTCAGCGCC 

GTGGTCATGACCTACTGGACGAACTTCGCCAAAACTGGTGATCCAAATCAACCAGTTCCTCA 

GGATACCAAGTTCATTCACACAAAACCCAACCGCTTTGAAGAAGTGGCCTGGTCCAAGTATA 

ATCCCAAAGACCAGCTCTATCTGCATATTGGCTTGAAACCCAGAGTGAGAGATCACTACCGG 

GCAACGAAAGTGGCTTTCTGGTTGGAACTCGTTCCTCATTTGCACAACTTGAACGAGATATT 

CCAGTATGTTTCAACAACCACAAAGGTTCCTCCACCAGACATGACATCATTTCCCTATGGCA 

CCCGGCGATCTCCCGCCAAGATATGGCCAACCACCAAACGCCCAGCAATCACTCCTGCCAAC 

AATCCCAAACACTCTAAGGACCCTCACAAAACAGGGCCTGAGGACACAACTGTCCTCATTGA 

AACCAAACGAGATTATTCCACCGAATTAAGTGTCACCATTGCCGTCGGGGCGTCGCTCCTCT 

TCCTCAACATCTTAGCTTTTGCGGCGCTGTACTACAAAAAGGACAAGAGGCGCCATGAGACT 

CACAGGCGCCCCAGTCCCCAGAGAAACACCACAAATGATATCGCTCACATCCAGAACGAAGA 

GATCATGTCTCTGCAGATGAAGCAGCTGGAACACGATCACGAGTGTGAGTCGCTGCAGGCAC 

ACGACACACTGAGGCTCACCTGCCCGCCAGACTACACCCTCACGCTGCGCCGGTCGCCAGAT 

GACATCCCACTTATGACGCCAAACACCATCACCATGATTCCAAACACACTGACGGGGATGCA 

GCCTTTGCACACTTTTAACACCTTCAGTGGAGGACAAAACAGTACAAATTTACCCCACGGAC 

ATTCCACCACTAGAGTAT^CTTTGCCCTATTTCCCTTCCTATCCCTCTGCCCTACCCGCTC 

AGCAACATAGAAGAGGGAAGGAAAGAGAGAAGGAAAGAGAGAGAGAAAGAAAGTCTCCAGAC 

CAGGAATGTTTTTGTCCCACTGACTTAAGACTUU^TGCAAAAAGGCAGTCATCCCATCCCG 

GCAGACCCTTATCGTTGGTGTTTTCCAGTATTACAAGATCAACTTCTGACCCTGTGAAATGT 

GAGAAGTACACATTTCTGTTAAAATAACTGCTTTAAGATCTCTACCACTCCAATCAATGTTT 

AGTGTGATAGGACATCACCATTTCAAGGCCCCGGGTGTTTCCAACGTCATGGAAGCAGCTGA 

CACTTCTGAAACTCAGCCAAGGACACTTGATATTTTTTAATTACAATGGAAGTTTAAACATT 

TCTTTCTGTGCCACACAATGGATGGCTCTCCTTAAGTGAAGAAAGAGTCAATGAGATTTTGC 

CCAGCACATGGAGCTGTAATCCAGAGAGAAGGAAACGTAGAAATTTATTATTAAAAGAATGG 

ACTGTGCAGCGAAATCTGTACGGTTCTGTGCAAAGAGGTGTTTTGCCAGCCTGAACTATATT 

TAAGAGACTTTGT 



BNSOOaO: <WO__»462eiA2_L> 



wo 99/46281 




PCT/US99/05028 



FIGURE 151 

MLNSNVLLWLTALAIKFTLIDSQAQYPWNTNYGKIRGLRTPLPNEILGPVEQYLGVPYASP 
PTGERRFQPPEPPSSWTGIRNTTQFAAVCPQHLDERSLLHDMLPIWFTANLDTLMTYVQDQN 
EDCLYLNIYVPTEDGANTKKNADDITSNDRGEDEDIHDQNSKKPVMVYIHGGSYMEGTGNMI 
DGS I LAS YGNVI VI T I NYRLGI LGFLS TGDQAAKGNYGLLDQ I QALRW I EENVGAFGGDPKR 

VTI FGSGAGASCVSLLTLSHYSEGLFQKAI IQSGTALSSWAVNYQPAKYTRILADKVGCNML 

DTTDMVECLRNKNYKELIQQTITPATYHIAFGPVIDGDVIPDDPQILMEQGEFLNYDIMLGV 

NQGEGLKFVDGIVDNEDGVTPNDFDFSVSNFVDNLYGYPEGKDTLRETIKFMYTDWADKENP 

ETRRKTLVALFTDHQWVAPAVAADLHAQYGSPTYFYAFYHHCQSEMKPSWADSAHGDEVPYV 

FGIPMIGPTELFSCNFSKNDVMLSAWMTYWTNFAKTGDPNQPVPQDTKFIHTKPNRFEEVA 

WSKYNPKDQLYLHIGLKPRVRDHYRATKVAFWLELVPHLHNLNEIFQYVSTTTKVPPPDMTS 

FPYGTRRSPAKIWPTTKRPAITPANNPKHSKDPHKTGPEDTTVLIETKRDYSTELSVTIAVG 

ASLLFLNILAFAALYYKKDKRRHETHRRPSPQRNTTNDIAHIQNEEIMSLQMKQLEHDHECE 

SLQAHDTLRLTCPPDYTLTLRRSPDDIPLMTPNTITMIPNTLTGMQPLHTFNTFSGGQNSTN 
LPHGHSTTRV 
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FIGURE 1S2 

GGGAAAGMfiGCGGCGACTCTGGGACCCCTTGGGTCGTGGCAGCAGTGGCGGCGATGTTTGT 

CGGCTCGGGATGGGTCCAGGATGTTACTCCTTCTTCTTTTGTTGGGGTCTGGGCAGGGGCCA 

CAGCAAGTCGGGGCGGGTCAAACGTTCGAGTACTTGAAACGGGAGCACTCGCTGTCGAAGCC 

CTACCAGGGTGTGGGCACAGGCAGTTCCTCACTGTGGAATCTGATGGGCAATGCCATGGTGA 

TGACCCAGTATATCCGCCTTACCCCAGATATGCAAAGTAAACAGGGTGCCTTGTGGAACCGG 

GTGCCATGTTTCCTGAGAGACTGGGAGTTGCAGGTGCACTTCAAAATCCATGGACAAGGAAA 

GAAGAATCTGCATGGGGATGGCTTGGCAATCTGGTACACAAAGGATCGGATGCAGCCAGGGC 

CTGTGTTTGGAAACATGGACAAATTTGTGGGGCTGGGAGTATTTGTAGACACCTACCCCAAT 

GAGGAGAAGCAGCAAGAGCGGGTATTCCCCTACATCTCAGCCATGGTGAACAACGGCTCCCT 

CAGCTATGATCATGAGCGGGATGGGCGGCCTACAGAGCTGGGAGGCTGCACAGCCATTGTCC 

GCAATCTTCATTACGACACCTTCCTGGTGATTCGCTACGTCAAGAGGCATTTGACGATAATG 

ATGGATATTGATGGCAAGCATGAGTGGAGGGACTGCATTGAAGTGCCCGGAGTCCGCCTGCC 

CCGCGGCTACTACTTCGGCACCTCCTCCATCACTGGGGATCTCTCAGATAATCATGATGTCA 

TTTCCTTGAAGTTGTTTGAACTGACAGTGGAGAGAACCCCAGAAGAGGAAAAGCTCCATCGA 

GATGTGTTCTTGCCCTCAGTGGACAATATGAAGCTGCCTGAGATGACAGCTCCACTGCCGCC 

CCTGAGTGGCCTGGCCCTCTTCCTCATCGTCTTTTTCTCCCTGGTGTTTTCTGTATTTGCCA 

TAGTCATTGGTATCATACTCTACAACAAATGGCAGGAACAGAGCCGAAAGCGCTTCTACIS& 

GCCCTCCTGCTGCCACCACTTTTGTGACTGTCACCCATGAGGTATGGAAGGAGCAGGCACTG 

GCCTGAGCATGCAGCCTGGAGAGTGTTCTTGTCTCTAGCAGCTGGTTGGGGACTATATTCTG 

TCACTGGAGTTTTGAATGCAGGGACCCCGCATTCCCATGGTTGTGCATGGGGACATCTAACT 

CTGGTCTGGGAAGCCACCCACCCCAGGGCAATGCTGCTGTGATGTGCCTTTCCCTGCAGTCC 

TTCCATGTGGGAGCAGAGGTGTGAAGAGAATTTACGTGGTTGTGATGCCAAAATCACAGAAC 

AGAATTTCATAGCCCAGGCTGCCGTGTTGTTTGACTCAGAAGGCCCTTCTACTTCAGTTTTG 

AATCCACAAAGAATTAAAAACTGGTAACACCACAGGCTTTCTGACCATCCATTCGTTGGGTT 

TTGCATTTGACCCAACCCTCTGCCTACCTGAGGAGCTTTCTTTGGAAACCAGGATGGAAACT 

TCTTCCCTGCCTTACCTTCCTTTCACTCCATTCATTGTCCTCTCTGTGTGCAACCTGAGCTG 

GGAAAGGCATTTGGATGCCTCTCTGTTGGGGCCTGGGGCTGCAGAACACACCTGCGTTTCAC 

TGGCCTTCATTAGGTGGCCCTAGGGAGATGGCTTTCTGCTTTGGATCACTGTTCCCTAGCAT 

GGGTCTTGGGTCTATTGGCATGTCCATGGCCTTCCCAATCAAGTCTCTTCAGGCCCTCAGTG 

AAGTTTGGCTAAAGGTTGGTGTAAAAATCAAGAGAAGCCTGGAAGACATCATGGATGCCATG 

GATTAGCTGTGCAACTGACCAGCTCCAGGTTTGATCAAACCAAAAGCAACATTTGTCATGTG 

GTCTGACCATGTGGAGATGTTTCTGGACTTGCTAGAGCCTGCTTAGCTGCATGTTTTGTAGT 

TACGATTTTTGGAATCCCACTTTGAGTGCTGAAAGTGTAAGGAAGCTTTCTTCTTACACCTT 

GGGCTTGGATATTGCCCAGAGAAGT^TTTGGCTTTTTTTTTCTTAATGGACAAGAGACAGT 

TGCTGTTCTCATGTTCCAAGTCTGAGAGCAACAGACCCTCATCATCTGTGCCTGGAAGAGTT 

CACTGTCATTGAGCAGCACAGCCTGAGTGCTGGCCTCTGTCAACCCTTATTCCACTGCCTTA 

TTTGACAAGGGGTTACATGCTGCTCACCTTACTGCCCTGGGATTAAATCAGTTACAGGCCAG 

AGTCTCCTTGGAGGGCCTGGAACTCTGAGTCCTCCTATGAACCTCTGTAGCCTAAATGAAAT 

TCTTAAAATCACCGATGGAACCAAA/^AAAAAAAAAAAAAGGGCGGCCGCGACTCTAGAGTCG 

ACCTGCAGTAGGGATAACAGGGTAATAAGCTTGGCCGCCATGG 
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></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA50911 
xsubunxt 1 of 1, 348 aa, 1 sCop 
XMW: 39711, pi: 8.70, ^^X(S/T): 1 

MAATLGPLGSWQQWRRCLSARDGSRMLLLLLLLGSGQGPQQVGAGQTFEYLKREHSLSKPYQ 

GVGTGSSSLWNLMGNAMVMTQYIRLTPDMQSKQGALWNRVPCFLRDWELQVHFKIHGQGKKN 

LHGDGLAIWYTKDRMQPGPVFGNMDKEVGLGVFVDTYPNEEKQQERVFPYISAMVNNGSLSY 

DHERDGRPTELGGCTAIVRNLHYDTFLVIRYVKRHLTIMMDIDGKHEWRDCIEVPGVRLPRG 

YYFGTSSITGDLSDNHDVISLKLFELTVERTPEEEKLHRDVFLPSVDNMKLPEMTAPLPPLS 
GLALFLIVFFSLVFSVFAIVIGIILYNKWQEQSRKRFY 
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CCGAGCCGGGCGCGCAGCGACGGAGCTGGGGCCGGCCTGGGACCATGGGCGTGAGTGCAATC 

TACGGATCAGTCTCTGATGGTGGGTCGTTAACCTCAGTGGGGACTCCAAGATTTCCATGAAG 

AAAATCAGTTGTCTTCATTCAAGAATTGGGGTCTGGCTCAGAATTCCTGCAGCTGGTGAAAA 

TCTGTTTTCTAGAAGAGGTTTAATTAATGCCTGCAGTCTGACATGTTCCCGATTTGAGGTGA 

AACCATGAAGAGAAAATAGAATACTTAATA&ISCTTTTCCGCAACCGCTTCTTGCTGCTGCT 

GGCCCTGGCTGCGCTGCTGGCCTTTGTGAGCCTCAGCCTGCAGTTCTTCCACCTGATCCCGG 

TGTCGACTCCTAAGAATGGAATGAGTAGCAAGAGTCGAAAGAGAATCATGCCCGACCCTGTG 

ACGGAGCCCCCTGTGACAGACCCCGTTTATGAAGCTCTTTTGTACTGCAACATCCCCAGTGT 

GGCCGAGCGCAGCATGGAAGGTCATGCCCCGCATCATTTTAAGCTGGTCTCAGTGCATGTGT 

TCATTCGCCACGGAGACAGGTACCCACTGTATGTCATTCCCAAAACAAAGCGACCAGAAATT 

GACTGCACTCTGGTGGCTAACAGGAAACCGTATCACCCAAAACTGGAAGCTTTCATTAGTCA 

CATGTCAAAAGGATCCGGAGCCTCTTTCGAAAGCCCCTTGAACTCCTTGCCTCTTTACCCAA 

ATCACCCATTGTGTGAGATGGGAGAGCTCACACAGACAGGAGTTGTGCAGCATTTGCAGAAC 

GGTCAGCTGCTGAGGGATATCTATCTAAAGAAACACAAACTCCTGCCCAATGATTGGTCTGC 

AGACCAGCTCTATTTAGAGACCACTGGGAAAAGCCGGACCCTACAAAGTGGGCTGGCCTTGC 

TTTATGGCTTTCTCCCAGATTTTGACTGGAAGAAGATTTATTTCAGGCACCAGCCAAGTGCG 

CTGTTCTGCTCTGGAAGCTGCTATTGCCCGGTAAGAAACCAGTATCTGGAAAAGGAGCAGCG 

TCGTCAGTACCTCCTACGTTTGAAAAACAGCCAGCTGGAGAAGACCTACGGGGAGATGGCCA 

AGATCGTGGATGTCCCCACCAAGCAGCTTAGAGCTGCCAACCCCATAGACTCCATGCTCTGC 

CACTTCTGCCACAATGTCAGCTTTCCCTGTACCAGAAATGGCTGTGTTGACATGGAGCACTT 

CAAGGTAATTAAGACCCATCAGATCGAGGATGAAAGGGAAAGACGGGAGAAGAAATTGTACT 

TCGGGTATTCTCTCCTGGGTGCCCACCCCATCCTGAACCAAACCATCGGCCGGATGCAGCGT 

GCCACCGAGGGCAGGAAAGAAGAGCTCTTTGCCCTCTACTCTGCTCATGATGTCACTCTGTC 

ACCAGTTCTCAGTGCCTTGGGCCTTTCAGAAGCCAGGTTCCCAAGGTTTGCAGCCAGGTTGA 

TCTTTGAGCTTTGGCAAGACAGAGAAAAGCCCAGTGAACATTCCGTCCGGATTCTTTACAAT 

GGCGTCGATGTCACATTCCACACCTCTTTCTGCCAAGACCACCACAAGCGTTCTCCCAAGCC 

CATGTGCCCGCTTGA/yWlCTTGGTCCGCTTTGTGAAAAGGGACATGTTTGTAGCCCTGGGTG 

GCAGTGGTACAAATTATTATGATGCATGTCACAGGGAAGGATTCIft&AAGGTATGCAGTACA 

GCAGTATAGAATCCATGCCAATACAGAGCATAGGGAAAGGTCCACTTCTAGTTTTGTCTGTT 

ACTAAGGGTAGAAGATTATTGCTTTTTAAAGGCTAAATATTGTTTGTGGGAACCACAGATGG 

TTGGGGTTGAACAGTAAGCACATTGCTGCAATGTGGTACGTGAATTGCTTGGTACAAAATGG 

CCAGTTCACAGAGGAATAGAAGGTACTTTATCATAGCCAGACTTCGCTTAGAATGCCAGAAT 

AATATAGTTCAAGACCTGAAGTTGCCAATCCAAGTTTGCACTCTTCTGGCCTGCCCCATGTT 

ACTATGTGATGGAACCAGCACACCTCAACCAAAATTTTTTTAATCTTAGACATTTTTACCTT 

GTCCTTGTTAAGAATTTCTTGAAGTGATTTATCTAAAATAAAGGTTGGCAAACTTTTTCTGT 

AAAGGGCCAGATTGTAAATATTTCAGACTGTGTGGACCAAAAGGCCACATACAGTCTCTGTC 

ATAACTACTCAACTCTGTTTCTGAAGCAGGAAAGCCACCACAGACAGTACATAAAGGAATAT 

GTGTAGCTGGGTTCCCAGGCCAGACAAAACAGATGGTGACCAGACTTGGCCCCTGGGCTGTA 

GTTTGCTGACCCCTCATCTAAAAAATAGGCTATACTACAATTGCACTTCCAGCACTTTGAGA 

ACGAGTTGAATACCAAGAATTATTCAATGGTTCCTCCAGTAACTTCTGCTAGAAACACAGAA 

TTTGGTCTGTATCTGACACTAGAACAAAACTTGAGGGTAAATAAACATTGAATTAGAATGAA 

TCATAGAAAACTGATTAGAAGAATACTTGATGTTTATGATGATTGTGGTACAAGATAGTTTT 

AAGTATGTTCTAAATATTTGTCTGCTGTAGTCTATTTGCTGTATATGCTGAAATTTTTGTAT 

GCCATTTAGTATTTTTATAGTTTAGGAAAATATTTTCTAAGACCAGTTTTAGATGACTCTTA 

TTCCTGTAGTAATATTCAATTTGCTGTACCTGCTTGGTGGTTAGAAGGAGGCTAGAAGATGA 

ATTCAGGCACTTTCTTCCAATAAAACTAATTATGGCTCATTCCCTTTGACAAGCTGTAGAAC 

TGGATTCATTTTTAAACCATTTTCATCAGTTTCAAATGGTAAATTCTGATTGATTTTTAAAT 

GCGTTTTTGGAAGAACTTTGCTATTAGGTAGTTTACAGATCTTTATAAGGTGTTTTATATAT 

TAGAAGCAATTATAATTACATCTGTGATTTCTGAACTAATGGTGCTAATTCAGAGAAATGGA 

AAGTGAAAGTGAGATTCTCTGTTGTCATCGGCATTCCAACTTTTTCTCTTTGTTTTTGTCCA 

GTGTTGCATTTGAATATGTCTGTTTCTATAAATAAATTTTTTAAGAATAA 
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></usr/seqdb2/ssc/DNA/Dnaseqs.min/ss.DNA4 8329 
xsubunit 1 of 1, 480 aa, 1 stop 
XMW: 55240, pi: 9.30, NX(S/T): 2 

MLFRNRFLLLLALAALLAFVSLSLQFFHLIPVSTPKNGMSSKSRKRIMPDPVTEPPVTDPVY 
EALLYCNIPSVAERSMEGHAPHHFKLVSVHVFIRHGDRYPLYVIPKTKRPEIDCTLVANRKP 
YHPKLEAFISHMSKGSGASFESPLNSLPLYPNHPLCEMGELTQTGWQHLQNGQLLRDIYLK 
KHKLLPNDWSADQLYLETTGKSRTLQSGIiALLYGFLPDFDWKKIYFRHQPSALFCSGSCYCP 
VRNQYLEKEQRRQYLLRLKNSQLEKTYGEMAKIVDVPTKQLRAANPIDSMLCHFCHNVSFPC 
TRNGCVDMEHFKVIKTHQIEDERERREKKLYFGYSLLGAHPILNQTIGRMQRATEGRKEELF 

ALYSAHDVTLSPVLSALGLSEARFPRFAARLIFELWQDREKPSEHSVRILYNGVDVTFHTSF 
CQDHHKRSPKPMCPLENLVRFVKRDMFVALGGSGTNYYDACHREGF 
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AAAAAAGCTCACTAAAGTTTCTATTAGAGCGAATACGGTAGATTTCCATCCCCTTTTGAAGA 
ACAGTAC TG T GGAG C TAT TT AAGAGAT AAAAACGAAATATCCTTTCTGGGAGTTCAAGATTG 
TGCAGTAATTGGTTAGGACTCTGAGCGCCGCTGTTCACCAATCGGGGAGAGAAAAGCGGAGA 
TCCTGCTCGCCTTGCACGCGCCTGAAGCACAAAGCAGATAGCTAGGAATGAACCATCCCTGG 
GAGTATGTGGAAACAACGGAGGAGCTCTGACTTCCCAACTGTCCCATTCTATGGGCGAAGGA 

ACTGCTCCTGACTTCAGTGGTTAAGGGCAGAATTGAAAATAATTCTGGAGGAAGATAAGAAI 

SATTCCTGCGCGACTGCACCGGGACTACAAAGGGCTTGTCCTGCTGGGAATCCTCCTGGGGA 

CTCTGTGGGAGACCGGATGCACCCAGATACGCTATTCAGTTCCGGAAGAGCTGGAGAAAGGC 

TCTAGGGTGGGCGACATCTCCAGGGACCTGGGGCTGGAGCCCCGGGAGCTCGCGGAGCGCGG 

AGTCCGCATCATCCCCAGAGGTAGGACGCAGCTTTTCGCCCTGAATCCGCGCAGCGGCAGCT 

TGGTCACGGCGGGCAGGATAGACCGGGAGGAGCTCTGTATGGGGGCCATCAAGTGTCAATTA 

AATCTAGACATTCTGATGGAGGATAAAGTGAAAATATATGGAGTAGAAGTAGAAGTAAGGGA 

CATTAACGACAATGCGCCTTACTTTCGTGAAAGTGAATTAGAAATAAAAATTAGTGAAAATG 

CAGCCACTGAGATGCGGTTCCCTCTACCCCACGCCTGGGATCCGGATATCGGGAAGAACTCT 

CTGCAGAGCTACGAGCTCAGCCCGAACACTCACTTCTCCCTCATCGTGCAAAATGGAGCCGA 

CGGTAGTAAGTACCCCGAATTGGTGCTGAAACGCGCCCTGGACCGCGAAGAAAAGGCTGCTC 

ACCACCTGGTCCTTACGGCCTCCGACGGGGGCGACCCGGTGCGCACAGGCACCGCGCGCATC 

CGCGTGATGGTTCTGGATGCGAACGACAACGCACCAGCGTTTGCTCAGCCCGAGTACCGCGC 

GAGCGTTCCGGAGAATCTGGCCTTGGGCACGCAGCTGCTTGTAGTCAACGCTACCGACCCTG 

ACGAAGGAGTCAATGCGGAAGTGAGGTATTCCTTCCGGTATGTGGACGACAAGGCGGCCCAA 

GTTTTCAAACTAGATTGTAATTCAGGGACAATATCAACAATAGGGGAGTTGGACCACGAGGA 

GTCAGGATTCTACCAGATGGAAGTGCAAGCAATGGATAATGCAGGATATTCTGCGCGAGCCA 

AAGTCCTGATCACTGTTCTGGACGTGAACGACAATGCCCCAGAAGTGGTCCTCACCTCTCTC 

GCCAGCTCGGTTCCCGAAAACTCTCCCAGAGGGACATTAATTGCCCTTTTAAATGTAAATGA 

CCAAGATTCTGAGGAAAACGGACAGGTGATCTGTTTCATCCAAGGAAATCTGCCCTTTA7VAT 

TAGAAAAATCTTACGGAAATTACTATAGTTTAGTCACAGACATAGTCTTGGATAGGGAACAG 

GTTCCTAGCTACAACATCACAGTGACCGCCACTGACCGGGGAACCCCGCCCCTATCCACGGA 

AACTCATATCTCGCTGAACGTGGCAGACACCAACGACAACCCGCCGGTCTTCCCTCAGGCCT 

CCTATTCCGCTTATATCCCAGAGAACAATCCCAGAGGAGTTTCCCTCGTCTCTGTGACCGCC 

CACGACCCCGACTGTGAAGAGAACGCCCAGATCACTTATTCCCTGGCTGAGAACACCATCCA 

AGGGGCAAGCCTATCGTCCTACGTGTCCATCAACTCCGACACTGGGGTACTGTATGCGCTGA 

GCTCCTTCGACTACGAGCAGTTCCGAGACTTGCAAGTGAAAGTGATGGCGCGGGACAACGGG 

CACCCGCCCCTCAGCAGCAACGTGTCGTTGAGCCTGTTCGTGCTGGACCAGAACGACAATGC 

GCCCGAGATCCTGTACCCCGCCCTCCCCACGGACGGTTCCACTGGCGTGGAGCTGGCTCCCC 

GCTCCGCAGAGCCCGGCTACCTGGTGACCAAGGTGGTGGCGGTGGACAGAGACTCCGGCCAG 

AACGCCTGGCTGTCCTACCGTCTGCTCAAGGCCAGCGAGCCGGGACTCTTCTCGGTGGGTCT 

GCACACGGGCGAGGTGCGCACGGCGCGAGCCCTGCTGGACAGAGACGCGCTCAAGCAGAGCC 

TCGTAGTGGCCGTCCAGGACCACGGCCAGCCCCCTCTCTCCGCCACTGTCACGCTCACCGTG 

GCCGTGGCCGACAGCATCCCCCAAGTCCTGGCGGACCTCGGCAGCCTCGAGTCTCCAGCTAA 

CTCTGAAACCTCAGACCTCACTCTGTACCTGGTGGTAGCGGTGGCCGCGGTCTCCTGCGTCT 

TCCTGGCCTTCGTCATCTTGCTGCTGGCGCTCAGGCTGCGGCGCTGGCACAAGTCACGCCTG 

CTGCAGGCTTCAGGAGGCGGCTTGACAGGAGCGCCGGCGTCGCACTTTGTGGGCGTGGACGG 

GGTGCAGGCTTTCCTGCAGACCTATTCCCACGAGGTTTCCCTCACCACGGACTCGCGGAAGA 

GTCACCTGATCTTCCCCCAGCCCAACTATGCAGACATGCTCGTCAGCCAGGAGAGCTTTGAA 

AAAAGCGAGCCCCTTTTGCTGTCAGGTGATTCGGTATTTTCTAAAGACAGTCATGGGTTAAT 

TGAGGTGAGTTTATATCAAATCTTCTTTCTTTTTTTTTTTAATTGCTCTGTCTCCCAAGCTG 

GAGTGCAGCGGTACGATCATAGCTCACTGCGGCCTCAAACTCCTAGGCTCAAGCAATTATCC 

CACCTTTGCCTCCGGTGTAACAGGGACTACAGGTGCAAGCCACCTACTGTCTGCCTATCTAT 

CTATCTATCTATCTATCTATCTATCTATCTATCTATCTATCTATTACTTTCTTGTACAGACG 

GGAGTCTCACGCCTGTAATCCCAGTACTTTGGGAGGCCGAGGCGGGTGGATCACCTGAGGTT 

GGGAGTTTGAGACCAGCCIfiACCTU^CATGGAGAAACCCCGTCTATACTAAAAAAATACAAAA 

TTAGCCGGGCGTGGTGGTGCATGTCTGTAATCCCAGCTACTTGGGAGGCTGAGTCAGGAGAA 

TTGCTTTAACCTGGGAGGTGGAGGTTGCAATGAGCTGAGATTGTGCCATTGCACTCCAGCCT 

GGGCAACAAGAGTGAAACTCTATCTCA 
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></usr/seqcib2/sst/DNA/Dnaseqs .min/ss . DNA48306 
xsubunit 1 of 1, 916 aa, 1 stop 
XMW: 100204, pi: 4.92, NX(S/T): 4 

MIPARLHRDYKGLVLLGILLGTLWETGCTQIRYSVPEELEKGSRVGDISRDLGLEPRELAER 
GVRIIPRGRTQLFALNPRSGSLVTAGRIDREELCMGAIKCQLNLDILMEDKVKIYGVEVEVR 
DINDNAPYFRESELEIKISENAATEMRFPLPHAWDPDIGKNSLQSYELSPNTHFSLIVQNGA 
DGSKYPELVLKRALDREEKAAHHLVLTASDGGDPVRTGTARIRVMVLDANDNAPAFAQPEYR 
ASVPENLALGTQLLWNATDPDEGVNAEVRYSFRYVDDKAAQVFKLDCNSGTISTIGELDHE 
ESGFYQMEVQAMDNAGYSARAKVLITVLDVNDNAPEWLTSLASSVPENSPRGTLIALLNVN 
DQDSEENGQVICFIQGNLPFKLEKSYGNYYSLVTDIVLDREQVPSYNITVTATDRGTPPLST 
ETHISLNVADTNDNPPVFPQASYSAYIPENNPRGVSLVSVTAHDPDCEENAQITYSLAENTI 
QGAS LS S YVS I NS DTG VL YALS S FDYEQFRDLQVKVMARDNGHPPLS S NVS LS L FVLDQNDN 
APEILYPALPTDGSTGVELAPRSAEPGYLVTKWAVDRDSGQNAWLSYRLLKASEPGLFSVG 
LHTGEVRTARALLDRDALKQSLWAVQDHGQPPLSATVTLTVAVADS I PQVLADLGSLESPA 
NSETSDLTLYLWAVAAVSCVFLAFVILLLALRLRRWHKSRLLQASGGGLTGAPASHFVGVD 
GVQAFLQTYSHEVSLTTDSRKSHLIFPQPNYADMLVSQESFEKSEPLLLSGDSVFSKDSHGL 
lEVSLYQIFFLFFFNCSVSQAGVQRYDHSSLRPQTPRLKQLSHLCLRCNRDYRCKPPTVCLS 
I YLS I YLS I YLS I YLLLSCTDGSLTPVI PVLWEAEAGGS PEVGSLRPA 
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CCCAGGCTCTAGTGCAGGAGGAGAAGGAGGAGGAGCAGGAGGTGGAGATTCCCAGTTAAAAG 
GCTCCAGAATCGTGTACCAGGCAGAGAACTGAAGTACTGGGGCCTCCTCCACTGGGTCCGAA 
TCAGTAGGTGACCCCGCCCCTGGATTCTGGAAGACCTCACCAISGGACGCCCCCGACCTCGT 
GCGGCCAAGACGTGGATGTTCCTGCTCTTGCTGGGGGGAGCCTGGGCAGGACACTCCAGGGC 
ACAGGAGGACAAGGTGCTGGGGGGTCATGAGTGCCAACCCCATTCGCAGCCTTGGCAGGCGG 
CCTTGTTCCAGGGCCAGCAACTACTCTGTGGCGGTGTCCTTGTAGGTGGCAACTGGGTCCTT 
ACAGCTGCCCACTGTAAAAAACCGAAATACACAGTACGCCTGGGAGACCACAGCCTACAGAA 
TAAAGATGGCCCAGAGCAAGAAATACCTGTGGTTCAGTCCATCCCACACCCCTGCTACAACA 
GCAGCGATGTGGAGGACCACAACCATGATCTGATGCTTCTTCAACTGCGTGACCAGGCATCC 
CTGGGGTCCAAAGTGAAGCCCATCAGCCTGGCAGATCATTGCACCCAGCCTGGCCAGAAGTG 
CACCGTCTCAGGCTGGGGCACTGTCACCAGTCCCCGAGAGAATTTTCCTGACACTCTCAACT 
GTGCAGAAGTAAAAATCTTTCCCCAGAAGAAGTGTGAGGATGCTTACCCGGGGCAGATCACA 
GATGGCATGGTCTGTGCAGGCAGCAGCAAAGGGGCTGACACGTGCCAGGGCGATTCTGGAGG 
CCCCCTGGTGTGTGATGGTGCACTCCAGGGCATCACATCCTGGGGCTCAGACCCCTGTGGGA 
GGTCCGACAAACCTGGCGTCTATACCAACATCTGCCGCTACCTGGACTGGATCAAGAAGATC 
ATAGGCAGCAAGGGCIS&TTCTAGGATAAGCACTAGATCTCCCTTAATAAACTCACAACTCT 
CTGGTTC 




BNSOOCID- <W0 9946281 A2_L> 



wo 99/462«l 



PCT/US99/050:8 
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</u5r/seqcib2/ssc/DNA/Dnaseqs.min/ss.DNA48336 
<subunit 1 of 1, 260 aa, 1 stop 
<MW: 28048. pi: 7.87, NX(S/T): 1 

MGRPRPRAAKTWMFLLLLGGAWAGHSRAQEDKVLGGHECQPHSQPWQAALFQGQQLLCGGVL 
VGGNWVLTAAHCKKPKYTVRLGDHSLQNKDGPEQEIPWQSIPHPCYNSSDVEDHNHDLMLL 
QLRDQASLGSKVKPISIADHCTQPGQKCTVSGWGTVTSPRENFPDTLNCAEVKIFPQKKCED 
AYPGQITDGMVCAGSSKGADTCQGDSGGPLVCDGALQGITSWGSDPCGRSDKPGVYTNICRY 
LDWIKKIIGSKG 

Important Features: 
Signal peptide: 
amino acids 1--23 

Transmembrane domain: 

amino acids 51-71 

N-glycosylation site, 
amino acids 110-113 

Serine proteases, trypsin family, histidine active site. 

amino acids 69-74 and 207-217 

Tyrosine kinase phosphoxrylation site, 
amino acids 182-188 

Kringle domain proteins motif 
amino acids 205-217 
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FIRURE 160 

GGCGCCGGTGCACCGGGCGGGCTGAGCGCCTCCTGCGGCCCGGCCTGCGCGCCCCGGCCCGC 

CGCGCCGCCCACGCCCCAACCCCGGCCCGCGCCCCCTAGCCCCCGCCCGGGCCCGCGCCCGC 

GCCCGCGCCCAGGTGAGCGCTCCGCCCGCCGCGAGGCCCCGCCCCGGCCC3CCCCCGCCCCG 

CCCCGGCCGGCGGGGGAACCGGGCGGATTCCTCGCGCGTCAAACCACCTGATCCCATAAAAC 

ATTCATCCTCCCGGCGGCCCGCGCTGCGAGCGCCCCGCCAGTCCGCGCCGCCGCCGCCCTCG 

CCCTGTGCGCCCTGCGCGCCCTGCGCACCCGCGGCCCGAGCCCAGCCAGAGCCGGGCGGAGC 

GGAGCGCGCCGAGCCTCGTCCCGCGGCCGGGCCGGGGCCGGGCCGTAGCGGCGGCGCCTGGA 

TGCGGACCCGGCCGCGGGGAGACGGGCGCCCGCCCCGAAACGACTTTCAGTCCCCGACGCGC 

CCCGCCCAACCCCTACGAISAAGAGGGCGTCCGCTGGAGGGAGCCGGCTGCTGGCATGGGTG 

CTGTGGCTGCAGGCCTGGCAGGTGGCAGCCCCATGCCCAGGTGCCTGCGTATGCTACAATGA 

GCCCAAGGTGACGACAAGCTGCCCCCAGCAGGGCCTGCAGGCTGTGCCCGTGGGCATCCCTG 

CTGCCAGCCAGCGCATCTTCCTGCACGGCAACCGCATCTCGCATGTGCCAGCTGCCAGCTTC 

CGTGCCTGCCGCAACCTCACCATCCTGTGGCTGCACTCGAATGTGCTGGCCCGAATTGATGC 

GGCTGCCTTCACTGGCCTGGCCCTCCTGGAGCAGCTGGACCTCAGCGATAATGCACAGCTCC 

GGTCTGTGGACCCTGCCACATTCCACGGCCTGGGCCGCCTACACACGCTGCACCTGGACCGC 

TGCGGCCTGCAGGAGCTGGGCCCGGGGCTGTTCCGCGGCCTGGCTGCCCTGCAGTACCTCTA 

CCTGCAGGACAACGCGCTGCAGGCACTGCCTGATGACACCTTCCGCGACCTGGGCAACCTCA 

CACACCTCTTCCTGCACGGCAACCGCATCTCCAGCGTGCCCGAGCGCGCCTTCCGTGGGCTG 

CACAGCCTCGACCGTCTCCTACTGCACCAGAACCGCGTGGCCCATGTGCACCCGCATGCCTT 

CCGTGACCTTGGCCGCCTCATGACACTCTATCTGTTTGCCAACAATCTATCAGCGCTGCCCA 

CTGAGGCCCTGGCCCCCCTGCGTGCCCTGCAGTACCTGAGGCTCAACGACAACCCCTGGGTG 

TGTGACTGCCGGGCACGCCCACTCTGGGCCTGGCTGCAGAAGTTCCGCGGCTCCTCCTCCGA 

GGTGCCCTGCAGCCTCCCGCAACGCCTGGCTGGCCGTGACCTCAAACGCCTAGCTGCCAATG 

ACCTGCAGGGCTGCGCTGTGGCCACCGGCCCTTACCATCCCATCTGGACCGGCAGGGCCACC 

GATGAGGAGCCGCTGGGGCTTCCCAAGTGCTGCCAGCCAGATGCCGCTGACAAGGCCTCAGT 

ACTGGAGCCTGGAAGACCAGCTTCGGCAGGCAATGCGCTGAAGGGACGCGTGCCGCCCGGTG 

ACAGCCCGCCGGGCAACGGCTCTGGCCCACGGCACATCAATGACTCACCCTTTGGGACTCTG 

CCTGGCTCTGCTGAGCCCCCGCTCACTGCAGTGCGGCCCGAGGGCTCCGAGCCACCAGGGTT 

CCCCACCTCGGGCCCTCGCCGGAGGCCAGGCTGTTCACGCAAGAACCGCACCCGCAGCCACT 

GCCGTCTGGGCCAGGCAGGCAGCGGGGGTGGCGGGACTGGTGACTCAGAAGGCTCAGGTGCC 

CTACCCAGCCTCACCTGCAGCCTCACCCCCCTGGGCCTGGCGCTGGTGCTGTGGACAGTGCT 

TGGGCCCTGCIS&CCCCCAGCGGACACAAGAGCGTGCTCAGCAGCCAGGTGTGTGTACATAC 

GGGGTCTCTCTCCACGCCGCCAAGCCAGCCGGGCGGCCGACCCGTGGGGCAGGCCAGGCCAG 

GTCCTCCCTGATGGACGCCTGCCGCCCGCCACCCCCATCTCCACCCCATCATGTTTACAGGG 

TTCGGCGGCAGCGTTTGTTCCAGAACGCCGCCTCCCACCCAGATCGCGGTATATAGAGATAT 

GCATTTTATTTTACTTGTGTAAAAATATCGGACGACGTGGAATAAAGAGCTCTTTTCTTAAA 

AAAA 



WO 99/46281 




BNSDOCIO- <W0 9W62eiA2.L> 



wo 99/46281 



PCT/US99/05028 



FIGURE 161 



></usr/seqclb2/sst/DNA/Dnaseqs .min/ss • DNA44184 
xsubunit 1 of 1, 473 aa, 1 stop 
XMW: 50708, pi: 9.28, NX{S/T) : 6 

MKRASAGGSRLLAWVLWLQAWQVAAPCPGACVCYNEPKVTTSCPQQGLQAVPVGIPAASQRI 
FLHGNRISHVPAASFRACRNLTILWLHSNVLARIDAAAFTGLALLEQLDLSDNAQLRSVDPA 
TFHGLGRLHTLHLDRCGLQELGPGLFRGLAALQYLYLQDNALQALPDDTFRDLGNLTHLFLH 
GNRISSVPERAFRGLHSLDRLLLHQNRVAHVHPHAFRDLGRLMTLYLFANNLSALPTEALAP 
LRALQYLRLNDNPWVCDCRARPLWAWLQKFRGSSSEVPCSLPQRLAGRDLKRLAANDLQGCA 
VATGPYHPIWTGRATDEEPLGLPKCCQPDAADKASVLEPGRPASAGNALKGRVPPGDSPPGN 
GSGPRHINDSPFGTLPGSAEPPLTAVRPEGSEPPGFPTSGPRRRPGCSRKNRTRSKCRLGQA 
GSGGGGTGDSEGSGALPSLTCSLTPLGLALVLWTVLGPC 

Important features: 
Signal peptide: 

amino acids 1-26 

Leucine zipper pattern. 

amino acids 135-156 

Glycosaminoglycan attachment site, 
amino acids 436-439 

N-glycosylation site. 

amino acids 82-85, 179-183, 237-240, 372-375 and 423-426 

VWFC domain 

amino acids 411-425 
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FIGURE 162 



GGAAGTCCACGGGGAGCTTGGATGCCAAAGGGAGGACGGCTGGGTCCTCTGGAGAGGACTAC 

TCACTGGCATATTTCTGAGGTATCTGTAGAATAACCACAGCCTCAGATACTGGGGACTTTAC 

AGTCCCACAGAACCGTCCTCCCAGGAAGCTGAATCCAGCAAGAACAAIGGAGGCCAGCGGGA 

AGCTCATTTGCAGACAAAGGCAAGTCCTTTTTTCCTTTCTCCTTTTGGGCTTATCTCTGGCG 

GGCGCGGCGGAACCTAGAAGCTATTCTGTGGTGGAGGAAACTGAGGGCAGCTCCTTTGTCAC 

CAATTTAGCAAAGGACCTGGGTCTGGAGCAGAGGGAATTCTCCAGGCGGGGGGTTAGGGTTG 

TTTCCAGAGGGAACAAACTACATTTGCAGCTCAATCAGGAGACCGCGGATTTGTTGCTAAAT 

GAGAAATTGGACCGTGAGGATCTGTGCGGTCACACAGAGCCCTGTGTGCTACGTTTCCAAGT 

GTTGCTAGAGAGTCCCTTCGAGTTTTTTCAAGCTGAGCTGCAAGTAATAGACATAAACGACC 

ACTCTCCAGTATTTCTGGACAAACAAATGTTGGTGAAAGTATCAGAGAGCAGTCCTCCTGGG 

ACTACGTTTCCTCTGAAGAATGCCGAAGACTTAGATGTAGGCCAAAACAATATTGAGAACTA 

TATAATCAGCCCCAACTCCTATTTTCGGGTCCTCACCCGCAAACGCAGTGATGGCAGGAAAT 

ACCCAGAGCTGGTGCTGGACAAAGCGCTGGACCGAGAGGAAGAAGCTGAGCTCAGGTTAACA 

CTCACAGCACTGGATGGTGGCTCTCCGCCCAGATCTGGCACTGCTCAGGTCTACATCGAAGT 

CCTGGATGTCAACGATAATGCCCCTGAATTTGAGCAGCCTTTCTATAGAGTGCAGATCTCTG 

AGGACAGTCCGGTAGGCTTCCTGGTTGTGAAGGTCTCTGCCACGGATGTAGACACAGGAGTC 

AACGGAGAGATTTCCTATTCACTTTTCCAAGCTTCAGAAGAGATTGGCAAAACCTTTAAGAT 

CAATCCCTTGACAGGAGAAATTGJ^ACTAAAAAAACAACTCGATTTCGAAAAACTTCAGTCCT 

ATGAAGTCAATATTGAGGCAAGAGATGCTGGAACCTTTTCTGGAAAATGCACCGTTCTGATT 

CAAGTGATAGATGTGAACGACCATGCCCCAGAAGTTACCATGTCTGCATTTACCAGCCCAAT 

ACCTGAGAACGCGCCTGAAACTGTGGTTGCACTTTTCAGTGTTTCAGATCTTGATTCAGGAG 

AAAATGGGAAAATTAGTTGCTCCATTCAGGAGGATCTACCCTTCCTCCTGAAATCCGCGGAA 

AAC T T T TACACCC TAG TAACGGAGAGACCACTAGACAGAGAAAGCAGAGCGGAATACAACAT 

CACTATCACTGTCACTGACTTGGGGACCCCTATGCTGATAACACAGCTCAATATGACCGTGC 

TGATCGCCGATGTCAATGACAACGCTCCCGCCTTCACCCAAACCTCCTACACCCTGTTCGTC 

CGCGAGAACAACAGCCCCGCCCTGCACATCCGCAGCGTCAGCGCTACAGACAGAGACTCAGG 

CACCAACGCCCAGGTCACCTACTCGCTGCTGCCGCCCCAGGACCCGCACCTGCCCCTCACAT 

CCCTGGTCTCCATCAACGCGGACAACGGCCACCTGTTCGCCCTCAGGTCTCTGGACTACGAG 

GCCCTGCAGGGGTTCCAGTTCCGCGTGGGCGCTTCAGACCACGGCTCCCCGGCGCTGAGCAG 

CGAGGCGCTGGTGCGCGTGGTGGTGCTGGACGCCAACGACAACTCGCCCTTCGTGCTGTACC 

CGCTGCAGAACGGCTCCGCGCCCTGCACCGAGCTGGTGCCCCGGGCGGCCGAGCCGGGCTAC 

CTGGTGACCAAGGTGGTGGCGGTGGACGGCGACTCGGGCCAGAACGCCTGGCTGTCGTACCA 

GCTGCTCAAGGCCACGGAGCTCGGTCTGTTCGGCGTGTGGGCGCACAATGGCGAGGTGCGCA 

CCGCCAGGCTGCTGAGCGAGCGCGACGCGGCCAAGCACAGGCTGGTGGTGCTGGTCAAGGAC 

73ATGGCGAGCCTCCGCGCTCGGCCACCGCCACGCTGCACGTGCTCCTGGTGGACGGCTTCTC 

CCAGCCCTACCTGCCTCTCCCGGAGGCGGCCCCGACCCAGGCCCAGGCCGACTTGCTCACCG 

TCTACCTGGTGGTGGCGTTGGCCTCGGTGTCTTCGCTCTTCCTCTTTTCGGTGCTCCTGTTC 

GTGGCGGTGCGGCTGTGTAGGAGGAGCAGGGCGGCCTCGGTGGGTCGCTGCTTGGTGCCCGA 

GGGCCCCCTTCCAGGGCATCTTGTGGACATGAGCGGCACCAGGACCCTATCCCAGAGCTACC 

AGTATGAGGTGTGTCTGGCAGGAGGCTCAGGGACCAATGAGTTCAAGTTCCTGAAGCCGATT 

ATCCCCAACTTCCCTCCCCAGTGCCCTGGGAAAGAAATACAAGGAAATTCTACCTTCCCCAA 

TAACTTTGGGTTCAATATTCAGTS&CCATAGTTGACTTTTACATTCCATAGGTATTTTATTT 

TGTGGCATTTCCATGCCAATGTTTATTTCCCCCAATTTGTGTGTATGTAATATTGTACGGAT 

TTACTCTTGATTTTTCTCATGTTCTTTCTCCCTTTGTTTTAAAGTGAACATTTACCTTTATT 

CCTGGTTCTT 
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FIGURE 163 

</usr/seqdb2/sst/DNA/Dnaseqs.inin/ss.DNA48314 
<subunit 1 of 1, 798 aa, 1 stop 
<MW: 87552, pi: 4.84, NX(S/T): 5 

MEASGKLICRQRQVLFSFLLLGLSLAGAAEPRSYSWEETEGSSFVTNLAKDLGLEQREFSR 
RGVR WS RGNKLHLQLNQE TADLLLNEKL DRE DLCGHTE PCVLRFQVLLES P FE FFQAELQV 

IDINDHSPVFLDKQMLVKVSESSPPGTTFPLKNAEDLDVGQNNIENYIISPNSYFRVLTRKR 
SDGRKYPELVLDKALDREEEAELRLTLTALDGGSPPRSGTAQVYIEVLDVNDNAPEFEQPFY 
RVQISEDSPVGFLWKVSATDVDTGVNGEISYSLFQASEEIGKTFKINPLTGEIELKKQLDF 
EKLQS YE VN I EARDAGT FS GKCTVLI QVI DVNDHAPEVTMSAFTS P I PENAPET WALFSVS 

DLDSGENGKISCSIQEDLPFLLKSAENFYTLLTERPLDRESRAEYNITITVTDLGTPMLITQ 
LNMTVLIADVNDNAPAFTQTSYTLFVRENNSPALHIRSVSATDRDSGTNAQVTYSLLPPQDP 
HLPLTSLVSINADNGHLFALRSLDYEALQGFQFRVGASDHGSPALSSEALVRVWLDANDNS 
PFVLYPLQNGSAPCTELVPRAAEPGYLVTKWAVDGDSGQNAWLSYQLLKATELGLFGVWAH 
NGEVRTARLLSERDAAKHRLWLVKDNGEPPRSATATLHVLLVDGFSQPYLPLPEAAPTQAQ 
ADLLTVYLWALASVSSLFLFSVLLFVAVRLCRRSRAASVGRCLVPEGPLPGHLVDMSGTRT 
LSQSYQYEVCLAGGSGTNEFKFLKPIIPNFPPQCPGKEIQGNSTFPNNFGFNIQ 

Inportant features: 
Signal peptide: 
amino acids 1-26 

Transmembrane domain: 
amino acids 685-712 

Cadherins extracellular repeated domain signature. 

amino acids 122-132, 231-241, 336-346, 439-449 and 549-559 

ATP/GTP-binding site motif A (P-loop) . 
amino acids 285-292 

N-glycosylation site. 

amino acids 418-421, 436-439, 567-570 and 786-789 
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ACCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCGCGTAGCCGTGC 

GCCGATTGCCTCTCGGCCTGGGCA^GTCCCGGCTGCCGGTCGACGACCGCCCCGCGTCAT 

GCGGCTCCTCGGCTGGTGGCAAGTATTGCTGTGGGTGCTGGGACTTCCCGTCCGCGGCGTGG 

AGGTTGCAGAGGAAAGTGGTCGCTTATGGTCAGAGGAGCAGCCTGCTCACCCTCTCCAGGTG 

GGGGCTGTGTACCTGGGTGAGGAGGAGCTCCTGCATGACCCGATGGGCCAGGACAGGGCAGC 

AGAAGAGGCCAATGCGGTGCTGGGGCTGGACACCCAAGGCGATCACATGGTGATGCTGTCTG 

TGAT TCC TGGGGAAGC TGAGGACAAAGTGAGTTCAGAGCCTAGCGGCGTCACCTGTGGTGCT 

GGAGGAGCGGAGGACTCAAGGTGCAACGTCCGAGAGAGCCTTTTCTCTCTGGATGGCGCTGG 

AGCACACTTCCCTGACAGAGAAGAGGAGTATTACACAGAGCCAGAAGTGGCGGAATCTGACG 

CAGCCCCGACAGAGGACTCCAATAACACTGAAAGTCTGAAATCCCCAAAGGTGAACTGTGAG 

GAGAGAAACATTACAGGATTAGAAAATTTCACTCTGAAAATTTTAAATATGTCACAGGACCT 

TATGGATTTTCTGAACCCAAACGGTAGTGACTGTACTCTAGTCCTGTTTTACACCCCGTGGT 

GCCGCTTTTCTGCCAGTTTGGCCCCTCACTTTAACTCTCTGCCCCGGGCATTTCCAGCTCTT 

CACTTTTTGGCACTGGATGCATCTCAGCACAGCAGCCTTTCTACCAGGTTTGGCACCGTAGC 

TGTTCCTAATATTTTATTATTTCAAGGAGCTAAACCAATGGCCAGATTTAATCATACAGATC 

GAACACTGGAAACACTGAAAATCTTCATTTTTAATCAGACAGGTATAGAAGCCAAGAAGAAT 

GTGGTGGTAACTCAAGCCGACCAAATAGGCCCTCTTCCCAGCACTTTGATAAAAAGTGTGGA 

CTGGTTGCTTGTATTTTCCTTATTCTTTTTAATTAGTTTTATTATGTATGCTACCATTCGAA 

CTGAGAGTATTCGGTGGCTAATTCCAGGACAAGAGCAGGAACATGTGGAGl&fiTGATGGTCT 

GAAAGAAGTTGGAAAGAGGAACTTCAATCCTTCGTTTCAGAAATTAGTGCTACAGTTTCATA 

CATTTTCTCCAGTGACGTGTTGACTTGAAACTTCAGGCAGATTAAAAGAATCATTTGTTGAA 

CAACTGAATGTATAAAAAAATTATAAACTGGTGTTTTAACTAGTATTGCAATAAGCAAATGC 

AAAAATATTCAATAG 
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FIGURE 16B 

■ ></usr/seqdb2/sst/DNA/Dnaseqs.min/ss,DNA4 8333 
xsubunit 1 of 1, 360 aa, 1 stop 
XMW: 39885, pi: 4.79, NX(S/T): 7 

MVPAAGRRPPRVMRLLGWWQVLLWVLGLPVRGVEVAEESGRLWSEEQPAHPLQVGAVYLGEE 
ELLHDPMGQDRAAEEANAVLGLDTQGDHMVMLSVIPGEAEDKVSSEPSGVTCGAGGAEDSRC 
NVRE S L FS LDGAG AH FPDRE EE YYTE PEVAESDAAPTEDSNNTES LKS PKVNCEERNI TGLE 

NFTLKILNMSQDLMDFLNPNGSDCTLVLFYTPWCRFSASLAPHFNSLPRAFPALHFLALDAS 
OHSSLSTRFGTVAVPNILLFQGAKPMARFNHTDRTLETLKIFIFNQTGIEAKKNVWTQADQ 
IGPLPSTLIKSVDWLLVFSLFFLISFIMYATIRTESIRWLIPGQEQEHVE 

Important features: 
Signal peptide: 
amino acids 1-25 

Transmembrane domain: 
amino acids 321-340 

Homologous region to dilsufide isomerase 

amino acids 212-302 

N-glycosylation site. 

amino acids 165-168, 181-184, 187-190, 194-197, 206-209, 278-281 
and 293-296 

Thioredoxin domain 
amino acids 211-227 
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FIfiURE 166 



CCCGGCTCCGCTCCCTCTGCCCCCTCGGGGTCGCGCGCCCACG&ISCTGCAGGGCCCTGGCT 

CGCTGCTGCTGCTCTTCCTCGCCTCGCACTGCTGCCTGGGCTCGGCGCGCGGGCTCTTCCTC 

TTTGGCCAGCCCGACTTCTCCTACAAGCGCAGCAATTGCAAGCCCATCCCGGTCAACCTGCA 

GCTGTGCCACGGCATCGAATACCAGAACATGCGGCTGCCCAACCTGCTGGGCCACGAGACCA 

TGAAGGAGGTGCTGGAGCAGGCCGGCGCTTGGATCCCGCTGGTCATGAAGCAGTGCCACCCG 

GACACCAAGAAGTTCCTGTGCTCGCTCTTCGCCCCCGTCTGCCTCGATGACCTAGACGAGAC 

CATCCAGCCATGCCACTCGCTCTGCGTGCAGGTGAAGGACCGCTGCGCCCCGGTCATGTCCG 

CCTTCGGCTTCCCCTGGCCCGACATGCTTGAGTGCGACCGTTTCCCCCAGGACAACGACCTT 

TGCATCCCCCTCGCTAGCAGCGACCACCTCCTGCCAGCCACCGAGGAAGCTCCAAAGGTATG 

TGAAGCCTGCAAAAATAAAAATGATGATGACAACGACATAATGGAAACGCTTTGTAAAAATG 

ATTTTGCACTGAAAATAAAAGTGAAGGAGATAACCTACATCAACCGAGATACCAAAATCATC 

CTGGAGACCAAGAGCAAGACCATTTACAAGCTGAACGGTGTGTCCGAAAGGGACCTGAAGAA 

ATCGGTGCTGTGGCTCAAAGACAGCTTGCAGTGCACCTGTGAGGAGATGAACGACATCAACG 

CGCCCTATCTGGTCATGGGACAGAAACAGGGTGGGGAGCTGGTGATCACCTCGGTGAAGCGG 

TGGCAGAAGGGGCAGAGAGAGTTCAAGCGCATCTCCCGCAGCATCCGCAAGCTGCAGTGCI& 

fiTCCCGGCATCCTGATGGCTCCGACAGGCCTGCTCCAGAGCACGGCTGACCATTTCTGCTCC 

GGGATCTCAGCTCCCGTTCCCCAAGCACACTCCTAGCTGCTCCAGTCTCAGCCTGGGCAGCT 

TCCCCCTGCCTTTTGCACGTTTGCATCCCCAGCATTTCCTGAGTTATAAGGCCACAGGAGTG 

GATAGCTGTTTTCACCTAAAGGAAAAGCCCACCCGAATCTTGTAGAAATATTCAAACTAATA 

AAATCATGAATATTTTAA 
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FIGURE 167 

></usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA50920 
xsubunit 1 of l, 295 aa, 1 stop 
XMW: 33518, p J : 7.74, NX(S/T): 0 

MLQGPGSLLLLFLASHCCLGSARGLFLFGQPDFSYKRSNCKPIPVNLQLCHGIEYQNMRLPN 
LLGHETMKEVLEQAGAWIPLVMKQCHPDTKKFLCSLFAPVCLDDLDETIQPCHSLCVQVKDR 
CAPVMSAFGFPWPDMLECDRFPQDNDLCIPLASSDHLLPATEEAPKVCEACKNKNDDDNDIM 
ETLCKNDFALKIKVKEITYINRDTKIILETKSKTIYKLNGVSERDLKKSVLWLKDSLQCTCE 
EMNDINAPYLVMGQKQGGELVITSVKRWQKGQREFKRISRSIRKLQC 

Important features: 
Signal peptide: 

amino acids 1-20 

Cysteine rich domain, homolgous to frizzled N terminus 
amino acids 6-153 
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FIGURE 168 



GTGGAGGCCGCCGACGAISGCGGGGCCGACGGAGGCCGAGACGGGGTTGGCCGAGCCCCGGG 

CCCTGTGCGCGCAGCGGGGCCACCGCACCTACGCGCGCCGCTGGGTGTTCCTGCTCGCGATC 

AGCCTGCTCAACTGCTCCAACGCCACGCTGTGGCTCAGCTTTGCACCTGTGGCTGACGTCAT 

TGCTGAGGACTTGGTCCTGTCCATGGAGCAGATCAACTGGCTGTCACTGGTCTACCTCGTGG 

TATCCACCCCATTTGGCGTGGCGGCCATCTGGATCCTGGACTCCGTCGGGCTCCGTGCGGCG 

ACCATCCTGGGTGCGTGGCTGAACTTTGCCGGGAGTGTGCTACGCATGGTGCCCTGCATGGT 

TGTTGGGACCCAAAACCCATTTGCCTTCCTCATGGGTGGCCAGAGCCTCTGTGCCCTTGCCC 

AGAGCCTGGTCATCTTCTCTCCAGCCAAGCTGGCTGCCTTGTGGTTCCCAGAGCACCAGCGA 

GCCACGGCCAACATGCTCGCCACCATGTCGAACCCTCTGGGCGTCCTTGTGGCCAATGTGCT 

GTCCCCTGTGCTGGTCAAGAAGGGTGAGGACATTCCGTTAATGCTCGGTGTCTATACCATCC 

CTGCTGGCGTCGTCTGCCTGCTGTCCACCATCTGCCTGTGGGAGAGTGTGCCCCCCACCCCG 

CCCTCTGCCGGGGCTGCCAGCTCCACCTCAGAGAAGTTCCTGGATGGGCTCAAGCTGCAGCT 

CATGTGGAACAAGGCCTATGTCATCCTGGCTGTGTGCTTGGGGGGAATGATCGGGATCTCTG 

CCAGCTTCTCAGCCCTCCTGGAGCAGATCCTCTGTGCAAGCGGCCACTCCAGTGGGTTTTCC 

GGCCTCTGTGGCGCTCTCTTCATCACGTTTGGGATCCTGGGGGCACTGGCTCTCGGCCCCTA 

TGTGGACCGGACCAAGCACTTCACTGAGGCCACCAAGATTGGCCTGTGCCTGTTCTCTCTGG 

CCTGCGTGCCCTTTGCCCTGGTGTCCCAGCTGCAGGGACAGACCCTTGCCCTGGCTGCCACC 

TGCTCGCTGCTCGGGCTGTTTGGCTTCTCGGTGGGCCCCGTGGCCATGGAGTTGGCGGTCGA 

GTGTTCCTTCCCCGTGGGGGAGGGGGCTGCCACAGGCATGATCTTTGTGCTGGGGCAGGCCG 

AGGGAATACTCATCATGCTGGCAATGACGGCACTGACTGTGCGACGCTCGGAGCCGTCCTTG 

TCCACCTGCCAGCAGGGGGAGGATCCACTTGACTGGACAGTGTCTCTGCTGCTGATGGCCGG 

CCTGTGCACCTTCTTCAGCTGCATCCTGGCGGTCTTCTTCCACACCCCATACCGGCGCCTGC 

AGGCCGAGTCTGGGGAGCCCCCCTCCACCCGTAACGCCGTGGGCGGCGCAGACTCAGGGCCG 

GGTGTGGACCGAGGGGGAGCAGGAAGGGCTGGGGTCCTGGGGCCCAGCACGGCGACTCCGGA 

GTGCACGGCGAGGGGGGCCTCGCTAGAGGACCCCAGAGGGCCCGGGAGCCCCCACCCAGCCT 

GCCACCGAGCGACTCCCCGTGCGCAAGGCCCAGCAGCCACCGACGCGCCCTCCCGCCCCGGC 

AGACTCGCAGGCAGGGTCCAAGCGTCCAGGTTTATTGACCCGGCTGGGTCTCACTCCTCCTT 

CTCCTCCCCGTGGGTGATCACGIASCTGAGCGCCTTGTAGTCCAGGTTGCCCGCCACATCGA 

TGGAGGCGAACTGGAACATCTGGTCCACCTGCGGGCGGGGGCGAAAGGGCTCCTTGCGGGCT 

CCGGGAGCGAATTACAAGCGCGCACCTGAAAA 
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FIGURE 169 

></usr/seqdb2/sst/DNA/Dnaseqs.niin/ss.DNA50988 
xsubunit 1 of 1, 560 aa, 1 stop 
XMW: 58427, pi: 6.86, NX(S/T): 2 

MAGPTEAETGLAEPRALCAQRGHRTYARRWVFLLAISLLNCSNATLWLSFAPVADVIAEDLV 
LSMEQ I NWLS L VYL WS T P FGVAAI W I LDS VGLRAAT I LGAWLNFAGS VLRMVPCMWGTQN 
P FAFLMGGQS LCALAQS LVI FS PAKLAALWFPEHQRATANMLATMSNPLGVLVANVLS PVLV 
KKGEDIPLMLGVYTIPAGWCLLSTICLWESVPPTPPSAGAASSTSEKFLDGLKLQLMWNKA 
YVILAVCLGGMIGISASFSALLEQILCASGHSSGFSGLCGALFITFGILGALALGPYVDRTK 
HFTEATK I GLCLFS LACVP FALVSQLQGQTLALAATCS LLGL FGFS VGPVAMELAVECS FPV 
GEGAATGiMIFVLGQAEGILIMLAMTALTVRRSEPSLSTCQQGEDPLDWTVSLLLMAGLCTFF 
SCILAVFFHTPYRRLQAESGEPPSTRNAVGGADSGPGVDRGGAGRAGVLGPSTATPECTARG 
ASLEDPRGPGS PHPACHRATPRAQGPAATDAPSRPGRLAGRVQASRFI DPAGSHSS FSS PWVIT 

Important features: 

Potential Transmembrane domains: 

amino acids 30-50, 61-79, 98-112, 126-146, 169-182, 201-215, 248- 

268, 280-300, 318-337, 341-357, 375-387, 420-441 

N-glycosylation site, 
amino acids 40-43 and 43-46 

Glycosaminoglycan attachment site, 
amino acids 468-471 
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GTCCCACATCCTGCTCAACTGGGTCAGGTCCCTCTTAGACCAGCTCTTGTCCATCATTTGCT 

GAAGTGGACCAACTAGTTCCCCAGTAGGGGGTCTCCCCTGGCAATTCTTGATCGGCGTTTGG 

ACATCTCAGATCGCTTCCAATGAAGATGGCCTTGCCTTGGGGTCCTGCTTGTTTCATAATCA 

TCTAACTATGGGACAAGGTTGTGCCGGCAGCTCTGGGGGAAGGAGCACGGGGCTGATCAAGC 

CATCCAGGAAACACTGGAGGACTTGTCCAGCCTTGAAAGAACTCTAGTGGTTTCTGAATCTA 

GCCCACTTGGCGGTAAGCMiSATGCAACTTCTGCAACTTCTGCTGGGGCTTTTGGGGCCAGG 

TGGCTACTTATTTCTTTTAGGGGATTGTCAGGAGGTGACCACTCTCACGGTGAAATACCAAG 

TGTCAGAGGAAGTGCCATCTGGTACAGTGATCG6GAAGCTGTCCCAGGAACTGGGCCGGGAG 

GAGAGGCGGAGGCAAGCTGGGGCCGCCTTCCAGGTGTTGCAGCTGCCTCAGGCGCTCCCCAT 

TCAGGTGGACTCTGAGGAAGGCTTGCTCAGCACAGGCAGGCGGCTGGATCGAGAGCAGCTGT 

GCCGACAGTGGGATCCCTGCCTGGTTTCCTTTGATGTGCTTGCCACAGGGGATTTGGCTCTG 

ATCCATGTGGAGATCCAAGTGCTGGACATCAATGACCACCAGCCACGGTTTCCCAAAGGCGA 

GCAGGAGCTGGAAATCTCTGAGAGCGCCTCTCTGCGAACCCGGATCCCCCTGGACAGAGCTC 

TTGACCCAGACACAGGCCCTAACACCCTGCACACCTACACTCTGTCTCCCAGTGAGCACTTT 

GCCTTGGATGTCATTGTGGGCCCTGATGAGACCAAACATGCAGAACTCATAGTGGTGAAGGA 

GCTGGACAGGGAAATCCATTCATTTTTTGATCTGGTGTTAACTGCCTATGACAATGGGAACC 

CCCCCAAGTCAGGTACCAGCTTGGTCAAGGTCAACGTCTTGGACTCCAATGACAATAGCCCT 

GCGTTTGCTGAGAGTTCACTGGCACTGGAAATCCAAGAAGATGCTGCACCTGGTACGCTTCT 

CATAAAACTGACCGCCACAGACCCTGACCAAGGCCCCAATGGGGAGGTGGAGTTCTTCCTCA 

GTAAGCACATGCCTCCAGAGGTGCTGGACACCTTCAGTATTGATGCCAAGACAGGCCAGGTC 

ATTCTGCGTCGACCTCTAGACTATGAAAAGAACCCTGCCTACGAGGTGGATGTTCAGGCAAG 

GGACCTGGGTCCCAATCCTATCCCAGCCCATTGCAAAGTTCTCATCAAGGTTCTGGATGTCA 

ATGACAACATCCCAAGCATCCACGTCACATGGGCCTCCCAGCCATCACTGGTGTCAGAAGCT 

CTTCCCAAGGACAGTTTTATTGCTCTTGTCATGGCAGATGACTTGGATTCAGGACACAATGG 

TTTGGTCCACTGCTGGCTGAGCCAAGAGCTGGGCCACTTCAGGCTGAAAAGAACTAATGGCA 

ACACATACATGTTGCTAACCAATGCCACACTGGACAGAGAGCAGTGGCCCAAATATACCCTC 

ACTCTGTTAGCCCAAGACCAAGGACTCCAGCCCTTATCAGCCAAGAAACAGCTCAGCATTCA 

GATCAGTGACATCAACGACAATGCACCTGTGTTTGAGAAAAGCAGGTATGAAGTCTCCACGC 

GGGAAAACAACTTACCCTCTCTTCACCTCATTACCATCAAGGCTCATGATGCAGACTTGGGC 

ATTAATGGAAAAGTCTCATACCGCATCCAGGACTCCCCAGTTGCTCACTTAGTAGCTATTGA 

CTCCAACACAGGAGAGGTCACTGCTCAGAGGTCACTGAACTATGAAGAGATGGCCGGCTTTG 

AGTTCCAGGTGATCGCAGAGGACAGCGGGCAACCCATGCTTGCATCCAGTGTCTCTGTGTGG 

GTCAGCCTCTTGGATGCCAATGATAATGCCCCAGAGGTGGTCCAGCCTGTGCTCAGCGATGG 

AAAAGCCAGCCTCTCCGTGCTTGTGAATGCCTCCACAGGCCACCTGCTGGTGCCCATCGAGA 

CTCCCAATGGCTTGGGCCCAGCGGGCACTGACACACCTCCACTGGCCACTCACAGCTCCCGG 

CCATTCCTTTTGACAACCATTGTGGCAAGAGATGCAGACTCGGGGGCAAATGGAGAGCCCCT 

CTACAGCATCCGCAATGGAAATGAAGCCCACCTCTTCATCCTCAACCCTCATACGGGGCAGC 

TGTTCGTCAATGTCACCAATGCCAGCAGCCTCATTGGGAGTGAGTGGGAGCTGGAGATAGTA 

GTAGAGGACCAGGGAAGCCCCCCCTTACAGACCCGAGCCCTGTTGAGGGTCATGTTTGTCAC 

CAGTGTGGACCACCTGAGGGACTCAGCCCGCAAGCCTGGGGCCTTGAGCATGTCGATGCTGA 

CGGTGATCTGCCTGGCTGTACTGTTGGGCATCTTCGGGTTGATCCTGGCTTTGTTCATGTCC 

ATCTGCCGGACAGAAAAGAAGGACAACAGGGCCTACAACTGTCGGGAGGCCGAGTCCACCTA 

CCGCCAGCAGCCCAAGAGGCCCCAGAAACACATTCAGAAGGCAGACATCCACCTCGTGCCTG 

TGCTCAGGGGTCAGGCAGGTGAGCCTTGTGAAGTCGGGCAGTCCCACAAAGATGTGGACAAG 

GAGGCGATGATGGAAGCAGGCTGGGACCCCTGCCTGCAGGCCCCCTTCCACCTCACCCCGAC 

CCTGTACAGGACGCTGCGTAATCAAGGCAACCAGGGAGCACCGGCGGAGAGCCGAGAGGTGC 

TGCAAGACACGGTCAACCTCCTTTTCAACCATCCCAGGCAGAGGAATGCCTCCCGGGAGAAC 

CTGAACCTTCCCGAGCCCCAGCCTGCCACAGGCCAGCCACGTTCCAGGCCTCTGAAGGTTGC 

AGGCAGCCCCACAGGGAGGCTGGCTGGAGACCAGGGCAGTGAGGAAGCCCCACAGAGGCCAC 

CAGCCTCCTCTGCAACCCTGAGACGGCAGCGACATCTCAATGGCAAAGTGTCCCCTGAGAAA 

GAATCAGGGCCCCGTCAGATCCTGCGGAGCCTGGTCCGGCTGTCTGTGGCTGCCTTCGCCGA 

GCGGAACCCCGTGGAGGAGCTCACTGTGGATTCTCCTCCTGTTCAGCAAATCTCCCAGCTGC 

TGTCCTTGCTGCATCAGGGCCAATTCCAGCCCAAACCAAACCACCGAGGAAATAAGTACTTG 

GCCAAGCCAGGAGGCAGCAGGAGTGCAATCCCAGACACAGATGGCCCAAGTGCAAGGGCTGG 
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FIGURE 1 7QB 

AGGCCAGACAGACCCAGAACAGGAGGAAGGGCCTTTGGATCCTGAAGAGGACCTCTCTGTGA 
AGCAACTGCTAGAAGAAGAGCTGTCAAGTCTGCTGGACCCCAGCACAGGTCTGGCCCTGGAC 
CGGCTGAGCGCCCCTGACCCGGCCTGGATGGCGAGACTCTCTTTGCCCCTCACCACCAACTA 
CCGTGACAATGTGATCTCCCCGGATGCTGCAGCCACGGAGGAGCCGAGGACCTTCCAGACGT 
TCGGCAAGGCAGAGGCACCAGAGCTGAGCCCAACAGGCACGAGGCTGGCCAGCACCTTTGTC 
TCGGAGATGAGCTCACTGCTGGAGATGCTGCTGGAACAGCGCTCCAGCATGCCCGTGGAGGC 
CGCCTCCGAGGCGCTGCGGCGGCTCTCGGTCTGCGGGAGGACCCTCAGTTTAGACTTGGCCA 
CCAGTGCAGCCTCAGGCATGAAAGTGCAAGGGGACCCAGGTGGAAAGACGGGGACTGAGGGC 
AAGAGCAGAGGCAGCAGCAGCAGCAGCAGGTGCCTGIS&ACATACCTCAGACGCCTCTGGAT 
CCAAGAACCAGGGGCCTGAGGATCTGTGGACAAGAGCTGGTTTCTAAAATCTTGTAACTCAC 
TAGCTAGCGGCGGCCTGAGAACTTTAGGGTGACTGATGCTACCCCCACAGAGGAGGCAAGAG 
CCCCAGGACTAACAGCTGACTGACCAAAGCAGCCCCTTGTAAGCAGCTCTGAGTCTTTTGGA 
GGACAGGGACGGTTTGTGGCTGAGATAAGTGTTTCCTGGCAAAACATATGTGGAGCACAAAG 
GGTCAGTCCTCTGGCAGAACAGATGCCACGGAGTATCACAGGCAGGAAAGGGTGGCCTTCTT 
GGGTAGCAGGAGTCAGGGGGCTGTACCCTGGGGGTGCCAGGAAATGCTCTCTGACCTATCAA 
TAAAGGAAAAGCAGTAAAAAAAAAAAAAAAAAAAA 
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FIGURE 171 



</usr/seqcib2/sst/DNA/Dnaseqs .min/ss.DNA48331 
<subunit 1 of 1, 1184 aa, 1 stop 
<MW: 129022, pi: 5.20, NX(S/T): 5 

MMQLLQLLLGLLGPGGYLFLLGDCQEVTTLTVKYQVSEEVPSGTVIGKLSQELGREERRRQA 

GAAFQVLQLPQALPIQVDSEEGLLSTGRRLDREQLCRQWDPCLVSFDVLATGDLALIHVEIQ 

VLDINDHQPRFPKGEQELEISESASLRTRIPLDRALDPDTGPNTLHTYTLSPSEHFALDVIV 

G P DE TKHAE L I WKE LDRE I HS FFDLVLTAYDNGNPPKS GTS LVKVNVLDSNDNS PAFAES S 

LALEIQEDAAPGTLLIKLTATDPDQGPNGEVEFFLSKHMPPEVLDTFSIDAKTGQVILRRPL 

DYEKNPAYEVDVQARDLGPNPI PAHCKVLIKVLDVNDNI PS IHVTWASQPSLVSEALPKDSF 

lALVMADDLDSGHNGLVHCWLSQELGHFRLKRTNGNTYMLLTNATLDREQWPKYTLTLLAQD 

QGLQ PLSAKKQLS I Q I S D INDNAPVFEKSRYEVSTRENNLPSLHL IT IKAHDADLGINGKVS 

YR I QDS P VAHLVA I DSNTGEVT AQRS LNYEEMAG FE FQVIAEDSGQPMLAS SVSVWVSLLDA 

NDNAPEWQPVLSDGKASLSVLVNASTGHLLVPIETPNGLGPAGTDTPPLATHSSRPFLLTT 

IVARDADSGANGEPLYSIRNGNEAHLFILNPHTGQLFVNVTNASSLIGSEWELEIWEDQGS 

PPLQTRALLRVMFVTSVDHLRDSARKPGALSMSMLTVICLAVLLGIFGLILALFMSICRTEK 

KDNRAYNCREAESTYRQQPKRPQKHIQKADIHLVPVLRGQAGEPCEVGQSHKDVDKEAMMEA 

GWDPCLQAPFHLTPTLYRTLRNQGNQGAPAESREVLQDTVNLLFNHPRQRNASRENLNLPEP 

QPATGQPRSRPLKVAGSPTGRLAGDQGSEEAPQRPPASSATLRRQRHLNGKVSPEKESGPRQ 

ILRSLVRLSVAAFAERNPVEELTVDSPPVQQISQLLSLLHQGQFQPKPNHRGNKYLAKPGGS 

RSAIPDTDGPSARAGGQTDPEQEEGPLDPEEDLSVKQLLEEELSSLLDPSTGLALDRLSAPD 

PAWMARLS LPLTTNYRDNVI S PDAAATEEPRTFQTFGKAEAPELS PTGTRLiAST FVSEMSSL 

LEMLLEQRSSMPVEAASEALRRLSVCGRTLSLDLATSAASGMKVQGDPGGKTGTEGKSRGSS 

SSSRCL 

Inportant features: 
Signal peptide: 
amino acids 1-13 

Transmembrane domain: 
amino acids 719-739 

N-glycosylation site. 

amino acids 415-418, 582-585, 659-662, 662-665 amd 857-860 

Cadherins extracellular repeated domain signature. 

amino acids 123-133, 232-242, 340-350, 448-458 and 553-563 
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FIGURE 172 

CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGG 
CAGACCGTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAG 
GCAGGAGCCTTCCTTACACTTCGCCAISAGTTTCCTCATCGACTCCAGCATCATGATTACCT 
CCCAGATACTATTTTTTGGATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTAT 
GAGATACGTCAGTATGTTGTACAGGTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCAT 
GTTTGAGCTCATCATCTTTGAAATCTTAGGAGTATTGAATAGCAGCTCCCGTTATTTTCACT 
GGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGGTTTTCATGGTGCCTTTTTACATTGGC 
TATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGACTGCTTTTTTCCTGTCTCTT 
ATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCCCATTCTCAGCCCAA 
AACATGGGATCTTATCCATAGAACAGCTCATCAGCCGGGTTGGTGTGATTGGAGTGACTCTC 
ATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTCCT 
CAGGAATGTGACTGACACGGATATTCTAGCCCTGGAACGGCGACTGCTGCAAACCATGGATA 
TGATCATAAGCAAAAAGAAAAGGATGGCAATGGCACGGAGAACAATGTTCCAGAAGGGGGAA 
GTGCATAACAAACCATCAGGTTTCTGGGGAATGATAAAAAGTGTTACCACTTCAGCATCAGG 
AAGTGAAAATCTTACTCTTATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGC 
TTTTTCTGGAAACAGCTGATCTATATGCTACCAAGGAGAGAATAGAATACTCCAAAACCTTC 
AAGGGGAAATATTTTAATTTTCTTGGTTACTTTTTCTCTATTTACTGTGTTTGGAAAATTTT 
CATGGCTACCATCAATATTGTTTTTGATCGAGTTGGGAAAACGGATCCTGTCACAAGAGGCA 
TTGAGATCACTGTGAATTATCTGGGAATCCAATTTGATGTGAAGTTTTGGTCCCAACACATT 
TCCTTCATTCTTGTTGGAATAATCATCGTCACATCCATCAGAGGATTGCTGATCACTCTTAC 
CAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTGCTATTAGCAC 
AGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTTAGAA 
TACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTT 
TGATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAAC 
AGGCACCAGAGAAGCAAATGGCACCTlGakACTTAAGCCTACTACAGACTGTTAGAGGCCAGT 
GGTTTCAAAATTTAGATATAAGAGGGGGGAAAAATGGAACCAGGGCCTGACATTTTATAAAC 
AAACAAAATGCTATGGTAGCATTTTTCACCTTCATAGCATACTCCTTCCCCGTCAGGTGATA 
CTATGACCATGAGTAGCATCAGCCAGAACATGAGAGGGAGAACTAACTCAAGACAATACTCA 
GCAGAGAGCATCCCGTGTGGATATGAGGCTGGTGTAGAGGCGGAGAGGAGCCAAGAAACTAA 
AGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGTCTATGGTAGCTGAGCCAAACACGT 
AGGATTTCCGTTTTAAGGTTCACATGGAAAAGGTTATAGCTTTGCCTTGAGATTGACTCATT 
AAAATCAGAGACTGTAACAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCGACTCTAGAGTCG 
ACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 
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FIGURE 173 



MSFLIDSSIMITSQILFFG FGWL FFMRQL FKD YE IRQYWQVI FS VT FAFSCTMFELI I FE I 
LGVLNSSSRYFHWKMNLCVILLILVFMVPFYIGYFIVSNIRLLHKQRLLFSCLLWLTFMYFF 
WKLGDP FP I LS PKHG I LS I EQLI SRVGVIGVTLMALLSGFGAVNCPYTYMS YFLRNVTDTDI 
LALERRLLQTMDMI I SKKKRMAMARRTMFQKGEVHNKPSGFWGMIKSVTTSASGSENLTLIQ 
QEVDALEELSRQLFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKIEMATINIVF 
DRVGKTDPVTRGIEITVNYLGIQFDVKFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISS 
S KS SNV I VLLLAQ IMGM Y FVS SVLL I RMSMPLE YRT 1 1 TEVLGELQFNFYHRWFDVI FLVSA 
LS S I L FL YLAHKQAPEKQMAP 

Important features: 
Signal peptide: 
amino acids 1-23 

Potential transmembrane domains: 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin-binding proteins 
amino acids 151-160 
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FIGURE 174 

CATGGGAAGTGGAGCCGGAGCCTTCCTTACACTCGCCATGAGTTTCCTCATCGACTCCAGCA 
TCATGATTACCTCCCNGANACTATTTTTTGGATTTGGGTGGCTTTTCTTCNGCGCCAATGTT 
TAAAGACTATGAGATACGTCAGTATGTTGTACNGGTGATCTTCTCCGTGACGTTTGCCATTT 
CTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTNGGAGTATTGAATAGCAGCTCCCGT 
TATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGGTTNTCATGGTGCCTTT 
TTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGACTGCTTTTTT 
CCTGTCTCTTATGGCTGACCTTTATGTATTTCCAG 
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GTGTTGCCCTTGGGGAGGGGAAGGGGAGCCNGGCCCTTTCCTAAAATTTGGCCAAGGGTTTC 
TTTNTTGAATTCCGGGTTNNGNATACCTTCCCAGAAAATATTTTTTGGATTTGGGGTAGNTT 
TTTTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAGGTGATNTT 
NTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATNTTTGAAATNTTAGGAG 
TATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATC 
CTGGTTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCA 
TAAACAACGACTGCTTTTTTCCTGTCTNTTATGGCTGACCTTTATGTATTTNTTNTGGAAAN 
TAGGAGATCCCTTTCCCATTCTC 
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FIGURE 176A 

CTCGCGCAGGGATCGTCCCM5GCCGGGGCTCGGAGCCGCGACCCTTGGGGGGCCTCCGGGA 
TTTGCTACCTTTTTGGCTCCCTGCTCGTCGAACTGCTCTTCTCACGGGCTGTCGCCTTCAAT 
CTGGACGTGATGGGTGCCTTGCGCAAGGAGGGCGAGCCAGGCAGCCTCTTCGGCTTCTCTGT 
GGCCCTGCACCGGCAGTTGCAGCCCCGACCCCAGAGCTGGCTGCTGGTGGGTGCTCCCCAGG 
CCCTGGCTCTTCCTGGGCAGCAGGCGAATCGCACTGGAGGCCTCTTCGCTTGCCCGTTGAGC 
CTGGAGGAGACTGACTGCTACAGAGTGGACATCGACCAGGGAGCTGATATGCAAAAGGAAAG 
CAAGGAGAACCAGTGGTTGGGAGTCAGTGTTCGGAGCCAGGGGCCTGGGGGCAAGATTGTTA 
CCTGTGCACACCGATATGAGGCAAGGCAGCGAGTGGACCAGATCCTGGAGACGCGGGATATG 
ATTGGTCGCTGCTTTGTGCTCAGCCAGGACCTGGCCATCCGGGATGAGTTGGATGGTGGGGA 
ATGGAAGTTCTGTGAGGGACGCCCCCAAGGCCATGAACAZ^TTTGGGTTCTGCCAGCAGGGCA 
CAGCTGCCGCCTTCTCCCCTGATAGCCACTACCTCCTCTTTGGGGCCCCAGGAACCTATAAT 
TGGAAGGGCACGGCCAGGGTGGAGCTCTGTGCACAGGGCTCAGCGGACCTGGCACACCTGGA 
CGACGGTCCCTACGAGGCGGGGGGAGAGAAGGAGCAGGACCCCCGCCTCATCCCGGTCCCTG 
CCAACAGCTACTTTGGCTTCTCTATTGACTCGGGGAAAGGTCTGGTGCGTGCAGAAGAGCTG 
AGCTTTGTGGCTGGAGCCCCCCGCGCCAACCACAAGGGTGCTGTGGTCATCCTGCGCAAGGA 
CAGCGCCAGTCGCCTGGTGCCCGAGGTTATGCTGTCTGGGGAGCGCCTGACCTCCGGCTTTG 
GCTACTCACTGGCTGTGGCTGACCTCAACAGTGATGGCTGGCCAGACCTGATAGTGGGTGCC 
CCCTACTTCTTTGAGCGCCAAGAAGAGCTGGGGGGTGCTGTGTATGTGTACTTGAACCAGGG 
GGGTCACTGGGCTGGGATCTCCCCTCTCCGGCTCTGCGGCTCCCCTGACTCCATGTTCGGGA 
TCAGCCTGGCTGTCCTGGGGGACCTCAACCAAGATGGCTTTCCAGATATTGCAGTGGGTGCC 
CCCTTTGATGGTGATGGGAAAGTCTTCATCTACCATGGGAGCAGCCTGGGGGTTGTCGCCAA 
ACCTTCACAGGTGCTGGAGGGCGAGGCTGTGGGCATCAAGAGCTTCGGCTACTCCCTGTCAG 
GCAGCTTGGATATGGATGGGAACCAATACCCTGACCTGCTGGTGGGCTCCCTGGCTGACACC 
GCAGTGCTCTTCAGGGCCAGACCCATCCTCCATGTCTCCCATGAGGTCTCTATTGCTCCACG 
AAGCATCGACCTGGAGCAGCCCAACTGTGCTGGCGGCCACTCGGTCTGTGTGGACCTAAGGG 
TCTGTTTCAGCTACATTGCAGTCCCCAGCAGCTATAGCCCTACTGTGGCCCTGGACTATGTG 
TTAGATGCGGACACAGACCGGAGGCTCCGGGGCCAGGTTCCCCGTGTGACGTTCCTGAGCCG 
TAACCTGGAAGAACCCAAGCACCAGGCCTCGGGCACCGTGTGGCTGAAGCACCAGCATGACC 
GAGTCTGTGGAGACGCCATGTTCCAGCTCCAGGAAAATGTCAAAGACAAGCTTCGGGCCATT 
GTAGTGACCTTGTCCTACAGTCTCCAGACCCCTCGGCTCCGGCGACAGGCTCCTGGCCAGGG 
GCTGCCTCCAGTGGCCCCCATCCTCAATGCCCACCAGCCCAGCACCCAGCGGGCAGAGATCC 
ACTTCCTGAAGCAAGGCTGTGGTGAAGACAAGATCTGCCAGAGCAATCTGCAGCTGGTCCAC 
GCCCGCTTCTGTACCCGGGTCAGCGACACGGAATTCCAACCTCTGCCCATGGATGTGGATGG 
AACAACAGCCCTGTTTGCACTGAGTGGGCAGCCAGTCATTGGCCTGGAGCTGATGGTCACCA 
ACCTGCCATCGGACCCAGCCCAGCCCCAGGCTGATGGGGATGATGCCCATGAAGCCCAGCTC 
CTGGTCATGCTTCCTGACTCACTGCACTACTCAGGGGTCCGGGCCCTGGACCCTGCGGAGAA 
GCCACTCTGCCTGTCCAATGAGAATGCCTCCCATGTTGAGTGTGAGCTGGGGAACCCCATGA 
AGAGAGGTGCCCAGGTCACCTTCTACCTCATCCTTAGCACCTCCGGGATCAGCATTGAGACC 
ACGGAACTGGAGGTAGAGCTGCTGTTGGCCACGATCAGTGAGCAGGAGCTGCATCCAGTCTC 
TGCACGAGCCCGTGTCTTCATTGAGCTGCCACTGTCCATTGCAGGAATGGCCATTCCCCAGC 
AACTCTTCTTCTCTGGTGTGGTGAGGGGCGAGAGAGCCATGCAGTCTGAGCGGGATGTGGGC 
AGCAAGGTCAAGTATGAGGTCACGGTTTCCAACCAAGGCCAGTCGCTCAGAACCCTGGGCTC 
TGCCTTCCTCAACATCATGTGGCCTCATGAGATTGCCAATGGGAAGTGGTTGCTGTACCCAA 
TGCAGGTTGAGCTGGAGGGCGGGCAGGGGCCTGGGCAGAAAGGGCTTTGCTCTCCCAGGCCC 
AACATCCTCCACCTGGATGTGGACAGTAGGGATAGGAGGCGGCGGGAGCTGGAGCCACCTGA 
GCAGCAGGAGCCTGGTGAGCGGCAGGAGCCCAGCATGTCCTGGTGGCCAGTGTCCTCTGCTG 
AGAAGAAGAAAAACATCACCCTGGACTGCGCCCGGGGCACGGCCAACTGTGTGGTGTTCAGC 
TGCCCACTCTACAGCTTTGACCGCGCGGCTGTGCTGCATGTCTGGGGCCGTCTCTGGAACAG 
CACCTTTCTGGAGGAGTACTCAGCTGTGAAGTCCCTGGAAGTGATTGTCCGGGCCAACATCA 
CAGTGAAGTCCTCCATAAAGAACTTGATGCTCCGAGATGCCTCCACAGTGATCCCAGTGATG 
GTATACTTGGACCCCATGGCTGTGGTGGCAGAAGGAGTGCCCTGGTGGGTCATCCTCCTGGC 
TGTACTGGCTGGGCTGCTGGTGCTAGCACTGCTGGTGCTGCTCCTGTGGAAGATGGGATTCT 
TCAAACGGGCGAAGCACCCCGAGGCCACCGTGCCCCAGTACCATGCGGTGAAGATTCCTCGG 
GAAGACCGACAGCAGTTCAAGGAGGAGAAGACGGGCACCATCCTGAGGAACAACTGGGGCAG 
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FinURE 176B 



CCCCCGGCGGGAGGGCCCGGATGCACACCCCATCCTGGCTGCTGACGGGCATCCCGAGCTGG 
GCCCCGATGGGCATCCAGGGCCAGGCACCGCCIi^GTTCCCATGTCCCAGCCTGGCCTGTGG 
CTGCCCTCCATCCCTTCCCCAGAGATGGCTCCTTGGGATGAAGAGGGTAGAGTGGGCTGCTG 
GTGTCGCATCAAGATTTGGCAGGATCGGCTTCCTCAGGGGCACAGACCTCTCCCACCCACAA 
GAACTCCTCCCACCCAACTTCCCCTTAGAGTGCTGTGAGATGAGAGTGGGTAAATCAGGGAC 
AGGGCCATGGGGTAGGGTGAGAAGGGCAGGGGTGTCCTGATGCAAAGGTGGGGAGAAGGGAT 
CCTAATCCCTTCCTCTCCCATTCACCCTGTGTAACAGGACCCCAAGGACCTGCCTCCCCGGA 
AGTGCCTTAACCTAGAGGGTCGGGGAGGAGGTTGTGTCACTGACTCAGGCTGCTCCTTCTCT 
AGTTTCCCCTCTCATCTGACCTTAGTTTGCTGCCATCAGTCTAGTGGTTTCGTGGTTTCGTC 
TATTTATTAAAAAATATTTGAGAACAAAAAAAAAAAAAAAAAAAA 
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FIGURE 177 

></usr/seqcib2/sst/DNA/Dnaseqs.min/ss.DNA55737 
xsubunit 1 of 1, 1141 aa, 1 stop 
XMW: 124671, pi: 5.82, NX(S/T): 5 

MAGARSRDPWGASGICYLFGSLLVELLFSRAVAFNLDVMGALRKEGEPGSLFGFSVALHRQL 
QPRPQSWLLVGAPQALALPGQQANRTGGLFACPLSLEETDCYRVDIDQGAEWQKESKENQWL 
GVSVRSQGPGGKIVTCAHRYEARQRVDQILETRDMIGRCFVLSQDLAIRDELDGGEWKFCEG 
RPQGHEQFGFCQQGTAAAFSPDSHYLLFGAPGTYNWKGTARVELCAQGSADLAHLDDGPYEA 
GGEKEQDPRL I PVPANS YFGFS IDSGKGLVRAEELS FVAGAPRANHKGAWILRKDSASRLV 
PEVMLSGERLTSGFGYSLAVADLNSDGWPDLIVGAPYFFERQEELGGAVYVYLNQGGHWAGI 
S PLRLCGS PDSMFG I SLAVLGDLNQDGFPDIAVGAPFDGDGKVFI YHGSSLGWAKPSQVLE 
GEAVGIKSFGYSLSGSLDMDGNQYPDLLVGSLADTAVLFRARPILHVSHEVSIAPRSIDLEQ 
PNCAGGHSVCVDLRVCFSYIAVPSSYSPTVALDYVLDADTDRRLRGQVPRVTFLSRNLEEPK 
HQASGTVWLKHQHDRVCGDAMFQLQENVKDKLRAIWTLSYSLQTPRLRRQAPGQGLPPVAP 
ILNAHQPSTQRAEIHFLKQGCGEDKICQSNLQLVHARFCTRVSDTEFQPLPMDVDGTTALFA 
LSGQPVIGLELMVTNLPSDPAQPQADGDDAHEAQLLVMLPDSLHYSGVRALDPAEKPLCLSN 
ENASHVECELGNPMKRGAQVTFYLILSTSGISIETTELEVELLLATISEQELHPVSARARVF 
lELPLSIAGMAIPQQLFFSGWRGERAMQSERDVGSKVKYEVTVSNQGQSLRTLGSAFLNIM 
WPHEIANGKWLLYPMQVELEGGQGPGQKGLCSPRPNILHLDVDSRDRRRRELEPPEQQEPGE 
RQEPSMSWWPVSSAEKKKNITLDCARGTANCWFSCPLYSFDRAAVLHVWGRLWNSTFLEEY 
SAVKS LEVI VRANI TVKS S I KNLMLRDASTVI PVMVYLDPMAWAEGVPWWVI LLAVLAGLL 
VLALLVLLLWKMGFFKRAKHPEATVPQYHAVKIPREDRQQFKEEKTGTILRNNWGSPRREGP 
DAHPILAADGHPELGPDGHPGPGTA 



In^rtant features: 
Signal peptide: 

amino acids 1-33 



Transmembrane domain: 
amino acids 1039-1064 



N-glycosylation sites. 

amino acids 86-89, 746-749, 949-952, 985-988 and 1005-1008 



Integrins alpha chain proteins. 

amino acids 1064-1071, 384-408, 1041-1071, 317-346, 443-465, 385- 
407, 215-224, 634-647, 85-99, 322-346, 470-479, 442-466, 379-408 
and 1031-1047 
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FIflURE 178 



CGCGCCGGGCGCAGGGAGCTGAGTGGACGGCTCGAGACGGCGGCGCGTGCAGCAGCTCCAGA 

AAGCAGCGAGTTGGCAGAGCAGGGCTGCATTTCCAGCAGGAGCTGCGAGCACAGTGCTGGCT 

CACAACAAGMfiCTCAAGGTGTCAGCCGTACTGTGTGTGTGTGCAGCCGCTTGGTGCAGTCA 

GTCTCTCGCAGCTGCCGCGGCGGTGGCTGCAGCCGGGGGGCGGTCGGACGGCGGTAATTTTC 

TGGATGATAAACAATGGCTCACCACAATCTCTCAGTATGACAAGGAAGTCGGACAGTGGAAC 

AAAT TC CGAGACGAAGTAGAGGATGATTATTTCCGCACTTGGAGTCCAGGAAAACCCTTCGA 

TCAGGCTTTAGATCCAGCTAAGGATCCATGCTTAAAGATGAAATGTAGTCGCCATAAAGTAT 

GCATTGCTCAAGATTCTCAGACTGCAGTCTGCATTAGTCACCGGAGGCTTACACACAGGATG 

AAAGAAGCAGGAGTAGACCATAGGCAGTGGAGGGGTCCCATATTATCCACCTGCAAGCAGTG 

CCCAGTGGTCTATCCCAGCCCTGTTTGTGGTTCAGATGGTCATACCTACTCTTTTCAGTGCA 

AACTAGAATATCAGGCATGTGTCTTAGGAAAACAGATCTCAGTCAAATGTGAAGGACATTGC 

CCATGTCCTTCAGATAAGCCCACCAGTACAAGCAGAAATGTTAAGAGAGCATGCAGTGACCT 

GGAGTTCAGGGAAGTGGCAAACAGATTGCGGGACTGGTTCAAGGCCCTTCATGAAAGTGGAA 

GTCAAAACAAGAAGACAAAAACATTGCTGAGGCCTGAGAGAAGCAGATTCGATACCAGCATC 

TTGCCAATTTGCAAGGACTCACTTGGCTGGATGTTTAACAGACTTGATACAAACTATGACCT 

GCTATTGGACCAGTCAGAGCTCAGAAGCATTTACCTTGATAAGAATGAACAGTGTACCAAGG 

CATTCTTCAATTCTTGTGACACATACAAGGACAGTTTAATATCTAATAATGAGTGGTGCTAC 

TGCTTCCAGAGACAGCAAGACCCACCTTGCCAGACTGAGCTCAGCAATATTCAGAAGCGGCA 

AGGGGTAAAGAAGCTCCTAGGACAGTATATCCCCCTGTGTGATGAAGATGGTTACTACAAGC 

CAACACAATGTCATGGCAGTGTTGGACAGTGCTGGTGTGTTGACAGATATGGAAATGAAGTC 

ATGGGATCCAGAATAAATGGTGTTGCAGATTGTGCTATAGATTTTGAGATCTCCGGAGATTT 

TGCTAGTGGCGATTTTCATGAATGGACTGATGATGAGGATGATGAAGACGATATTATGAATG 

ATGAAGATGAAATTGAAGATGATGATGAAGATGAAGGGGATGATGATGATGGTGGTGATGAC 

CATGATGTATACATTTSATTGATGACAGTTGAAATCAATAAATTCTACATTTCTAATATTTA 

CAAAAATGATAGCCTATTTAAAATTATCTTCTTCCCCAATAACAAAATGATTCTAAACCTCA 

CATATATTTTGTATAATTATTTGAAAAATTGCAGCTAAAGTTATAGAACTTTATGTTTAAAT 

AAGAATCATTTGCTTTGAGTTTTTATATTCCTTACACAAAAAGAAAATACATATGCAGTCTA 

GTCAGACAAAATAAAGTTTTGAAGTGCTACTATAATAAATTTTTCACGAGAACAAACTTTGT 

AAATCTTCCATAAGCAAAATGACAGCTAGTGCTTGGGATCGTACATGTTAATTTTTTGAAAG 

ATAATTCTAAGTGAAATTTAAAATAAATAAATTTTTAATGACCTGGGTCTTAAGGATTTAGG 

AAAAATATGCATGCTTTAATTGCATTTCCAAAGTAGCATCTTGCTAGACCTAGATGAGTCAG 

GATAACAGAGAGATACCACATGACTCCAAAAAAAAAAAAAAA 
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FIGURE 179 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA49829 
xsubunit 1 of 1, 436 aa, 1 stop 
><MW: 49429, pi: 4.80, NX(S/T): 0 

MLKVSAVLCVCAAAWCSQSLAAAAAVAAAGGRSDGGNFLDDKQWLTTISQYDKEVGQWNKFR 
DEVEDDYFRTWS PGKP FDQALDPAKDPCLKMKCSRHKVCI AQDSQTAVC I S HRRLTHRJMKEA 
GVDHRQWRGPILSTCKQCPWYPSPVCGSDGHTYSFQCKLEYQACVLGKQISVKCEGHCPCP 
S DKPTS TS RNVKRACSDLE FREVANRLRDWFKALHESGSQNKKTKTLLRPERS RFDTS ILP I 
CKDSLGWMFNRLDTNYDLLLDQSELRSIYLDKNEQCTKAFFNSCDTYKDSLISNNEWCYCFQ 
RQQDPPCQTELSNIQKRQGVKKLLGQYIPLCDEDGYYKPTQCHGSVGQCWCVDRYGNEVMGS 
RINGVADCAI DFE I SGDFASGDFHEWTDDEDDEDD IMNDEDE lEDDDEDEGDDDDGGDDHDVY I 

In^sortant features: 
Signal peptide: 

amino acids 1-16 

Leucine zipper pattern, 
amino acids 246-267 

N-myristoylation sites. 

amino acids 357-362, 371-376 and 376-381 

Thyroglobulin type-1 repeat proteins 
amino acids 353-365 and 339-352 
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FinURE 180A 

CAGACTCCAGATTTCCCTGTCAACCACGAGGAGTCCAGAGAGGAAACGCGGAGCGGAGACAA 

CAGTACCTGACGCCTCTTTCAGCCCGGGATCGCCCCAGCAGGGMSGGCGACAAGATCTGGC 

TGCCCTTCCCCGTGCTCCTTCTGGCCGCTCTGCCTCCGGTGCTGCTGCCTGGGGCGGCCGGC 

TTCACACCTTCCCTCGATAGCGACTTCACCTTTACCCTTCCCGCCGGCCAGAAGGAGTGCTT 

CTACCAGCCCATGCCCCTGAAGGCCTCGCTGGAGATCGAGTACCAAGTTTTAGATGGAGCAG 

GATTAGATATTGATTTCCATCTTGCCTCTCCAGAAGGCAAAACCTTAGTTTTTGAACAAAGA 

AAATCAGATGGAGTTCACACTGTAGAGACTGAAGTTGGTGATTACATGTTCTGCTTTGACAA 

TACATTCAGCACCATTTCTGAGAAGGTGATTTTCTTTGAATTAATCCTGGATAATATGGGAG 

AACAGGCACAAGAACAAGAAGATTGGAAGAAATATATTACTGGCACAGATATATTGGATATG 

AAACTGGAAGACATCCTGGAATCCATCAACAGCATCAAGTCCAGACTAAGCAAAAGTGGGCA 

CATACAAATTCTGCTTAGAGCATTTGAAGCTCGTGATCGAAACATACAAGAAAGCAACTTTG 

ATAGAGTCAATTTCTGGTCTATGGTTAATTTAGTGGTCATGGTGGTGGTGTCAGCCATTCAA 

GTTTATATGCTGAAGAGTCTGTTTGAAGATAAGAGGAAAAGTAGAACTiaaAACTCCAAACT 

AGAGTACGTAACATTGAAAAATGAGGCATAAAAATGCAATAAACTGTTACAGTCAAGACCAT 

TAATGGTCTTCTCCAAAATATTTTGAGATATAAAAGTAGGAAACAGGTATAATTTTAATGTG 

AAAATTAAGTCTTCACTTTCTGTGCAAGTAATCCTGCTGATCCAGTTGTACTTAAGTGTGTA 

ACAGGAATATTTTGCAGAATATAGGTTTAACTGAATGAAGCCATATTAATAACTGCATTTTC 

CTAACTTTGAAAAATTTTGCAAATGTCTTAGGTGATTTAAATAAATGAGTATTGGGCCTAAT 

TGCAACACCAGTCTGTTTTTAACAGGTTCTATTACCCAGAACTTTTTTGTAAATGCGGCAGT 

TACAAATTAACTGTGGAAGTTTTCAGTTTTAAGTTATAAATCACCTGAGAATTACCTAATGA 

TGGATTGAATAAATCTTTAGACTACAAAAGCCCAACTTTTCTCTATTTACATATGCATCTCT 

CCTATAATGTAAATAGAATAATAGCTTTGAAATACAATTAGGTTTTTGAGATTTTTATAACC 

AAATACATTTCAGTGTAACATATTAGCAGAAAGCATTAGTCTTTGTACTTTGCTTACATTCC 

CAAAAGCTGACATTTTCACGATTCTTAAAAACACAAAGTTACACTTACTAAAATTAGGACAT 

GTTTTCTCTTTGAAATGAAGAATATAGTTTAAAAGCTTCCTCCTCCATAGGGACACATTTTC 

TCTAACCCTTAACTAAAGTGTAGGATTTTAAAATTAAATGTGAGGTAAAATAAGTTTATTTT 

TAATAGTATCTGTCAAGTTAATATCTGTCAACAGTTAATAATCATGTTATGTTAATTTTAAC 

ATGATTGCTGACTTGGATAATTCATTATTACCAGCAGTTATGAAGGAAATATTGCTAAAATG 

ATCTGGGCCTACCATAAATAAATATCTCCTTTTCTGAGCTCTAAGAATTATCAGAAAACAGG 

AAAGAATTTAGAAAAACTTGAGAAAACCTAATCCAAAATAAAATTCACTTAAGTAGAACTAT 

AAATAAATATCTAGAATCTGACTGGCTCATCATGACATCCTACTCATAACATAAATCAAAGG 

AGATGATTAATTTCCAGTTAGCTGGAAGAAACTTTGGCTGTAGGTTTTTATTTTCTACAAGA 

ATTCTGGTTTGAATTATTTTTGTAAGCAGGTACATTTTATAAAATGTAAGCCCTACTGTAAG 

GTTTAGCACTGGGTGTACATATTTATTAAAAATTTTTATTATAACAACTTTTATTAAAATGG 

CCTTTCTGAACACTTTATTTATTGATGTTGAAGTAAGGATTAGAAACATAGACTCCCAAGTT 

TTAAACACCTAAATGTGAATAACCCATATATACAACAAAGTTTCTGCCATCTAGCTTTTTGA 

AGTCTATGGGGGTCTTACTCAAGTACTAGTAATTTAACTTCATCATGAATGAACTATAATTT 

TTAAGTTATGCCCATTTATAACGTTGTTTATGACTACATTGTGAGTTAGAAACAAACTTAAA 

ATTTGGGGTATAGAACCCCTCAACAGGTTAGTAATGCTGGAATTCTTGATGAGCAATAATGA 

TAACCAGAGAGTGATTTCATTTACACTCATAGTAGTATAAAAAGAGATACATTTCCCTCTTA 

GGCCCCTGGGAGAAGAGCAGCTTAGATTTCCCTACTGGCAAGGTTTTTAAAAATGAGGTAAA 

TGCCGTATATGATCAATTACCTTAATTGGCCAAGAAAATGCTTCAGGTGTCTAGGGGTATCC 

TCTGCAACACTTGCAGAACAAAGGTCAATAAGATCCTTGCCTATGAATACCCCTCCCTTTTG 

CGCTGTTAAATTTGCAATGAGAAGCAAATTTACAGTACCATAACTAATAAAGCAGGGTACAG 

ATATAAACTACTGCATCTTTTCTATAAAACTGTGATTAAGAATTCTACCTCTCCTGTATGGC 

TGTTACTGTACTGTACTCTCTGACTCCTTACCTAACAATGAATTTGTTACATAATCTTCTAC 

ATGTATGATTTGTGCCACTGATCTTAAACCTATGATTCAGTAACTTCTTACCATATAAAAAC 

GATAATTGCTTTATTTGGAAAAGAATTTAGGAATACTAAGGACAATTATTTTTATAGACAAA 

GTAAAAAGACAGATATTTAAGAGGCATAACCAAAAAAGCAAAACTTGTAAACAGAGTAAAAA 

TCTTTAATATTTCTAAAGACATACTGTTTATCTGCTTCATATGCTTTTTTTAATTTCACTAT 

TCCATTTCTAAATTAAAGTTATGCTAAATTGAGTAAGCTGTTTATCACTTAACAGCTCATTT 

TGTCTTTTTCAATATACAAATTTTAAAAATACTACAATATTTAACTAAGGCCCAACCGATTT 

CCATAATGTAGCAGTTACCGTGTTCACCTCACACTAAGGCCTAGAGTTTGCTCTGATATGCA 

TTTGGATGATTAATGTTATGCTGTTCTTTCATGTGAATGTCAAGACATGGAGGGTGTTTGTA 
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FIGURE laOR 

ATTTTATGGTAAAATTAATCCTTCTTACACATAATGGTGTCTTAAAATTGACAAAAAATGAG 

CACTTACAATTGTATGTCTCCTCAAATGAAGATTCTTTATGTGAAATTTTAAAAGACATTGA 

TTCCGCATGTAAGGATTTTTCATCTGAAGTACAATAATGCACAATCAGTGTTGCTCAAACTG 

CTTTATACTTATAAACAGCCATCTTAAATAAGCAACGTATTGTGAGTACTGATATGTATATA 
ATAAAAATTATCAAAGGAAAA 
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FIRURE 181 

></usr/seqdb2/sst/DNA/Dnaseqs.min/ss .DNA52196 
xsubunit 1 of 1, 229 aa, 1 stop 
XMW: 26017, pi: 4.73, NX(S/T): 0 

MGDKIWLPFPVLLLAALPPVLLPGAAGFTPSLDSDFTFTLPAGQKECFYQPMPLKASLEIEY 
QVLDGAGLDIDFHLASPEGKTLVFEQRKSDGVHTVETEVGDYMFCFDNTFSTISEKVIFFEL 
ILDNMGEQAQEQEDWKKYITGTDILDMKLEDILESINSIKSRLSKSGHIQILLRAFEARDRN 
IQESNFDRVNFWSMVNLWMWVSAIQVYMLKSLFEDKRKSRT 

Important features: 
Signal peptide: 
amino acids 1-23 

Transmembrane domain: 
amino acids 195-217 

N-royristoylation site, 
amino acids 4 3-48 

Tyrosine kinase phosphorylation site, 
amino acids 55-62 
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FIGURE 18? 

CCATCCCTGAGATCTTTTTATAAAAAACCCAGTCTTTGCTGACCAGACAAAGCATACCAGAT 
CTCACCAGAGAGTCGCAGACACT^CTGCCTCCCATGGCCCTGCCCAGTGTGTCCTGGATG 
CTGCTTTCCTGCCTCATTCTCCTGTGTCAGGTTCAAGGTGAAGAAACCCAGAAGGAACTGCC 
CTCTCCACGGATCAGCTGTCCCAAAGGCTCCAAGGCCTATGGCTCCCCCTGCTATGCCTTGT 
TTTTGTCACCAAAATCCTGGATGGATGCAGATCTGGCTTGCCAGAAGCGGCCCTCTGGAAAA 
CTGGTGTCTGTGCTCAGTGGGGCTGAGGGATCCTTCGTGTCCTCCCTGGTGAGGAGCATTAG 
TAACAGCTACTCATACATCTGGATTGGGCTCCATGACCCCACACAGGGCTCTGAGCCTGATG 
GAGATGGATGGGAGTGGAGTAGCACTGATGTGATGAATTACTTTGCATGGGAGAAAAATCCC 
TCCACCATCTTAAACCCTGGCCACTGTGGGAGCCTGTCAAGAAGCACAGGATTTCTGAAGTG 
GAAAGATTATAACTGTGATGCAAAGTTACCCTATGTCTGCAAGTTCAAGGACiaSGGCAGGT 
GGGAAGTCAGCAGCCTCAGCTTGGCGTGCAGCTCATCATGGACATGAGACCAGTGTGAAGAC 
TCACCCTGGAAGAGAATATTCTCCCCAAACTGCCCTACCTGACTACCTTGTCATGATCCTCC 
TTCTTTTTCCTTTTTCTTCACCTTCATTTCAGGCTTTTCTCTGTCTTCCATGTCTTGAGATC 
TCAGAGAATAATAATAAAAATGTTACTTTATAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 183 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA56965 
<subunit 1 of 1, 175 aa, 1 stop 
<MW: 19330. pi,: 7.25, NX(S/T): 1 

MLPPKALPSVSWMLLSCLILLCQVQGEETQKELPSPRISCPKGSKAYGSPCYALFLSPKSWM 
DADLACQKRPSGKLVSVLSGAEGSrVSSLVRSISNSYSYIWIGLHDPTQGSEPDGDGWEWSS 
TDVMNYFAWEKNPSTILNPGHCGSLSRSTGFLKWKDYNCDAKLPYVCKFKD 

Important features: 
Signal peptide: 
amino acids 1-26 

C-type lectin domain signature, 
amino acids 146-171 
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FIGURE 184 



CCAGTCTGTCGCCACCTCACTTGGTGTCTGCTGTCCCCGCCAGGCAAGCCTGGGGTGAGAGC 
ACAGAGGAGTGGGCCGGGACC^CGGGGGACGCGGCTGGCGCTCCTGGCGCTGGTGCTGGC 
TGCCTGCGGAGAGCIGGCGCCGGCCCTGCGCTGCTACGTCTGTCCGGAGCCCACAGGAGTGT 
CGGACTGTGTCACCATCGCCACCTGCACCACCAACGAAACCATGTGCAAGACCACACTCTAC 
TCCCGGGAGATAGTGTACCCCTTCCAGGGGGACTCCACGGTGACCAAGTCCTGTGCCAGCAA 
GTGTAAGCCCTCGGATGTGGATGGCATCGGCCAGACCCTGCCCGTGTCCTGCTGCAATACTG 
AGCTGTGCAATGTAGACGGGGCGCCCGCTCTGAACAGCCTCCACTGCGGGGCCCTCACGCTC 
CTCCCACTCTTGAGCCTCCGACTGI&SAGTCCCCGCCCACCCCCATGGCCCTATGCGGCCCA 
GCCCCGAATGCCTTGAAGAAGTGCCCCCTGCACCAGGAAAAAAAAAAAAAAAAA 




BNS0OCID <WO _ 9946287 A2.L> 



wo <i9/4628I 




PCT/US99/05028 



FiRtIRE 18B 

</usr/seqdb2/sst/DNA/Dnaseqs.min/ss.DNA56405 
<subunit 1 of 1, 125 aa, 1 stop 
<MW: 13115, pi: 5.90, NX{S/T): 1 

MRGTRLALLALVLAACGELAPALRCYVCPEPTGVSDCVTIATCTTNETMCKTTLYSREIVYP 
FQGDSTVTKSCASKCKPSDVDGIGQTLPVSCCNTELCNVDGAPALNSLHCGALTLLPLLSLRL 

Important features : 
Signal peptide: 
amino acids 1-17 

N-glycosylation site, 
amino acids 46-49 
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FIGURE 18fi 

CTGCAGTCAGGACTCTGGGACCGCAGGGGGCTCCCGGACCCTGACTCTGCAGCCGAACCGGC 
ACGGTTTCGTGGGGACCCAGGCTTGCAAAGTGACGGTCATTTTCTCTTTCTTTCTCCCTCTT 
GAGTCCTTCTGAGMfiATGGCTCTGGGCGCAGCGGGAGCTACCCGGGTCTTTGTCGCGATGG 
TAGCGGCGGCTCTCGGCGGCCACCCTCTGCTGGGAGTGAGCGCCACCTTGAACTCGGTTCTC 
AATTCCAACGCTATCAAGAACCTGCCCCCACCGCTGGGCGGCGCTGCGGGGCACCCAGGCTC 
TGCAGTCAGCGCCGCGCCGGGAATCCTGTACCCGGGCGGGAATAAGTACCAGACCATTGACA 
ACTACCAGCCGTACCCGTGCGCAGAGGACGAGGAGTGCGGCACTGATGAGTACTGCGCTAGT 
CCCACCCGCGGAGGGGACGCAGGCGTGCAAATCTGTCTCGCCTGCAGGAAGCGCCGAAAACG 
CTGCATGCGTCACGCTATGTGCTGCCCCGGGAATTACTGCAAAAATGGAA.TATGTGTGTCTT 
CTGATCAAAATCATTTCCGAGGAGAAATTGAGGAAACCATCACTGAAAGCTTTGGTAATGAT 
CATAGCACCTTGGATGGGTATTCCAGAAGAACCACCTTGTCTTCAAAAATGTATCACACCAA 
AGGACAAGAAGGTTCTGTTTGTCTCCGGTCATCAGACTGTGCCTCAGGATTGTGTTGTGCTA 
GACACTTCTGGTCCAAGATCTGTAAACCTGTCCTGAAAGAAGGTCAAGTGTGTACCAAGCAT 
AGGAGAAAAGGCTCTCATGGACTAGAAATATTCCAGCGTTGTTACTGTGGAGAAGGTCTGTC 
TTGCCGGATACAGAAAGATCACCATCAAGCCAGTAATTCTTCTAGGCTTCACACTTGTCAGA 
GACAC2A&ACCAGCTATCCAAATGCAGTGAACTCCTTTTATATAATAGATGCTATGAAAACC 
TTTTATGACCTTCATCAACTCAATCCTAAGGATATACAAGTTCTGTGGTTTCAGTTAAGCAT 
TCCAATAACACCTTCCAAAAACCTGGAGTGTAAGAGCTTTGTTTCTTTATGGAACTCCCCTG 
TGATTGCAGTAAATTACTGTATTGTAAATTCTCAGTGTGGCACTTACCTGTAAATGCAATGA 
AACTTTTAATTATTTTTCTAAAGGTGCTGCACTGCCTATTTTTCCTCTTGTTATGTAAATTT 
TTGTACACATTGATTGTTATCTTGACTGACAAATATTCTATATTGAACTGAAGTAAATCATT 
TCAGCTTATAGTTCTTAAAAGCATAACCCTTTACCCCATTTAATTCTAGAGTCTAGAACGCA 
AGGATCTCTTGGAATGACAAATGATAGGTACCTAAAATGTAACATGAAAATACTAGCTTATT 
TTCTGAAATGTACTATCTTAATGCTTAAATTATATTTCCCTTTAGGCTGTGATAGTTTTTGA 
AATAAAATTTAACATTTAAAAAAAAAAAAA 
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FIGURE 187 



</usr/seqclb2/sst/DNA/Dnaseqs .min/ss .DNA57530 
<subunit 1 of 1, 266 aa, 1 stop 
<MW: 28672, pi: 8.85, NX(S/T): 1 

MMALGAAGATRVFVAMVAAALGGHPLLGVSATLNSVLNSNAIKNLPPPLGGAAGHPGSAVSA 
APGILYPGGNKYQTIDNYQPYPCAEDEECGTDEYCASPTRGGDAGVQICIACRKRRKRCMRH 
AMCCPGNYCKNGICVSSDQNHFRGEIEETITESFGNDHSTLDGYSRRTTLSSKMYHTKGQEG 
SVCLRSSDCASGLCCARHFWSKICKPVLKEGQVCTKHRRKGSHGLEIFQRCYCGEGLSCRIQ 

KDHHQASNS SRLHTCQRH 

Important features: 
Signal peptide: 
amino acids 1-23 

N-glycosylation site* 
amino acids 256-259 

Fungal Zn(2)-Cys(6) binuclear cluster domain 
amino acids 110-126 
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FIGURE 188 

TGTGTTTCCCTGCAGTCAGAATTTGGGACNGCAGGGGTTCCCGGACCTGATTTTGCAGCGGA 

ACGGGAAGGTTTTGTGGGACCCAGGTTGAAATGACGGTCATTTTTTTTTCTTTCTCCTTCNG 

GAGTCCTTNTGAGANGATGGTTTTGGGCGCAGCGGGAGCTAACCCGGTTTTTTGTNGCGATG 

GTAGCGGCGGTTTTCGGCGGCCACCTTNTGCTGGGAGTGAGCGCCACCTTGAATCGGTTTTC 

AATTCCAACGNTATCAAGAACCTGCCCCCACCGNTGGGCGGCGCTGCGGGGCACCCAGGNTT 

TGCAGTCAGCGCCGCGCCGGGAATCCTGTACCCGGGCGGGAATAAGTACCAGACCATTGACA 

ATTACCAGCCGTACCCGTGCGCAGAGGACGAGGAGTGCGGCACTGATGAGTACTGCGCTAGT 

CCCACCCGCGGAGGGGANGCGGGCGTGCAAATNTGTNTNGCCTGCAGGAAGCGCCGAAAACG 

CTGCATGCGTCANGCTATGTGCTGCCCCGGGAATTACTGCAAAAATGGAATATGTGTGTNTT 

CTGATCAAAATCATTTCCGAGGAGAAATTGAGGAAACCATCACTGAAAGCTTTGGTAATGAT 
CATAGCACCTTGGATGGG 
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FIGURE 189A 

GAGGARCCTACCGGTACCGGCCGCGCGCTGGTAGTCGCCGGTGTGGCTGCACCTCACCAATC 

CCGTGCGCCGCGGCTGGGCCGTCGGAGAGTGCGTGTGCTTCTCTCCTGCACGCGGTGCTTGG 

GCTCGGCCAGGCGGGGTCCGCCGCCAGGGTTTGAGGATGGGGGAGTAGCTACAGGAAGCGAC 

CCCGCGATGGCAAGGTATATTTTTGTGGAATGAAAAGGAAGTATTAGAAATGAGCTGAAGAC 

CATTCACAGATTAATATTTTTGGGGACAGATTTGTGATGCTTGATTCACCCTTGAAGTAATG 

TAGACAGAAGTTCTCAAATTTGCATATTACATCAACTGGAACCAGCAGTGAATCTTAATGTT 

CACTTAAATCAGAACTTGCATAAGAAAGAGAMfiGGAGTCTGGTTAAATAAAGATGACTATA 

TCAGAGACTTGAAAAGGATCATTCTCTGTTTTCTGATAGTGTATATGGCCATTTTAGTGGGC 

ACAGATCAGGATTTTTACAGTTTACTTGGAGTGTCCAAAACTGCAAGCAGTAGAGAAATAAG 

ACAAGCTTTCAAGAAATTGGCATTGAAGTTACATCCTGATAAAAACCCGAATAACCCAAATG 

CACATGGCGATTTTTTAAAAATAAATAGAGCATATGAAGTACTCAAAGATGAAGATCTACGG 

AAAAAGTATGACAAATATGGAGAAAAGGGACTTGAGGATAATCAAGGTGGCCAGTATGAAAG 

CTGGAACTATTATCGTTATGATTTTGGTATTTATGATGATGATCCTGAAATCATAACATTGG 

AAAGAAGAGAATTTGATGCTGCTGTTAATTCTGGAGAACTGTGGTTTGTAAATTTTTACTCC 

CCAGGCTGTTCACACTGCCATGATTTAGCTCCCACATGGAGAGACTTTGCTAAAGAAGTGGA 

TGGGTTACTTCGAATTGGAGCTGTTAACTGTGGTGATGATAGAATGCTTTGCCGAATGAAAG 

GAGTCAACAGCTATCCCAGTCTCTTCATTTTTCGGTCTGGAATGGCCCCAGTGAAATATCAT 

GGAGACAGATCAAAGGAGAGTTTAGTGAGTTTTGCAATGCAGCATGTTAGAAGTACAGTGAC 

AGAACTTTGGACAGGAAATTTTGTCAACTCCATACAAACTGCTTTTGCTGCTGGTATTGGCT 

GGCTGATCACTTTTTGTTCAAAAGGAGGAGATTGTTTGACTTCACAGACACGACTCAGGCTT 

AGTGGCATGTTGTTTCTCAACTCATTGGATGCTAAAGAAATATATTTGGAAGTAATACATAA 

TCTTCCAGATTTTGAACTACTTTCGGCAAACACACTAGAGGATCGTTTGGCTCATCATCGGT 

GGCTGTTATTTTTTCATTTTGGAAAAAATGAAAATTCAAATGATCCTGAGCTGAAAAAACTA 

AAAACTCTACTTAAAAATGATCATATTCAAGTTGGCAGGTTTGACTGTTCCTCTGCACCAGA 

CATCTGTAGTAATCTGTATGTTTTTCAGCCGTCTCTAGCAGTATTTAAAGGACAAGGAACCA 

AAGAATATGAAATTCATCATGGAAAGAAGATTCTATATGATATACTTGCCTTTGCCAAAGAA 

AGTGTGAATTCTCATGTTACCACGCTTGGACCTCAAAATTTTCCTGCCAATGACAAAGAACC 

ATGGCTTGTTGATTTCTTTGCCCCCTGGTGTCCACCATGTCGAGCTTTACTACCAGAGTTAC 

GAAGAGCATCAAATCTTCTTTATGGTCAGCTTAAGTTTGGTACACTAGATTGTACAGTTCAT 

GAGGGACTCTGTAACATGTATAACATTCAGGCTTATCCAACAACAGTGGTATTCAACCAGTC 

CAACATTCATGAGTATGAAGGACATCACTCTGCTGAACAAATCTTGGAGTTCATAGAGGATC 

TTATGAATCCTTCAGTGGTCTCCCTTACACCCACCACCTTCAACGAACTAGTTACACAAAGA 

AAACACAACGAAGTCTGGATGGTTGATTTCTATTCTCCGTGGTGTCATCCTTGCCAAGTCTT 

AATGCCAGAATGGAAAAGAATGGCCCGGACATTAACTGGACTGATCAACGTGGGCAGTATAG 

ATTGCCAACAGTATCATTCTTTTTGTGCCCAGGAAAACGTTCAAAGATACCCTGAGATAAGA 

TTTTTTCCCCCAAAATCAAATAAAGCTTATCAGTATCACAGTTACAATGGTTGGAATAGGGA 

TGCTTATTCCCTGAGAATCTGGGGTCTAGGATTTTTACCTCAAGTATCCACAGATCTAACAC 

CTCAGACTTTCAGTGAAAAAGTTCTACAAGGGAAAAATCATTGGGTGATTGATTTCTATGCT 

CCTTGGTGTGGACCTTGCCAGAATTTTGCTCCAGAATTTGAGCTCTTGGCTAGGATGATTAA 

AGGAAAAGTGAAAGCTGGAAAAGTAGACTGTCAGGCTTATGCTCAGACATGCCAGAAAGCTG 

GGATCAGGGCCTATCCAACTGTTAAGTTTTATTTCTACGAAAGAGCAAAGAGAAATTTTCAA 

GAAGAGCAGATAAATACCAGAGATGCAAAAGCAATCGCTGCCTTAATAAGTGAAAAATTGGA 

AACTCTCCGAAATCAAGGCAAGAGGAATAAGGATGAACTTIJS&TAATGTTGAAGATGAAGAA 

AAAGTTTAAAAGAAATTCTGACAGATGACATCAGAAGACACCTATTTAGAATGTTACATTTA 

TGATGGGAATGAATGAACATTATCTTAGACTTGCAGTTGTACTGCCAGAATTATCTACAGCA 

CTGGTGTAAAAGAAGGGTCTGCAAACTTTTTCTGTAAAGGGCCGGTTTATAAATATTTTAGA 

CTTTGCAGGCTATAATATATGGTTCACACATGAGAACAAGAATAGAGTCATCATGTATTCTT 

TGTTATTTGCTTTTAACAACCTTTAAAAAATATTAAAACGATTCTTAGCTCAGAGCCATACA 

AAAGTAGGCTGGATTCAGTCCATGGACCATAGATTGCTGTCCCCCTCGACGGACTTATAATG 

TTTCAGGTGGCTGGCTTGAACATGAGTCTGCTGTGCTATCTACATAAATGTCTAAGTTGTAT 

AAAGTCCACTTTCCCTTCACGTTTTTTGGCTGACCTGAAAAGAGGTAACTTAGTTTTTGGTC 

ACTTGTTCTCCTAAAAATGCTATCCCTAACCATATATTTATATTTCGTTTTAAAAACACCCA 

TGATGTGGCACAGTAAACAAACCCTGTTATGCTGTATTATTATGAGGAGATTCTTCATTGTT 

TTCTTTCCTTCTCAAAGGTTGAAAAAATGCTTTTAATTTTTCACAGCCGAGAAACAGTGCAG 
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FIGURE 189B 

CAGTATATGTGCACACAGTAAGTACACAAATTTGAGCAACAGTAAGTGCACAAATTCTGTAG 

TTTGCTGTATCATCCAGGAAAACCTGAGGGAAAAAAATTATAGCAATTAACTGGGCATTGTA 

GAGTATCCTAAATATGTTATCAAGTATTTAGAGTTCTATATTTTAAAGATATATGTGTTCAT 

GTATTTTCTGAAATTGCTTTCATAGAAATTTTCCCACTGATAGTTGATTTTTGAGGCATCTA 

ATATTTACATATTTGCCTTCTGAACTTTGTTTTGACCTGTATCCTTTATTTACATTGGGTTT 

TTCTTTCATAGTTTTGGTTTTTCACTCCTGTCCAGTCTATTTATTATTCAAATAGGAAAAAT 

TACTTTACAGGTTGTTTTACTGTAGCTTATAATGATACTGTAGTTATTCCAGTTACTAGTTT 

ACTGTCAGAGGGCTGCCTTTTTCAGATAAATATTGACATAATAACTGAAGTTATTTTTATAA 

GAAAATCAAGTATATAAATCTAGGAAAGGGATCTTCTAGTTTCTGTGTTGTTTAGACTCAAA 

GAATCACAAATTTGTCAGTAACATGTAGTTGTTTAGTTATAATTCAGAGTGTACAGAATGGT 

AAAAATTCCAATCAGTCAAAAGAGGTCAATGAATTAAAAGGCTTGCAACTTTTTCAAAAAAA 
AAAAAAAAAA 
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FIGURE 190 

</usr/seqcib2/sst/DNA/Dnaseqs.inin/ss.DNA56439 
<subunit 1 of 1, 747 aa, 1 stop 
<MW: 86127, pi: 7.46, NX(S/T): 2 

MGVWLNKDDYIRDLKRIILCFLIVYMAILVGTDQDFYSLLGVSKTASSREIRQAFKKLALKL 
HPDKNPNNPNAHGDFLKINRAYEVLKDEDLRKKYDKYGEKGLEDNQGGQYESWNYYRYDFGI 
YDDDPE I ITLERRE FDAAVNSGELWFVNFYSPGCSHCHDLAPTWRDFAKEVDGLLRIGAVNC 
GDDRMLCRMKGVNSYPSLFIFRSGMAPVKYHGDRSKESLVSFAMQHVRSTVTELWTGNFVNS 
I QTAFAAG I GWL I T FC S KGGDCLTSQTRLRLS GML FLNS LDAKE I YLEVIHNLPDFELLSAN 
TLEDRLAHHRWLLFFHFGKNENSNDPELKKLKTLLKNDHIQVGRFDCSSAPDICSNLYVFQP 
SLAVFKGQGTKEYEIHHGKKILYDILAFAKESVNSHVTTLGPQNFPANDKEPWLVDFFAPWC 
PPCRALLPELRRASNLLYGQLKFGTLDCTVHEGLCNMYNIQAYPTTWFNQSNIHEYEGHHS 
AEQ I LEFIEDLMNPS WSLTPTT FNELVTQRKHNEVWMVDFYS PWCHPCQVLMPEWKRMART 
LTGLINVGSIDCQQYHSFCAQENVQRYPEIRFFPPKSNKAYQYHSYNGWNRDAYSLRIWGLt; 
FLPQVS T DLT PQT FSEKVLQGKNHWVI DFYAPWCGPCQNFAPEFELLARMIKGKVKAGKVDC 
QAYAQTCQKAGIRAYPTVKFYFYERAKRNFQEEQINTRDAKAIAALISEKLETLRNQGKRNKDEL 

Inq>ortant features: 

Endoplasmic reticuliim targeting sequence. 

amino acids 744-747 

Cytochrome c family heme-binding site signature. 

amino acids 158-163 

Nt-dnaJ domain signature, 
amino acids 77-96 

N-glycosylation site, 
amino acids 484-487 
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FIGURE 191 

AGACAGTACCTCCTCCCTAGGACTACACAAGGACTGAACCAGAAGGAAGAGGACAGAGCAAA 

GCC^AACATCATCCTAGAAATCCTTCTGCTTCTGATCACCATCATCTACTCCTACTTGGA 

GTCGTTGGTGAAGTTTTTCATTCCTCAGAGGAGAAAATCTGTGGCTGGGGAGATTGTTCTCA 

TTACTGGAGCTGGGCATGGAATAGGCAGGCAGACTACTTATGAATTTGCAAAACGACAGAGC 

ATATTGGTTCTGTGGGATATTAATAAGCGCGGTGTGGAGGAAACTGCAGCTGAGTGCCGAAA 

ACTAGGCGTCACTGCGCATGCGTATGTGGTAGACTGCAGCAACAGAGAAGAGATCTATCGCT 

CTCTAAATCAGGTGAAGAAAGAAGTGGGTGATGTAACAATCGTGGTGAATAATGCTGGGACA 

GTATATCCAGCCGATCTTCTCAGCACCAAGGATGAAGAGATTACCAAGACATTTGAGGTCAA 

CATCCTAGGACATTTTTGGATCACAAAAGCACTTCTTCCATCGATGATGGAGAGAAATCATG 

GCCACATCGTCACAGTGGCTTCAGTGTGCGGCCACGAAGGGATTCCTTACCTCATCCCATAT 

TGTTCCAGCAAATTTGCCGCTGTTGGCTTTCACAGAGGTCTGACATCAGAACTTCAGGCCTT 

GGGAAAAACTGGTATCAAAACCTCATGTCTCTGCCCAGTTTTTGTGAATACTGGGTTCACCA 

AAAATCCAAGCACAAGATTATGGCCTGTATTGGAGACAGATGAAGTCGTAAGAAGTCTGATA 

GATGGAATACTTACCAATAAGAAAATGATTTTTGTTCCATCGTATATCAATATCTTTCTGAG 

ACTACAGAAGTTTCTTCCTGAACGCGCCTCAGCGATTTTAAATCGTATGCAGAATATTCAAT 

TTGAAGCAGTGGTTGGCCACAAAATCAAAATGAAAISMTAAATAAGCTCCAGCCAGAGATG 

TATGCATGATAATGATATGAATAGTTTCGAATCAATGCTGCAAAGCTTTATTTCACATTTTT 

TCAGTCCTGATAATATTAAAAACATTGGTTTGGCACTAGCAGCAGTCAAACGAACAAGATTA 

ATTACCTGTCTTCCTGTTTCTCAAGAATATTTACGTAGTTTTTCATAGGTCTGTTTTTCCTT 

TCATGCCTCTTAAAAACTTCTGTGCTTACATAAACATACTTAAAAGGTTTTCTTTAAGATAT 

TTTATTTTTCCATTTAAAGGTGGACAAAAGCTACCTCCCTAAAAGTAAATACAAAGAGAACT 

TATTTACACAGGGAAGGTTTAAGACTGTTCAAGTAGCATTCCAATCTGTAGCCATGCCACAG 

AATATCAACAAGAACACAGAATGAGTGCACAGCTAAGAGATCAAGTTTCAGCAGGCAGCTTT 

ATCTCAACCTGGACATATTTTAAGATTCAGCATTTGAAAGATTTCCCTAGCCTCTTCCTTTT 

TCATTAGCCCAAAACGGTGCAACTCTATTCTGGACTTTATTACTTGATTCTGTCTTCTGTAT 

AACTCTGAAGTCCACCAAAAGTGGACCCTCTATATTTCCTCCCTTTTTATAGTCTTATAAGA 

TACATTATGAAAGGTGACCGACTCTATTTTAAATCTCAGAATTTTAAGTTCTAGCCCCATGA 

TAACCTTTTTCTTTGTAATTTATGCTTTCATATATCCTTGGTCCCAGAGATGTTTAGACAAT 

TTTAGGCTCAAAAATTAAAGCTAACACAGGAAAAGGAACTGTACTGGCTATTACATAAGAAA 
CAATGGACCCAAGAGAAGAA 
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FIfitJRE 192 

</usr/seqclb2/sst/DNA/Dnaseqs.min/ss.DNA56409 
<subunit 1 of 1, 300 aa, 1 stop 
<MW: 33655, pi.: 9.31, NX{S/T): 1 

MNIILEILLLLITIIYSYLESLVKFFIPQRRKSVAGEIVLITGAGHGIGRQTTYEFAKRQSI 
LVLWDINKRGVEETAAECRKLGVTAHAYWDCSNREEIYRSLNQVKKEVGDVTIWNNAGTV 
YPADLLSTKDEEITKTFEVNILGHFWITKALLPSMMERNHGHIVTVASVCGHEGIPYLIPYC 
SSKFAAVGFHRGLTSELQALGKTGIKTSCLCPVFVNTGFTKNPSTRLWPVLETDEWRSLID 
GILTNKKMIFVPSYINIFLRLQKFLPERASAILNRMQNIQFEAWGHKIKMK 

Important features: 
Signal peptide: 
amino acids 1-19 

cAMP- and cGMP-dependent protein kinase phosphorylation site, 
amino acids 30-33 and 58-61 

Short-chain alcohol dehydrogenase family protein 
amino acids 165-202, 37-49, 112-122 and 210-219 
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FIGURE 193 

CGGCGGCGGCTGCGGGCGCGAGGTGAGGGGCGCGAGGTGAGGGGCGCGAGGTTCCCAGCAGG 
ATGCCCCGGCTCTGCAGGAAGCTGAAGTGAGAGGCCCGGAGAGGGCCCAGCCCGCCCGGGGC 
AGGAIfiACCAAGGCCCGGCTGTTCCGGCTGTGGCTGGTGCTGGGGTCGGTGTTCATGATCCT 
GCTGATCATCGTGTACTGGGACAGCGCAGGCGCCGCGCACTTCTACTTGCACACGTCCTTCT 
CTAGGCCGCACACGGGGCCGCCGCTGCCCACGCCCGGGCCGGACAGGGACAGGGAGCTCACG 
GCCGACTCCGATGTCGACGAGTTTCTGGACAAGTTTCTCAGTGCTGGCGTGAAGCAGAGCGA 
CCTTCCCAGAAAGGAGACGGAGCAGCCGCCTGCGCCGGGGAGCATGGAGGAGAGCGTGAGAG 
GCTACGACTGGTCCCCGCGCGACGCCCGGCGCAGCCCAGACCAGGGCCGGCAGCAGGCGGAG 
CGGAGGAGCGTGCTGCGGGGCTTCTGCGCCAACTCCAGCCTGGCCTTCCCCACCAAGGAGCG 
CGCATTCGACGACATCCCCAACTCGGAGCTGAGCCACCTGATCGTGGACGACCGGCACGGGG 
CCATCTACTGCTACGTGCCCAAGGTGGCCTGCACCAACTGGAAGCGCGTGATGATCGTGCTG 
AGCGGAAGCCTGCTGCACCGCGGTGCGCCCTACCGCGACCCGCTGCGCATCCCGCGCGAGCA 
CGTGCACAACGCCAGCGCGCACCTGACCTTCAACAAGTTCTGGCGCCGCTACGGGAAGCTCT 
CCCGCCACCTCATGAAGGTCAAGCTCAAGAAGTACACCAAGTTCCTCTTCGTGCGCGACCCC 
TTCGTGCGCCTGATCTCCGCCTTCCGCAGCAAGTTCGAGCTGGAGAACGAGGAGTTCTACCG 
CAAGTTCGCCGTGCCCATGCTGCGGCTGTACGCCAACCACACCAGCCTGCCCGCCTCGGCGC 
GCGAGGCCTTCCGCGCTGGCCTCAAGGTGTCCTTCGCCAACTTCATCCAGTACCTGCTGGAC 
CCGCACACGGAGAAGCTGGCGCCCTTCAACGAGCACTGGCGGCAGGTGTACCGCCTCTGCCA 
CCCGTGCCAGATCGACTACGACTTCGTGGGGAAGCTGGAGACTCTGGACGAGGACGCCGCGC 
AGCTGCTGCAGCTACTCCAGGTGGACCGGCAGCTCCGCTTCCCCCCGAGCTACCGGAACAGG 
ACCGCCAGCAGCTGGGAGGAGGACTGGTTCGCCAAGATCCCCCTGGCCTGGAGGCAGCAGCT 
GTATAAACTCTACGAGGCCGACTTTGTTCTCTTCGGCTACCCCAAGCCCGAAAACCTCCTCC 
GAGACIJSMAGCTTTCGCGTTGCTTTTTCTCGCGTGCCTGGAACCTGACGCACGCGCACTCC 
AGTTTTTTTATGACCTACGATTTTGCAATCTGGGCTTCTTGTTCACTCCACTGCCTCTATCC 
ATTGAGTACTGTATCGATATTGTTTTTTAAGATTAATATATTTCAGGTATTTAATACGA 
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FIGURE 194 

</usr/seqcib2/sst/DNA/Dnaseqs.min/ss.DNA56112 
<subunit 1 of 1, 414 aa, 1 stop 
<MW: 48414, pi: 9.54, NX(S/T): 4 

MTKARLFRLWLVLGSVFMILLIIVYWDSAGAAHETLHTSFSRPHTGPPLPTPGPDRDRELTA 
DSDVDEFLDKFLSAGVKQSDLPRKETEQPPAPGSMEESVRGYDWSPRDARRSPDQGRQQAER 
RS VLRG FCANS S LAFPTKERAFDDI PNS ELSHL I VDDRHGAI Y CYVPKVACTNWKRVMI VLS 
GSLLHRGAPYRDPLRIPREHVHNASAHLTFNKFWRRYGKLSRHLMKVKLKKYTKFLFVRDPF 
VRLISAFRSKFELENEEFYRKFAVPMLRLYANHTSLPASAREAFRAGLKVSFANFIQYLLDP 
HTEKLAPFNEHWRQVYRLCHPCQIDYDFVGKLETLDEDAAQLLQLLQVDRQLRFPPSYRNRT 
ASSWEEDWFAKIPLAWRQQLYKLYEADFVLFGYPKPENLLRD 

Important features: 
Signal peptide: 
amino acids 1-31 

N-glycosylation sites. 

amino acids 134-137, 209-212, 280-283 and 370-373 

TNFR/NGFR family cysteine-rich region protein 
amino acids 329-332 
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FIGURE 195 

TCGGGCCAGAATTCGGCACGAGGCGGCACGAGGGCGACGGCCTCACGGGGCTTTGGAGGTGA 
AAGAGGCCCAGAGTAGAGAGAGAGAGAGACCGACGTACACGGG&ISGCTACGGGAACGCGCT 
ATGCCGGGAAGGTGGTGGTCGTGACCGGGGGCGGGCGCGGCATCGGAGCTGGGATCGTGCGC 
GCCTTCGTGAACAGCGGGGCCCGAGTGGTTATCTGCGACAAGGATGAGTCTGGGGGCCGGGC 
CCTGGAGCAGGAGCTCCCTGGAGCTGTCTTTATCCTCTGTGATGTGACTCAGGAAGATGATG 
TGAAGACCCTGGTTTCTGAGACCATCCGCCGATTTGGCCGCCTGGATTGTGTTGTCAACAAC 
GCTGGCCACCACCCACCCCCACAGAGGCCTGAGGAGACCTCTGCCCAGGGATTCCGCCAGCT 
GCTGGAGCTGAACCTACTGGGGACGTACACCTTGACCAAGCTCGCCCTCCCCTACCTGCGGA 
AGAGTCAAGGGAATGTCATCAACATCTCCAGCCTGGTGGGGGCAATCGGCCAGGCCCAGGCA 
GTTCCCTATGTGGCCACCAAGGGGGCAGTAACAGCCATGACCAAAGCTTTGGCCCTGGATGA 
AAGTCCATATGGTGTCCGAGTCAACTGTATCTCCCCAGGAAACATCTGGACCCCGCTGTGGG 
AGGAGCTGGCAGCCTTAATGCCAGACCCTAGGGCCACAATCCGAGAGGGCATGCTGGCCCAG 
CCACTGGGCCGCATGGGCCAGCCCGCTGAGGTCGGGGCTGCGGCAGTGTTCCTGGCCTCCGA 
AGCCAACTTCTGCACGGGCATTGAACTGCTCGTGACGGGGGGTGCAGAGCTGGGGTACGGGT 
GCAAGGCCAGTCGGAGCACCCCCGTGGACGCCCCCGATATCCCTTCCIfiaTTTCTCTCATTT 
CTACTTGGGGCCCCCTTCCTAGGACTCTCCCACCCCAAACTCCAACCTGTATCAGATGCAGC 
CCCCAAGCCCTTAGACTCTAAGCCCAGTTAGCAAGGTGCCGGGTCACCCTGCAGGTTCCCAT 
AAAAACGATTTGCAGCC 
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FIGURE 196 



</usr/seqdb2/sst/DNA/Dnaseqs.niin/ss .DNA5604 5 
<subunit 1 of i, 270 aa, 1 stop 
<MW: 28317, pi: 6.00, NXfS/T): 1 

MATGTRYAGKVVWTGGGRGIGAGIViU\FVNSGARVVICDKDESGGR?^ 
VTQEDDVKTLVSETIRRFGRLDCWNNAGHHPPPQRPEETSAQGFRQLLELNLLGTYTLTKL 
ALPYLRKSQGNVINISSLVGAIGQAQAVPYVATKGAVTAMTKALAXDESPYGVRVNCISPGN 
IWTPLWEELAALMPDPRATIREGMLAQPLGRMGQPAEVGAAAVFLASEANFCTGIELLVTGG 
AELG YGCKAS RS T PVDAPD I PS 

Important features: 
N-glycosylation site, 
amino acids 138-141 

Short-chain alcohol dehydrogenase family protein 
amino acids 10-22, 81-91, 134-171 and 176-185 
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FIGURE 197 

AGGCGGGCAGCAGCTGCAGGCTGACCTTGCAGCTTGGCGGAMSGACTGGCCTCACAACCTG 

CTGTTTCTTCTTACCATTTCCATCTTCCTGGGGCTGGGCCAGCCCAGGAGCCCCAAAAGCAA 

GAGGAAGGGGCAAGGGCGGCCTGGGCCCCTGGCCCCTGGCCCTCACCAGGTGCCACTGGACC 

TGGTGTCACGGATGAAACCGTATGCCCGCATGGAGGAGTATGAGAGGAACATCGAGGAGATG 

GTGGCCCAGCTGAGGAACAGCTCAGAGCTGGCCCAGAGAAAGTGTGAGGTCAACTTGCAGCT 

GTGGATGTCCAACAAGAGGAGCCTGTCTCCCTGGGGCTACAGCATCAACCACGACCCCAGCC 

GTATCCCCGTGGACCTGCCGGAGGCACGGTGCCTGTGTCTGGGCTGTGTGAACCCCTTCACC 

ATGCAGGAGGACCGCAGCATGGTGAGCGTGCCGGTGTTCAGCCAGGTTCCTGTGCGCCGCCG 

CCTCTGCCCGCCACCGCCCCGCACAGGGCCTTGCCGCCAGCGCGCAGTCATGGAGACCATCG 

CTGTGGGCTGCACCTGCATCTTCIfi^TCACCTGGCCCAGAAGCCAGGCCAGCAGCCCGAGA 

CCATCCTCCTTGCACCTTTGTGCCAAGAAAGGCCTATGAAAAGTAAACACTGACTTTTGAAA 
GCAAG 
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FIGURE 198 



</usr/seqcib2/sst/DNA/Dnaseqs-roin/ss .DNA59294 
<subunit 1 of 1, 180 aa, 1 stop 
<MW: 20437, pi: 9.58, NX(S/T): 1 

MDWPHNLLFLLTISIFLGLGQPRSPKSKRKGQGRPGPLAPGPHQVPLDLVSRMKPYARMEEY 
ERNIEEMVAQLRNSSELAQRKCEVNLQLWMSNKRSLSPWGYSINHDPSRIPVDLPEARCLCL 
GCVNPFTMQEDRSMVSVPVFSQVPVRRRLCPPPPRTGPCRQRAVMETIAVGCTCIF 

Important features: 
Signal peptide: 
amino acids 1-20 

N-glycosylation site. 

amino acids 75-78 

Homologous region to IL-17 
amino acids 96-180. 
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FIGURE 199 

GCGCCGCCAGGCGTAGGCGGGGTGGCCCTTGCGTCTCCCGCTTCCTTGAAAAACCCGGCGGG 

CGAGCGAGGCTGCGGGCCGGCCGCTGCCCTTCCCCACACTCCCCGCCGAGAAGCCTCGCTCG 

GCGCCCAAC4ISGCGGGTGGGCGCTGCGGCCCGCAGCTAACGGCGCTCCTGGCCGCCTGGAT 

CGCGGCTGTGGCGGCGACGGCAGGCCCCGAGGAGGCCGCGCTGCCGCCGGAGCAGAGCCGGG 

TCCAGCCCATGACCGCCTCCAACTGGACGCTGGTGATGGAGGGCGAGTGGATGCTGAAATTT 

TACGCCCCATGGTGTCCATCCTGCCAGCAGACTGATTCAGAATGGGAGGCTTTTGCAAAGAA 

TGGTGAAATACTTCAGATCAGTGTGGGGAAGGTAGATGTCATTCAAGAACCAGGTTTGAGTG 

GCCGCTTCTTTGTCACCACTCTCCCAGCATTTTTTCATGCT^GGATGGGATATTCCGCCGT 

TATCGTGGCCCAGGAATCTTCGAAGACCTGCAGAATTATATCTTAGAGAAGAAATGGCAATC 

AGTCGAGCCTCTGACTGGCTGGAAATCCCCAGCTTCTCTAACGATGTCTGGAATGGCTGGTC 

TTTTTAGCATCTCTGGCAAGATATGGCATCTTCACAACTATTTCACAGTGACTCTTGGAATT 

CCTGCTTGGTGTTCTTATGTGTTTTTCGTCATAGCCACCTTGGTTTTTGGCCTTTTTATGGG 

TCTGGTCTTGGTGGTAATATCAGAATGTTTCTATGTGCCACTTCCAAGGCATTTATCTGAGC 

GTTCTGAGCAGAATCGGAGATCAGAGGAGGCTCATAGAGCTGAACAGTTGCAGGATGCGGAG 

GAGGAAAAAGATGATTCAAATGAAGAAGAAAACAAAGACAGCCTTGTAGATGATGAAGAAGA 

GAAAGAAGATCTTGGCGATGAGGATGAAGCAGAGGAAGAAGAGGAGGAGGACAACTTGGCTG 

CTGGTGTGGATGAGGAGAGAAGTGAGGCCAATGATCAGGGGCCCCCAGGAGAGGACGGTGTG 

ACCCGGGAGGAAGTAGAGCCTGAGGAGGCTGAAGAAGGCATCTCTGAGCAACCCTGCCCAGC 

TGACACAGAGGTGGTGGAAGACTCCTTGAGGCAGCGTAAAAGTCAGCATGCTGACAAGGGAC 

TGI^ATTTAATGATGCGTTTTCAAGAATACACACCAAAACAATATGTCAGCTTCCCTTTGG 

CCTGCAGTTTGTACCAAATCCTTAATTTTTCCTGAATGAGCAAGCTTCTCTTAAAAGATGCT 

CTCTAGTCATTTGGTCTCATGGCAGTAAGCCTCATGTATACTAAGGAGAGTCTTCCAGGTGT 

GACAATCAGGATATAGAAAAACAAACGTAGTGTTGGGATCTGTTTGGAGACTGGGATGGGAA 

CAAGTTCATTTACTTAGGGGTCAGAGAGTCTCGACCAGAGGAGGCCATTCCCAGTCCTAATC 

AGCACCTTCCAGAGACAAGGCTGCAGGCCCTGTGAAATGAAAGCCAAGCAGGAGCCTTGGCT 

CCTGAGCATCCCCAAAGTGTAACGTAGAAGCCTTGCATCCTTTTCTTGTGTAAAGTATTTAT 

TTTTGTCAAATTGCAGGAAACATCAGGCACCACAGTGCATGAAAAATCTTTCACAGCTAGAA 

ATTGAAAGGGCCTTGGGTATAGAGAGCAGCTCAGAAGTCATCCCAGCCCTCTGAATCTCCTG 

TGCTATGTTTTATTTCTTACCTTTAATTTTTCCAGCATTTCCACCATGGGCATTCAGGCTCT 

CCACACTCTTCACTATTATCTCTTGGTCAGAGGACTCCAATAACAGCCAGGTTTACATGAAC 

TGTGTTTGTTCATTCTGACCTAAGGGGTTTAGATAATCAGTAACCATAACCCCTGAAGCTGT 

GACTGCCAAACATCTCAAATGAAATGTTGTGGCCATCAGAGACTCAAAAGGAAGTAAGGATT 

TTACAAGACAGATTAAAAAAAAATTGTTTTGTCCAAAATATAGTTGTTGTTGATTTTTTTTT 

AAGTTTTCTAAGCAATATTTTTCAAGCCAGAAGTCCTCTAAGTCTTGCCAGTACAAGGTAGT 

CTTGTGAAGAAAAGTTGAATACTGTTTTGTTTTCATCTCAAGGGGTTCCCTGGGTCTTGAAC 

TACTTTAATAATAACTAAAAAACCACTTCTGATTTTCCTTCAGTGATGTGCTTTTGGTGAAA 

GAATTAATGAACTCCAGTACCTGAAAGTGAAAGATTTGATTTTGTTTCCATCTTCTGTAATC 

TTCCAAAGAATTATATCTTTGTAAATCTCTCAATACTCAATCTACTGTAAGTACCCAGGGAG 
GCTAATTTCTTT 
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FIGURE 2QQ 

</usr/seqdb2/sst/DNA/Dnaseqs .min/ss .DNA56433 
<subunit 1 cf 1, 349 aa, 1 stop 
<MW: 38952, pi: 4.34, NX{S/T): 1 

MAGGRCGPQLTALLAAWIAAVAATAGPEEAALPPEQSRVQPMTASNWTLVMEGEWML^ 

WCPSCQQTDSEWEAFAKNGEILQISVGKVDVIQEPGLSGRFFVTTLPAFFHAKDGIFRRYRG 

PG I FEDLQNY I LEKKWQSVEPLTGWKS PAS LTMSGMAGLFS I SGKI WHLHNYFTVTLG I PAW 

CSYVFn/IATLVFGLFMGLVLWISECFYVPLPRHLSERSEQNRRSEEAHRAEQLQDAEEEK 

DDSNEEENKDSLVDDEEEKEDLGDEDEAEEEEEEDNUIAGVDEERSEANDQGPPGEDGVTRE 

EVEPEEAEEGISEQPCPADTEWEDSLRQRKSQHADKGL 

Important features : 
Signal peptide: 
amino acids 1-22 

Transmembrane domain: 
amino acids 191-211 

N-glycosylation site, 
amino acids 46-49 

Thioredoxin family proteins. (homologous region to disulfide 

isomerase) 

amino acids 56-72 

Flavodoxin proteins 

amino acids 173-187 
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FIGURE 201 

ATCTGGTTGAACTACTTAAGCTTAATTTGTTAAACTCCGGTAAGTACCTAGCCCACATGATT 
TGACTCAGAGATTCTCTTTTGTCCACAGACAGTCATCTCAGGGGCAGAAAGAAAAGAGCTCC 
CAAATGCTATATCTATTCAGGGGCTCTCAAGAACAM5GAATATCATCCTGATTTAGAAAAT 
TTGGATGAAGATGGATATACTCAATTACACTTCGACTCTCAAAGCAATACCAGGATAGCTGT 
TGTTTCAGAGAAAGGATCGTGTGCTGCATCTCCTCCTTGGCGCCTCATTGCTGTAATTTTGG 
GAATCCTATGCTTGGTAATACTGGTGATAGCTGTGGTCCTGGGTACCATGGGGGTTCTTTCC 
AGCCCTTGTCCTCCTAATTGGATTATATATGAGAAGAGCTGTTATCTATTCAGCATGTCACT 
AAATTCCTGGGATGGAAGTAAAAGACAATGCTGGCAACTGGGCTCTAATCTCCTAAAGATAG 
ACAGCTCAAATGAATTGGGATTTATAGTAAAACAAGTGTCTTCCCAACCTGATAATTCATTT 
TGGATAGGCCTTTCTCGGCCCCAGACTGAGGTACCATGGCTCTGGGAGGATGGATCAACATT 
CTCTTCTAACTTATTTCAGATCAGAACCACAGCTACCCAAGAAAACCCATCTCCAAATTGTG 
TATGGATTCACGTGTCAGTCATTTATGACCAACTGTGTAGTGTGCCCTCATATAGTATTTGT 
GAGAJVGAAGTTTTCAATG2AAGAGGAAGGGTGGAGAAGGAGAGAGAAATATGTGAGGTAGTA 
AGGAGGACAGAAAACAGAACAGAAAAGAGTAACAGCTGAGGTCAAGATAAATGCAGAAAATG 
TTTAGAGAGCTTGGCCAACTGTAATCTTAACCAAGAAATTGAAGGGAGAGGCTGTGATTTCT 
GTATTTGTCGACCTACAGGTAGGCTAGTATTATTTTTCTAGTTAGTAGATCCCTAGACATGG 
AATCAGGGCAGCCAAGCTTGAGTTTTTATTTTTTATTTATTTATTTTTTTGAGATAGGGTCT 
CACTTTGTTACCCAGGCTGGAGTGCAGTGGCACAATCTCGACTCACTGCAGCTATCTCTCGC 
CTCAGCCCCTCAAGTAGCTGGGACTACAGGTGCATGCCACCATGCCAGGCTAATTTTTGGTG 
TTTTTTGTAGAGACTGGGTTTTGCCATGTTGACCAAGCTGGTCTCTAACTCCTGGGCTTAAG 
TGATCTGCCCGCCTTGGCCTCCCAAAGTGCTGGGATTACAGATGTGAGCCACCACACCTGGC 
CCCAAGCTTGAATTTTCATTCTGCCATTGACTTGGCATTTACCTTGGGTAAGCCATAAGCGA 
ATCTTAATTTCTGGCTCTATCAGAGTTGTTTCATGCTCAACAATGCCATTGAAGTGCACGGT 
GTGTTGCCACGATTTGACCCTCAACTTCTAGCAGTATATCAGTTATGAACTGAGGGTGAAAT 
ATATTTCTGAATAGCTAAATGAAGAAATGGGAAAAAATCTTCACCACAGTCAGAGCAATTTT 
ATTATTTTCATCAGTATGATCATAATTATGATTATCATCTTAGTAAAAAGCAGGAACTCCTA 
CTTTTTCTTTATCAATTAAATAGCTCAGAGAGTACATCTGCCATATCTCTAATAGAATCTTT 
^Tf'r^TTTT7T7TT:TTlTlGAGACAGAGTTTCGCTCTTGlTGCCCAGGClGGhGTGCPJiCGG 
CACGATCTCGGCTCACCGCAACCTCCGCCCCCTGGGTTCAAGCAATTCTCCTGCCTCAGCCT 
CCCAAGTAGCTGGGATTACAGTCAGGCACCACCACACCCGGCTAATTTTGTATTTTTTTAGT 
AGAGACAGGGTTTCTCCATGTCGGTCAGGGTAGTCCCGAACTCCTGACCTCAAGTGATCTGC 
CTGCCTCGGCCTCCCAAGTGCTGGGATTACAGGCGTGAGCCACTGCACCCAGCCTAGAATCT 
TGTATAATATGTAATTGTAGGGAAACTGCTCTCATAGGAAAGTTTTCTGCTTTTTAAATACA 
AAAATACATAAAAATACATAAAATCTGATGATGAATATAAAAAAGTAACCAACCTCATTGGA 
ACAAGTATTAACATTTTGGAATATGTTTTATTAGTTTTGTGATGTACTGTTTTACAATTTTT 
ACCATTTTTTTCAGTAATTACTGTAAAATGGTATTATTGGAATGAAACTATATTTCCTCATG 
TGCTGATTTGTCTTATTTTTTTCATACTTTCCCACTGGTGCTATTTTTATTTCCAATGGATA 
TTTCTGTATTACTAGGGAGGCATTTACAGTCCTCTAATGTTGATTAATATGTGAAAAGAAAT 
TGTACCAATTTTACTAAATTATGCAGTTTAAAATGGATGATTTTATGTTATGTGGATTTCAT 
TTCAATAAAAAAAAACTCTTATCAAAAAAAAAAAAAAAAAAT^AAAAAAAAAAAAAAAAAA 
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</usr/seqdb2/sst/DNA/Dnaseqs .min/ss . DNA53912 
<subunit 1 of 1, 201 aa, 1 stop 
<MW: 22563, pi: 4.87, NX(S/T): 1 

MEYHPDLENLDEDGYTQLHFDSQSNTRIAWSEKGSCAASPPWRLIAVILGILCLVILVIAV 
VLGTMGVLSSPCPPNWIIYEKSCYLFSMSLNSWDGSKRQCWQLGSNLLKIDSSNELGFIVKQ 
VSSQPDNSFWIGLSRPQTEVPWLWEDGSTFSSNLFQIRTTATQENPSPNCVWIHVSVIYDQL 
CSVPSYSICEKKFSM 

Important features: 

Type II transmembrane domain: 

amino acids 45-65 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 197-200 

N-myristoylation sites. 

amino acids 35-40 and 151-156 

Homologous region to LDL receptor 
amino acids 34-67 and 70-200. 
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FIGURE 2Q3A 

GGAAGGGGAGGAGCAGGCCACACAGGCACAGGCCGGTGAGGGACCTGCCCAGACCTGGAGGG 
TCTCGCTCTGTCACACAGGCTGGAGTGCAGTGGTGTGATCTTGGCTCATCGTAACCTCCACC 
TCCCGGGTTCAAGTGATTCTCATGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGTGGTGAC 
TTCCAAGAGTGACJCCGTCGGAGGAAA^ISACTCCCCAGTCGCTGCTGCAGACGACACTGTT 
CCTGCTGAGTCTGCTCTTCCTGGTCCAAGGTGCCCACGGCAGGGGCCACAGGGAAGACTTTC 
GCTTCTGCAGCCAGCGGAACCAGACACACAGGAGCAGCCTCCACTACAAACCCACACCAGAC 
CTGCGCATCTCCATCGAGAACTCCGAAGAGGCCCTCACAGTCCATGCCCCTTTCCCTGCAGC 
CCACCCTGCTTCCCGATCCTTCCCTGACCCCAGGGGCCTCTACCACTTCTGCCTCTACTGGA 
ACCGACATGCTGGGAGATTACATCTTCTCTATGGCAAGCGTGACTTCTTGCTGAGTGACAAA 
GCCTCTAGCCTCCTCTGCTTCCAGCACCAGGAGGAGAGCCTGGCTCAGGGCCCCCCGCTGTT 
AGCCACTTCTGTCACCTCCTGGTGGAGCCCTCAGAACATCAGCCTGCCCAGTGCCGCCAGCT 
TCACCTTCTCCTTCCACAGTCCTCCCCACACGGCCGCTCACAATGCCTCGGTGGACATGTGC 
GAGCTCAAAAGGGACCTCCAGCTGCTCAGCCAGTTCCTGAAGCATCCCCAGAAGGCCTCAAG 
GAGGCCCTCGGCTGCCCCCGCCAGCCAGCAGTTGCAGAGCCTGGAGTCGAAACTGACCTCTG 
TGAGATTCATGGGGGACATGGTGTCCTTCGAGGAGGACCGGATCAACGCCACGGTGTGGAAG 
CTCCAGCCCACAGCCGGCCTCCAGGACCTGCACATCCACTCCCGGCAGGAGGAGGAGCAGAG 
CGAGATCATGGAGTACTCGGTGCTGCTGCCTCGAACACTCTTCCAGAGGACGAAAGGCCGGA 
GCGGGGAGGCTGAGAAGAGACTCCTCCTGGTGGACTTCAGCAGCCAAGCCCTGTTCCAGGAC 
AAGAATTCCAGCCAAGTCCTGGGTGAGAAGGTCTTGGGGATTGTGGTACAGAACACCAAAGT 
AGCCAACCTCACGGAGCCCGTGGTGCTCACTTTCCAGCACCAGCTACAGCCGAAGAATGTGA 
CTCTGCAATGTGTGTTCTGGGTTGAAGACCCCACATTGAGCAGCCCGGGGCATTGGAGCAGT 
GCTGGGTGTGAGACCGTCAGGAGAGAAACCCAAACATCCTGCTTCTGCAACCACTTGACCTA 
CTTTGCAGTGCTGATGGTCTCCTCGGTGGAGGTGGACGCCGTGCACAAGCACTACCTGAGCC 
TCCTCTCCTACGTGGGCTGTGTCGTCTCTGCCCTGGCCTGCCTTGTCACCATTGCCGCCTAC 
CTCTGCTCCAGGGTGCCCCTGCCGTGCAGGAGGAAACCTCGGGACTACACCATCAAGGTGCA 
CATGAACCTGCTGCTGGCCGTCTTCCTGCTGGACACGAGCTTCCTGCTCAGCGAGCCGGTGG 
CCCTGACAGGCTCTGAGGCTGGCTGCCGAGCCAGTGCCATCTTCCTGCACTTCTCCCTGCTC 
ACCTGCCTTTCCTGGATGGGCCTCGAGGGGTACAACCTCTACCGACTCGTGGTGGAGGTCTT 
TGGCACCTATGTCCCTGGCTACCTACTCAAGCTGAGCGCCATGGGCTGGGGCTTCCCCATCT 
TTCTGGTGACGCTGGTGGCCCTGGTGGATGTGGACAACTATGGCCCCATCATCTTGGCTGTG 
CATAGGACTCCAGAGGGCGTCATCTACCCTTCCATGTGCTGGATCCGGGACTCCCTGGTCAG 
CTACATCACCAACCTGGGCCTCTTCAGCCTGGTGTTTCTGTTCAACATGGCCATGCTAGCCA 
CCATGGTGGTGCAGATCCTGCGGCTGCGCCCCCACACCCAAAAGTGGTCACATGTGCTGACA 
CTGCTGGGCCTCAGCCTGGTCCTTGGCCTGCCCTGGGCCTTGATCTTCTTCTCCTTTGCTTC 
TGGCACCTTCCAGCTTGTCGTCCTCTACCTTTTCAGCATCATCACCTCCTTCCAAGGCTTCC 
TCATCTTCATCTGGTACTGGTCCATGCGGCTGCAGGCCCGGGGTGGCCCCTCCCCTCTGAAG 
AGCAACTCAGACAGCGCCAGGCTCCCCATCAGCTCGGGCAGCACCTCGTCCAGCCGCATCl^ 
fiGCCTCCAGCCCACCTGCCCATGTGATGAAGCAGAGATGCGGCCTCGTCGCACACTGCCTGT 
GGCCCCCGAGCCAGGCCCAGCCCCAGGCCAGTCAGCCGCAGACTTTGGAAAGCCCAACGACC 
ATGGAGAGATGGGCCGTTGCCATGGTGGACGGACTCCCGGGCTGGGCTTTTGAATTGGCCTT 
GGGGACTACTCGGCTCTCACTCAGCTCCCACGGGACTCAGAAGTGCGCCGCCATGCTGCCTA 
GGGTACTGTCCCCACATCTGTCCCAACCCAGCTGGAGGCCTGGTCTCTCCTTACAACCCCTG 
GGCCCAGCCCTCATTGCTGGGGGCCAGGCCTTGGATCTTGAGGGTCTGGCACATCCTTAATC 
CTGTGCCCCTGCCTGGGACAGAAATGTGGCTCCAGTTGCTCTGTCTCTCGTGGTCACCCTGA 
GGGCACTCTGCATCCTCTGTCATTTTAACCTCAGGTGGCACCCAGGGCGAATGGGGCCCAGG 
GCAGACCTTCAGGGCCAGAGCCCTGGCGGAGGAGAGGCCCTTTGCCAGGAGCACAGCAGCAG 
CTCGCCTACCTCTGAGCCCAGGCCCCCTCCCTCCCTCAGCCCCCCAGTCCTCCCTCCATCTT 
CCCTGGGGTTCTCCTCCTCTCCCAGGGCCTCCTTGCTCCTTCGTTCACAGCTGGGGGTCCCC 
GATTCCAATGCTGTTTTTTGGGGAGTGGTTTCCAGGAGCTGCCTGGTGTCTGCTGTAAATGT 
TTGTCTACTGCACAAGCCTCGGCCTGCCCCTGAGCCAGGCTCGGTACCGATGCGTGGGCTGG 
GCTAGGTCCCTCTGTCCATCTGGGCCTTTGTATGAGCTGCATTGCCCTTGCTCACCCTGACC 
AAGCACACGCCTCAGAGGGGCCCTCAGCCTCTCCTGAAGCCCTCTTGTGGCAAGAACTGTGG 
ACCATGCCAGTCCCGTCTGGTTTCCATCCCACCACTCCAAGGACTGAGACTGACCTCCTCTG 
GTGACACTGGCCTAGAGCCTGACACTCTCCTAAGAGGTTCTCTCCAAGCCCCCAAATAGCTC 
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FIGURE 2Q3B 

CAGGCGCCCTCGGCCGCCCATCATGGTTAATTCTGTCCAACAAACACACACGGGTAGATTGC 

TGGCCTGTTGTAGGTGGTAGGGACACAGATGACCGACCTGGTCACTCC7CCTGCCAACATTC 

AGTCTGGTATGTGAGGCGTGCGTGAAGCAAGAACTCCTGGAGCTACAGGGACAGGGAGCCAT 

CATTCCTGCCTGGGAATCCTGGAAGACTTCCTGCAGGAGTCAGCGTTCAATCTTGACCTTGA 

AGATGGGAAGGATGTTCTTTTTACGTACCAATTCTTTTG7CTTTTGATATTAAAAAGAAGTA 

CATGTTCATTGTAGAGAATTTGGAAACTGTAGAAGAGAATCAAGAAGAAAAATAAAAATCAG 

C T G T T G T AAT CGCC T AGC AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAT^^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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</usr/seqclb2/sst/DNA/Dnaseqs .min/ss . DNA50921 
<subunit 1 of l, 693 aa, 1 stop 
<MW: 77738, pi: 8,87, NX(S/T): 7 

MTPQSLLQTTLFLLSLLFLVQGAHGRGHREDFRFCSQRNQTHRSSLHYKPTPDLRISIENSE 
EALTVHAPFPAAHPASRSFPDPRGLYHFCLYWNRHAGRLHLLYGKRDFLLSDKASSLLCFQH 
QEESLAQGPPLLATSVTSWWSPQNISLPSAASFTFSFHSPPHTAAHNASVDMCELKRDLQLL 
SQFLKHPQKASRRPSAAPASQQLQSLESKLTSVRFMGDMVSFEEDRINATVWKLQPTAGLQD 
LHIHSRQEEEQSEIMEYSVLLPRTLFQRTKGRSGEAEKRLLLVDFSSQALFQDKNSSQVLGE 
KVLGIVVQNTKVANLTEPVVLTFQHQLQPKNVTLQCVFWVEDPTLSSPGKWSSAGCETVRRE 
TQTSCFCNHLTYFAVLMVSSVEVDAVHKHYLSLLSYVGCWSALACLVTIAAYLCSRVPLPC 
RRKPRDYTIKVHMNLLLAVFLLDTSFLLSEPVALTGSEAGCRASAIFLHFSLLTCLSWMGLE 
GYNLYRLWEVFGTYVPGYLLKLSAMGWGFPIFLVTLVALVDVDNYGFIILAVHRTPEGVIY 
PSMCWIRDSLVSYITNLGLFSLVFLFNMAMLATMVVQILRLRPHTQKWSHVLTLLGLSLVLG 
LPWALIFFSFASGTFQLWLYLFSIITSFQGFLIFIWYWSMRLQARGGPSPLKSNSDSARLP 
ISSGSTSSSRI 

Important features: 
Signal peptide: 

amino acids 1-25 

Putative transmembrane domains: 

amino acids 382-398, 402-420, 445-468, 473-491, 519-537, 568-590 
and 634-657 

Microbodies C-terminal targeting signal, 
amino acids 691-693 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 198-201 and 370-373 

N-glycosylation sites. 

amino acids 39-42, 148-151, 171-174, 234-237, 303-306, 324-327 
and 341-344 

G-protein coupled receptors famxly 2 proteins 
amino acids 475-504 , 
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FIGURE 205 



TGCGTGGCCTGCCTTGTCAACAATGCCGCTTACTCTGCTTCCAGGTTGCCCTGCCTTGCAGA 
GGAAANCNTCGGGACTACACCNTCAAGTGCACATGAACCTGCTGCTGGCCGTCTTCCTGCTG 
GACACGAGCTTCCTGCTCAGCGNAGCCGGTGGCCCTGACAGGCTCTGAAGGCTGGCTGCCGA 
GCCAGTGCCATCTTCCTGCACTTCTCCTGCTCACCTGCCTTTCCTGGATGGGCCTCGAGGGG 
TACAACCTCTACCGACTCGTGGTGGAGGTCTTTGGCACCTATGTCCCTGGCTACCTACTCAA 
GCTGAGCGCCATGGGCTGGGGCTTCCCCATCTTTCTGGTGACGCTGGTGGCCCTGGTGGATG 
TGGACAACTATGGCCCCATCATCTTGGCTGTGCATAGGACTCCAGAGGGCGTCATCTACCCT 
TCCATGTGCTGGATCCGGGACTCCCTGGTCAGCTACATCACCAACCTGGGCCTCTTCAGCCT 
GGTGTTTCTGTTCAACATGG 
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FIGURE 206 



CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCTGGTTCAGGTCCA 
GGTTTTGCTTTGATCCTTTTCAAAAACTGGAGACACAGAAGAGGGCTCTAGGAAAAAGTTTT 
GGATGGGATTATGTGGAAACTACCCTGCGATTCTCTGCTGCCAGAGCAGGCTCGGCGCTTCC 
ACCCCAGTGCAGCCTTCCCCTGGCGGTGGTGAAAGAGACTCGGGAGTCGCTGCTTCCAAAGT 
GCCCGCCGTGAGTGAGCTCTCACCCCAGTCAGCCAAMOAGCCTCTTCGGGCTTCTCCTGCT 
GACATCTGCCCTGGCCGGCCAGAGACAGGGGACTCAGGCGGAATCCAACCTGAGTAGTAAAT 
TCCAGTTTTCCAGCAACAAGGAACAGAACGGAGTACAAGATCCTCAGCATGAGAGAATTATT 
ACTGTGTCTACTAATGGAAGTATTCACAGCCCAAGGTTTCCTCATACTTATCCAAGAAATAC 
GGTCTTGGTATGGAGATTAGTAGCAGTAGAGGAAAATGTATGGATACAACTTACGTTTGATG 
AAAGATTTGGGCTTGAAGACCCAGAAGATGACATATGCAAGTATGATTTTGTAGAAGTTGAG 
GAACCCAGTGATGGAACTATATTAGGGCGCTGGTGTGGTTCTGGTACTGTACCAGGAAAACA 
GATTTCTAAAGGAAATCAAATTAGGATAAGATTTGTATCTGATGAATATTTTCCTTCTGAAC 
CAGGGTTCTGCATCCACTACAACATTGTCATGCCACAATTCACAGAAGCTGTGAGTCCTTCA 
GTGCTACCCCCTTCAGCTTTGCCACTGGACCTGCTTAATAATGCTATAACTGCCTTTAGTAC 
CTTGGAAGACCTTATTCGATATCTTGAACCAGAGAGATGGCAGTTGGACTTAGAAGATCTAT 

ataggccaacttggcaacttcttggcaaggcttttgtttttggaagaaaatccagagtggtg 
gatctgaaccttctaacagaggaggtaagattatacagctgcacacctcgtaacttctcagt 
gtccataagggaagaactaaagagaaccgataccattttctggccaggttgtctcctggtta 
aacgctgtggtgggaactgtgcctgttgtctccacaattgcaatgaatgtcaatgtgtccca 
agcaaagttactaaaaaataccacgaggtccttcagttgagaccaaagaccggtgtcagggg 
attgcacaaatcactcaccgacgtggccctggagcaccatgaggagtgtgactgtgtgtgca 
gagggagcacaggaggai:agccgcatcaccaccagcagctcttgcccagagctgtgcagtgc 
agtggctgattctattagagaacgtatgcgttatctccatccttaatctcagttgtttgctt 

CAAGGACC TT TCATC TTCAGGATT TAjCAGTGCATTCTGAAAGAGGAGACATCAAACAGAATT 

aggagttgtgcaacagctcttttgagaggaggcctaaaggacaggagaaaaggtcttcaatc 
gtggaaagaaaattaaatgttgtattaaatagatcaccagctagtttcagagttaccatgta 
cgtattccactagctgggttctgtatttcagttctttcgatacggcttagggtaatgtcagt 
acaggaaaaaaactgtgcaagtgagcacctgattccgttgccttgcttaactctaaagctcc 

ATGTCCTGGGCCTAAAATCGTATAAAATCTGGATTTTTTTTTTTTTTTTTGCTCATATTCAC 

atatgtaaaccagaacattctatgtactacaaacctggtttttaaaaaggaactatgttgct 

ATGAATTAAACTTGTGTCATGCTGATAGGACAGACTGGATTTTTCATATTTCTTATTAAAAT 

ttctgccatttagaagaagagaactacattcatggtttggaagagataaacctgaaaagaag 

AGTGGCCTTATCTTCACTTTATCGATAAGTCAGTTTATTTGTTTCATTGTGTACATTTTTAT 

attctccttttgacattataactgttggcttttctaatcttgttaaatatatctatttttac 

CAAAGGTATTTAATATTCTTTTTTATGACAACTTAGATCAACTATTTTTAGCTTGGTAAATT 

tttctaaacacaattgttatagccagaggaacaaagatgatataaaatattgttgctctgac 
aaaaatacatgtatttcattctcgtatggtgctagagttagattaatctgcattttaaaaaa 
ctgaattggaatagaattggtaagttgcaaagactttttgaaaataattaaattatcatatc 
ttccattcctgttattggagatgaaaataaaaagcaacttatgaaagtagacattcagatcc 
agccattactaacctattccttttttggggaaatctgagcctagctcagaaaaacataaagc 

ACCTTGAAAAAGACTTGGCAGCTTCCTGATAAAGCGTGCTGTGCTGTGCAGTAGGAACACAT 

cctatttattgtgatgttgtggttttattatcttaaactctgttccatacacttgtataaat 

ACATGGATATTTTTATGTACAGAAGTATGTCTCTTAACCAGTTCACTTATTGTACTCTGGCA 
ATTTAAAAGAAAATCAGTAAAATATTTTGCTTGTAAAATGCTTAATATNGTGCCTAGGTTAT 
GTGGTGACTATTTGAATCAAAAATGTATTGAATCATCAAATAAAAGAATGTGGCTATTTTGG 
GGAGAAAATTAAAAAAAAAAAAAAAAAAAAAGGTTTAGGGATAACAGGGTAATGCGGCC 




FIGURE 207 

MSLFGLLLLTSALAGQRQGTQAESNLSSKFQFSSNKEQNGVQDPQHERIITVSTNGSIHSPR 
FPHTYPRNTVLVWRLVAVEE^^VWIQLTFDERFGLEDPEDDICKyDFVEV•EEPSDGTILGRWC 
GSGTVPGKQISKGNQIRIRFVSDEYFPSEPGFCIHYNIVMPQFTEAVSPSVLPPSALPLDLL 
NNAITAFSTLEDLIRYLEPERWQLDLEDLYRPTWQLLGKAFVFGRKSRWDLNLLTEEVRLY 
SCTPRNFSVSIREELKRTDTIFWPGCLLVKRCGGNCACCLHNCNECQCVFSKVTKKYHEVLQ 
LRPKTGVRGLKKSLTDVALEHHEECDCVCRGSTGG 
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FIGURE 208 

CCCATCTCAAGCTGATCTTGGCACCTCTCATGCTCTGCTCTCTTCAACCAGACCTCTACATT 

CCATTTTGGA.z^GAAGACTAAAAMSGTGTTTCCAATGTGGACACTGAAGAGACAAATTCTTA 

TCCTTTTTAACATAATCCTAATTTCCAAACTCCTTGGGGCTAGATGGTTTCCTAAAACTCTG 

CCCTGTGATGTCACTCTGGATGTTCCAAAGAACCATGTGATCGTGGACTGCACAGACAAGCA 

TTTGACAGAAATTCCTGGAGGTATTCCCACGAACACCACGAACCTCACCCTCACCATTAACC 

ACATACCAGACATCTCCCCAGCGTCCTTTCACAGACTGGACCATCTGGTAGAGATCGATTTC 

AGATGCAACTGTGTACCTATTCCACTGGGGTCAAAAAACAACATGTGCATCAAGAGGCTGCA 

GATTAAACCCAGAAGCTTTAGTGGACTCACTTATTTAAAATCCCTTTACCTGGATGGAAACC 

AGCTACTAGAGATACCGCAGGGCCTCCCGCCTAGCTTACAGCTTCTCAGCCTTGAGGCCAAC 

AACATCTTTTCCATCAGAAAAGAGAATCTAACAGAACTGGCCAACATAGAAATACTCTACCT 

GGGCCAAAACTGTTATTATCGAAATCCTTGTTATGTTTCATATTCAATAGAGAAAGATGCCT 

TCCTAAACTTGACAAAGTTAAAAGTGCTCTCCCTGAAAGATAACAATGTCACAGCCGTCCCT 

AC TG T T T TGCCAT C T ACT T T AACAGAAC TATATCTCTACAACAACATGATTGCAAAAATCCA 

AGAAGATGATTTTAATAACCTCAACCAATTACAAATTCTTGACCTAAGTGGAAATTGCCCTC 

GTTGTTATAATGCCCCATTTCCTTGTGCGCCGTGTAAAAATAATTCTCCCCTACAGATCCCT 

GTAAATGCTTTTGATGCGCTGACAGAATTAAAAGTTTTACGTCTACACAGTAACTCTCTTCA 

GCATGTGCCCCCAAGATGGTTTAAGAACATCAACAAACTCCAGGAACTGGATCTGTCCCAAA 

ACTTCTTGGCCAAAGAAATTGGGGATGCTAAATTTCTGCATTTTCTCCCCAGCCTCATCCAA 

TTGGATCTGTCTTTCAA.TTTTGAACTTCAGGTCTATCGTGCATCTATGAATCTATCACAAGC 

ATTTTCTTCACTGAAAAGCCTGAAAATTCTGCGGATCAGAGGATATGTCTTTAAAGAGTTGA 

AAAGCTTTAACCTCTCGCCATTACATAATCTTCAAAATCTTGAAGTTCTTGATCTTGGCACT 

AACTTTATAAAAATTGCTAACCTCAGCATGTTTAAACAATTTAAAAGACTGAAAGTCATAGA 

TCTTTCAGTGAATAAAATATCACCTTCAGGAGATTCAAGTGAAGTTGGCTTCTGCTCAAATG 

CCAGAACTTCTGTAGAAAGTTATGAACCCCAGGTCCTGGAACAATTACATTATTTCAGATAT 

GATAAGTATGCAAGGAGTTGCAGATTCAAAAACAAAGAGGCTTCTTTCATGTCTGTTAATGA 

AAGCTGCTACAAGTATGGGCAGACCTTGGATCTAAGTAAAAATAGTATATTTTTTGTCAAGT 

CCTCTGATTTTCAGCATCTTTCTTTCCTCAAATGCCTGAATCTGTCAGGAAATCTCATTAGC 

CAAACTCTTAATGGCAGTGAATTCCAACCTTTAGCAGAGCTGAGATATTTGGACTTCTCCAA 

CAACCGGCTTGATTTACTCCATTCAACAGCATTTGAAGAGCTTCACAAACTGGAAGTTCTGG 

ATATAAGCAGTAATAGCCATTATTTTCAATCAGAAGGAATTACTCATATGCTAAACTTTACC 

AAGAACCTAAAGGTTCTGCAGAAACTGATGATGAACGACAATGACATCTCTTCCTCCACCAG 

CAGGACCATGGAGAGTGAGTCTCTTAGAACTCTGGAATTCAGAGGAAATCACTTAGATGTTT 

TATGGAGAGAAGGTGATAACAGATACTTACAATTATTCAAGAATCTGCTAAAATTAGAGGAA 

TTAGACATCTCTAAAAATTCCCTAAGTTTCTTGCCTTCTGGAGTTTTTGATGGTATGCCTCC 

AAATCTAAAGAATCTCTCTTTGGCCAAAAATGGGCTCAAATCTTTCAGTTGGAAGAAACTCC 

AGTGTCTAAAGAACCTGGAAACTTTGGACCTCAGCCACAACCAACTGACCACTGTCCCTGAG 

AGATTATCCAACTGTTCCAGAAGCCTCAAGAATCTGATTCTTAAGAATAATCAAATCAGGAG 

TCTGACGAAGTATTTTCTACAAGATGCCTTCCAGTTGCGATATCTGGATCTCAGCTCAAATA 

AAATCCAGATGATCCAAAAGACCAGCTTCCCAGAAAATGTCCTCAACAATCTGAAGATGTTG 

CTTTTGCATCATAATCGGTTTCTGTGCACCTGTGATGCTGTGTGGTTTGTCTGGTGGGTTAA 

CCATACGGAGGTGACTATTCCTTACCTGGCCACAGATGTGACTTGTGTGGGGCCAGGAGCAC 

ACAAGGGCCAAAGTGTGATCTCCCTGGATCTGTACACCTGTGAGTTAGATCTGACTAACCTG 

ATTCTGTTCTCACTTTCCATATCTGTATCTCTCTTTCTCATGGTGATGATGACAGCAAGTCA 

CCTCTATTTCTGGGATGTGTGGTATATTTACCATTTCTGTAAGGCCAAGATAAAGGGGTATC 

AGCGTCTAATATCACCAGACTGTTGCTATGATGCTTTTATTGTGTATGACACTAAAGACCCA 

GCTGTGACCGAGTGGGTTTTGGCTGAGCTGGTGGCCAAACTGGAAGACCCAAGAGAGAAACA 

TTTTAATTTATGTCTCGAGGAAAGGGACTGGTTACCAGGGCAGCCAGTTCTGGAAAACCTTT 

CCCAGAGCATACAGCTTAGCAAAAAGACAGTGTTTGTGATGACAGACAAGTATGCAAAGACT 

GAAAATTTTAAGATAGCATTTTACTTGTCCCATCAGAGGCTCATGGATGAAAAAGTTGATGT 

GATTATCTTGATATTTCTTGAGAAGCCCTTTCAGAAGTCCAAGTTCCTCCAGCTCCGGAAAA 

GGCTCTGTGGGAGTTCTGTCCTTGAGTGGCCAACAAACCCGCAAGCTCACCCATACTTCTGG 

CAGTGTCTAAAGAACGCCCTGGCCACAGACAATCATGTGGCCTATAGTCAGGTGTTCAAGGA 

AACGGTClftSCCCTTCTTTGCAAAACACAACTGCCTAGTTTACCAAGGAGAGGCCTGGC 
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FIGURE 209 



MVFPMWTLKRQILILFNIILISKLLGARWFPKTLPCDVTLDVPKNHVIVDCTDKHLTEIPGG 
IPTNTTNLTLTINHIPDISPASFHRLDHLVEIDFRCNCVPIPLGSKNNMCIKRLQIKPRSFS 
GLTYLKSLYLDGNQLLEIPQGLPPSLQLLSLEANNIFSIRKENLTELANIEILYLGQNCYYR 
NPCYVSYSIEKDAFLNLTKLKVLSLKDNNVTAVPTVLPSTLTELYLYNNMIAKIQEDDFNNL 
NQLQILDLSGNCPRCYNAPFPCAPCKNNSPLQIPVNAFDALTELKVLRLHSNSLQHVPPRWF 
KNINKLQELDLSQNFLAKEIGDAKFLHFLPSLIQLDLSFNFELQVYRASMNLSQAFSSLKSL 
KILRIRGYVFKELKSFNLSPLKNLQNLEVLDLGTNFIKIANLSMFKQFKRLKVIDLSVNKIS 
PSGDSSEVGFCSNARTSVESYEPQVLEQLHYFRYDKYARSCRFKNKEASFMSVNESCYKYGQ 
TLDLSKNSIFFVKSSDFQHLSFLKCLNLSGNLISQTLNGSEFQPLAELRYLDFSNNRLDLLH 
STAFEELHKLEVLDISSNSHYFQSEGITKMLNFTKNLKVLQKLMMNDNDISSSTSRTMESES 
LRTLEFRGNHLDVLWREGDNRYLQLFKNLLKLEELDISKNSLSFLPSGVFDGMPPNLKNLSL 
AKNGLKSFSWKKLQCLKNLETLDLSHNQLTTVPERLSNCSRSLKNLILKNNQIRSLTKYFLQ 
DAFQLRYLDLSSNKIQMIQKTSFPENVLNNLKMLLLKHNRFLCTCDAVWFVWWVNHTEVTIP 
YLATDVTCVGPGAHKGQSVISLDLYTCELDLTNLILFSLSISVSLFLMVMMTASHLYFWDVW 
YIYHFCKAKIKGYQRLISPDCCYDAFIVYDTKDPAVTEWVLAELVAKLEDPREKHFNLCLEE 
RDWLPGQPVLENLSQSIQLSKKTVFVMTDKYAKTENFKIAFYLSHQRLMDEKVDVIILIFLE 
KP FQKS K FLQLRKRLCGS S VLEWPTNPQAH P Y FWQCLKNALATDNHVAYSQVFKETV 




BNSDOCrD- <W0 9948281 A2.l.> 



wo 99/46281 




rCT/X»S99/05028 
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GGGTACCATTCTGCGCTGCTGCAAGTTACGGAATGAAPlAATTAGAACAACAGAAACMSGAA 

AACATGTTCCTTCAGTCGTCAATGCTGACCTGCATTTTCCTGCTAATATCTGGTTCCTGTGA 

GTTATGCGCCGAAGAAAATTTTTCTAGAAGCTATCCTTGTGATGAGAAAAAGCAAAATGACT 

CAGTTATTGCAGAGTGCAGCAATCGTCGACTACAGGAAGTTCCCCAAACGGTGGGCAAATAT 

GTGACAGAACTAGACCTGTCTGATAATTTCATCACACACATAACGAATGAATCATTTCAAGG 

GCTGCAAAATCTCACTAAAATAAATCTAAACCACAACCCCAATGTACAGCACCAGAACGGAA 

ATCCCGGTATACAATCAAATGGCTTGAATATCACAGACGGGGCATTCCTC7U\.CCTAAAAAAC 

CTAAGGGAGTTACTGCTTGAAGACAACCAGTTACCCCAAATACCCTCTGGTTTGCCAGAGTC 

TTTGACAGAACTTAGTCTAATTCAATVACAATATATACAACATAACTAAAGAGGGCATTTCAA 

GACTTATAAACTTGAAAAATCTCTATTTGGCCTGGAACTGCTATTTTAACAAAGTTTGCGAG 

AAAACTAACATAGAAGATGGAGTATTTGAAACGCTGACAAATTTGGAGTTGCTATCACTATC 

TTTCAATTCTCTTTCACACGTGCCACCCAAACTGCCAAGCTCCCTACGCAAACTTTTTCTGA 

GCAACACCCAGATCAAATACATTAGTGAAGAAGATTTCAAGGGATTGATAAATTTAACATTA 

CTAGATTTAAGCGGGAACTGTCCGAGGTGCTTCAATGCCCCATTTCCATGCGTGCCTTGTGA 

TGGTGGTGCTTCAATTAATATAGATCGTTTTGCTTTTCAAAACTTGACCCAACTTCGATACC 

TAAACCTCTCTAGCACTTCCCTCAGGAAGATTAATGCTGCCTGGTTTAAAAATATGCCTCAT 

CTGAAGGTGCTGGATCTTGAATTCAACTATTTAGTGGGAGAAATAGTCTCTGGGGCATTTTT 

AACGATGCTGCCCCGCTTAGAAATACTTGACTTGTCTTTTAACTATATAAAGGGGAGTTATC 

CACAGCATATTAATATTTCCAGAAACTTCTCTAAACTTTTGTCTCTACGGGCATTGCATTTA 

AGAGGTTATGTGTTCCAGGAACTCAGAGAAGATGATTTCCAGCCCCTGATGCAGCTTCCAAA 

CTTATCGACTATCAACTTGGGTATTAATTTTATTAAGCAAATCGATTTCAAACTTTTCCAAA 

ATTTCTCCAATCTGGAAATTATTTACTTGTCAGAAAACAGAATATCACCGTTGGTAAAAGAT 

ACCCGGCAGAGTTATGCAAATAGTTCCTCTTTTCAACGTCATATCCGGAAACGACGCTCAAC 

AGATTTTGAGTTTGACCCACATTCGAACTTTTATCATTTCACCCGTCCTTTAATAAAGCCAC 

AATGTGCTGCTTATGGAAAAGCCTTAGATTTAAGCCTCAACAGTATTTTCTTCATTGGGCCA 

AACCAATTTGAAAATCTTCCTGACATTGCCTGTTTAAATCTGTCTGCAAATAGCAATGCTCA 

AGTGTTAAGTGGAACTGAATTTTCAGCCATTCCTCATGTCAAATATTTGGATTTGACAAACA 

ATAGACTAGACTTTGATAATGCTAGTGCTCTTACTGAATTGTCCGACTTGGAAGTTCTAGAT 

CTCAGC TATAAT TCACAC TATTTCAGAATAGCAGGCGTAACACATCATCTAGAATTTATTCA 

AAATTTCACAAATCTAAAAGTTTTAAACTTGAGCCACAACAACATTTATACTTTAACAGATA 

AGTATAACCTGGAAAGCAAGTCCCTGGTAGAATTAGTTTTCAGTGGCAATCGCCTTGACATT 

TTGTGGAATGATGATGACAACAGGTATATCTCCATTTTCAAAGGTCTCAAGAATCTGACACG 

TCTGGATTTATCCCTTAATAGGCTGAAGCACATCCCAAATGAAGCATTCCTTAATTTGCCAG 

CGAGTCTCACTGAACTACATATAAATGATAATATGTTAAAGTTTTTTAACTGGACATTACTC 

CAGCAGTTTCCTCGTCTCGAGTTGCTTGACTTACGTGGAAACAAACTACTCTTTTTAACTGA 

TAGCCTATCTGACTTTACATCTTCCCTTCGGACACTGCTGCTGAGTCATAACAGGATTTCCC 

ACCTACCCTCTGGCTTTCTTTCTGAAGTCAGTAGTCTGAAGCACCTCGATTTAAGTTCCAAT 

CTGCTAAAAACAATCAACA7ATCCGCACTTGAAACTAAGACCACCACCAAATTATCTATGTT 

GGAACTACACGGAAACCCCTTTGAATGCACCTGTGACATTGGAGATTTCCGAAGATGGATGG 

ATGAACATCTGAATGTCAAAATTCCCAGACTGGTAGATGTCATTTGTGCCAGTCCTGGGGAT 

CAAAGAGGGAAGAGTATTGTGAGTCTGGAGCTAACAACTTGTGTTTCAGATGTCACTGCAGT 

GATATTATTTTTCTTCACGTTCTTTATCACCACCATGGTTATGTTGGCTGCCCTGGCTCACC 

ATTTGTTTTACTGGGATGTTTGGTTTATATATAATGTGTGTTTAGCTAAGGTAAAAGGCTAC 

AGGTCTCTTTCCACATCCCAAACTTTCTATGATGCTTACATTTCTTATGACACCAAAGATGC 

CTCTGTTACTGACTGGGTGATAAATGAGCTGCGCTACCACCTTGAAGAGAGCCGAGACAAAA 

ACGTTCTCCTTTGTCTAGAGGAGAGGGATTGGGACCCGGGATTGGCCATCATCGACAACCTC 

ATGCAGAGCATCAACCAAAGCAAGAAAACAGTATTTGTTTTAACCAAAAAATATGCAAAAAG 

CTGGAACTTTAAAACAGCTTTTTACTTGGCTTTGCAGAGGCTAATGGATGAGAACATGGATG 

TGATTATATTTATCCTGCTGGAGCCAGTGTTACAGCATTCTCAGTATTTGAGGCTACGGCAG 

CGGATCTGTAAGAGCTCCATCCTCCAGTGGCCTGACAACCCGAAGGCAGAAGGCTTGTTTTG 

GCAAACTCTGAGAAATGTGGTCTTGACTGAAAATGATTCACGGTATAACAATATGTATGTCG 

ATTCCATTAAGCAATACIMCTGACGTTAAGTCATGATTTCGCGCCATAATAAAGATGCAAA 

GGAATGACATTTCTGTATTAGTTATCTATTGCTATGTAACAAATTATCCCAAAACTTAGTGG 

TTTAAAACAACACATTTGCTGGCCCACAGTTTTTGAGGGTCAGGAGTCCAGGCCCAGCATAA 
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FIGURE 210R 

CTGGGTCCTCTGCTCAGGGTGTCTCAGAGGCTGCAATGTAGGTGTTCACCAGAGACATAGGC 
ATCACTGGGGTCACACTCATGTGGTTGTTTTCTGGATTCAATTCCTCCTGGGCTATTGGCCA 
AAGGCTATACTCATGTAAGCCATGCGAGCCTCTCCCACAAGGCAGCTTGCTTCATCAGAGCT 
AGCAAAAAAGAGAGGTTGCTAGCAAGATGAAGTCACAATCTTTTGTAATCGAATCAAAAAAG 
TGATATCTCATCACTTTGGCCATATTCTATTTGTTAGAAGTAAACCACAGGTCCCACCAGCT 
CCATGGGAGTGACCACCTCAGTCCAGGGJ^iWVCAGCTGAAGACCAAGATGGTGAGCTCTGAT 
TGCTTCAGTTGGTCATCAACTATTTTCCCTTGACTGCTGTCCTGGGATGGCCTGCTATCTTG 
ATGATAGATTGTGAATATCAGGAGGCAGGGATCACTGTGGACCATCTTAGCAGTTGACCTAA 
CACATCTTCTTTTCAATATCTAAGAACTTTTGCCACTGTGACTAATGGTCCTAATATTAAGC 
TGTTGTTTATATTTATCATATATCTATGGCTACATGGTTATATTATGCTGTGGTTGCGTTCG 
GTTTTATTTACAGTTGCTTTTACAAATATTTGCTGTAACATTTGACTTCTAAGGTTTAGATG 
CCATTTAAGAACTGAGATGGATAGCTTTTAAAGCATCTTTTACTTCTTACCATTTTTTAAAA 
GTATGCAGCTAAATTCGAAGCTTTTGGTCTATATTGTTAATTGCCATTGCTGTAAATCTTAA 
AATGAATGAATAAAAATGTTTCATTTTACAAAAAAAAAAAAAAAA 
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FIGURE 211 



MENMFLQSSMLTCIFLLISGSCELCAEENFSRSYPCDEKKQNDSVIAECSNRRLQEVPQTVG 
KYVTELDLSDNFITHITNESFQGLQNLTKINLNHNPNVQHQNGNPGIQSNGLNITDGAFLNL 
KNLRELLLEDNQLPQIPSGLPESLTELSLIQNNIYNITKEGISRLINLKNLYLAWNCYFNKV 
CEKTNIEDGVFETLTNLELLSLSFNSLSKVPPKLPSSLRKLFLSNTQIKYISEEDFKGLINL 
TLLDLSGNCPRCFNAPFPCVPCDGGASINIDRFAFQNLTQLRYLNLSSTSLRKINAAWFKNM 
PHLKVLDLEFNYLVGEIVSGAFLTMLPRLEILDLSFNYIKGSYPQHINISRNFSKLLSLRAL 
HLRGYVFQELREDDFQPLMQLPNLSTINLGINFIKQIDFKLFQNFSNLEIIYLSENRISPLV 
KDTRQSYANSSSFQRHIRKRRSTDFEFDPHSNFYHFTRPLIKPQCAAYGKALDLSLNSIFFI 
GPNQFENLPDIACLNLSANSNAQVLSGTEFSAIPHVKYLDLTNNRLDFDNASALTELSDLEV 
LDLSYNSHYFRIAGVTHHLEFIQNFTNLKVLNLSHNNIYTLTDKYNLESKSLVELVFSGNRL 
DILWNDDDNRYISIFKGLKNLTRLDLSLNRLKHIPNEAFLNLPASLTELHINDNMLKFFNWT 
LLQQFPRLELLDLRGNKLLFLTDSLSDFTSSLRTLLLSHNRISHLPSGFLSEVSSLKHLDLS 
SNLLKTINKSALETKTTTKLSMLELHGNPFECTCDIGDFRRWMDEHLNVKIPRLVDVICASP 
GDQRGKSIVSLELTTCVSDVTAVILFFFTFFITTMVMIJiAlJ^HLFYWDVWFIYWCLJJ^ 
GYRSLSTSQTFYDAYISYDTKDASVTDWVINELRYHLEESRDKNVLLCLEERDWDPGLAIID 
NLMQS INQSKKTVFVLTKKYAKSWNFKTAFYLALQRLMDENMDVI I FILLEPVLQHSQYLRL 
RQRICKSS ILQWPDNPKAEGLFWQTLRNWLTENDSRYNNMYVDS IKQY 
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FIGURE 212 

CCAGGTCCAA.CTGCACCTCGGTTCTATCGATTGAATTCCCCGGGGATCCTCTAGAGATCCCT 
CGACCTCGACCCACGCGTCCGCCAAGCTGGCCCTGCACGGCTGCAAGGGAGGCTCCTGTGGA 
CAGGCCAGGCAGGTGGGCCTCAGGAGGTGCCTCCAGGCGGCCAGTGGGCCTGAGGCCCCAGC 
AAGGGCTAGGGTCCATCTCCAGTCCCAGGACACAGCAGCGGCCACCATGGCCACGCCTGGGC 
TCCAGCAGCATCAGCAGCCCCCAGGACCGGGGAGGCACAGGTGGCCCCCACCACCCGGAGGA 
GCAGCTCCTGCCCCTGTCCGGGGGATGACTGATTCTCCTCCGCCAGGCCACCCAGAGGAGAA 
GGCCACCCCGCCTGGAGGCACAGGCCM5AGGGGCTCTCAGGAGGTGCTGCTGATGTGGCTT 
CTGGTGTTGGCAGTGGGCGGCACAGAGCACGCCTACCGGCCCGGCCGTAGGGTGTGTGCTGT 
CCGGGCTCACGGGGACCCTGTCTCCGAGTCGTTCGTGCAGCGTGTGTACCAGCCCTTCCTCA 
CCACCTGCGACGGGCACCGGGCCTGCAGCACCTACCGAACCATCTATAGGACCGCCTACCGC 
CGCAGCCCTGGGCTGGCCCCTGCCAGGCCTCGCTACGCGTGCTGCCCCGGCTGGAAGAGGAC 
CAGCGGGCTTCCTGGGGCCTGTGGAGCAGCAATATGCCAGCCGCCATGCCGGAACGGAGGGA 
GCTGTGTCCAGCCTGGCCGCTGCCGCTGCCCTGCAGGATGGCGGGGTGACACTTGCCAGTCA 
GATGTGGATGAATGCAGTGCTAGGAGGGGCGGCTGTCCCCAGCGCTGCATCAACACCGCCGG 
CAGTTACTGGTGCCAGTGTTGGGAGGGGCACAGCCTGTCTGCAGACGGTACACTCTGTGTGC 
CCAAGGGAGGGCCCCCCAGGGTGGCCCCCAACCCGACAGGAGTGGACAGTGCAATGAAGGAA 
GAAGTGCAGAGGCTGCAGTCCAGGGTGGACCTGCTGGAGGAGAAGCTGCAGCTGGTGCTGGC 
CCCACTGCACAGCCTGGCCTCGCAGGCACTGGAGCATGGGCTCCCGGACCCCGGCAGCCTCC 
TGGTGCACTCCTTCCAGCAGCTCGGCCGCATCGACTCCCTGAGCGAGCAGATTTCCTTCCTG 
GAGGAGCAGCTGGGGTCCTGCTCCTGCAAGAAAGACTCGTG&CTGCCCAGCGCCCCAGGCTG 
GACTGAGCCCCTCACGCCGCCCTGCAGCCCCCATGCCCCTGCCCAACATGCTGGGGGTCCAG 
AAGCCACCTCGGGGTGACTGAGCGGAAGGCCAGGCAGGGCCTTCCTCCTCTTCCTCCTCCCC 
TTCCTCGGGAGGCTCCCCAGACCCTGGCATGGGATGGGCTGGGATCTTCTCTGTGAATCCAC 
CCCTGGCTACCCCCACCCTGGCTACCCCAACGGCATCCCAAGGCCAGGTGGGCCCTCAGCTG 
AGGGAAGGTACGAGCTCCCTGCTGGAGCCTGGGACCCATGGCACAGGCCAGGCAGCCCGGAG 
GCTGGGTGGGGCCTCAGTGGGGGCTGCTGCCTGACCCCCAGCACAATAAAAATGAAACGTGA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCGACTCTAGAGT 
CGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATGGTTACAAAT 
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FIGURE 213 



MRGSQEVLLMWLLVLAVGGTEHAYRPGRRVCAVRAHGDPVSESFVQRVYQPFLTTCDGHRAC 
STYRTIYRTAYRRSPGLAPARPRYACCPGWKRTSGLPGACGAAICQPPCRNGGSCVQPGRCR 
CPAGWRGDTCQSDVDECSARRGGCPQRCINTAGSYWCQCWEGHSLSADGTLCVPKGGPPRVA 
PNPTGVDSAMKEEVQRLQSRVDLLEEKLQLVLAPLHSLASQALEHGLPDPGSLLVHSFQQLG 
RIDSLSEQISFLEEQLGSCSCKKDS 




FIGURE 214 



GCCAGGCAGGTGGGCCTCAGGAGGTGCCTCCAGGCGGCCAGTGGGCCTGAGGCCCCAGCAAG 
GGCTAGGGTCCATCTCCAGTCCCAGGACACAGCAGCGGCCACCATGGCCACGCCTGGGCTCC 
AGCAGCATCAGAGCAGCCCCTGTGGTTGGCAGCAAAGTTCAGCTTGGCTGGGCCCGCTGTGA 
GGGGCTTCGCGCTACGCCCTGCGGTGTCCCGAGGGCTGAGGTCTCCTCATCTTCTCCCTAGC 
AGTGGATGAGCAACCCAACGGGGGCCCGGGGAGGGGAACTGGCCCCGAGGGAGAGGAACCCC 
AAAGCCACATCTGTAGCCAGGATGAGCAGTGTGAATCCAGGCAGCCCCCAGGACCGGGGAGG 
CACAGGTGGCCCCCACCACCCGGAGGAGCAGCTCCTGCCCCTGTCCGGGGGATGACTGATTC 
TCCTCCGCCAGGCCACCCAGAGGAGAAGGCCACCCCGCCTGGAGGCACAGGCCMGAGGGGC 
TCTCAGGAGGTGCTGCTGATGTGGCTTCTGGTGTTGGCAGTGGGCGGCACAGAGCACGCCTA 
CCGGCCCGGCCGTAGGGTGTGTGCTGTCCGGGCTCACGGGGACCCTGTCTCCGAGTCGTTCG 
TGCAGCGTGTGTACCAGCCCTTCCTCACCACCTGCGACGGGCACCGGGCCTGCAGCACCTAC 
CGAACCATCTATAGGACCGCCTACCGCCGCAGCCCTGGGCTGGCCCCTGCCAGGCCTCGCTA 
CGCGTGCTGCCCCGGCTGGAAGAGGACCAGCGGGCTTCCTGGGGCCTGTGGAGCAGCAATAT 
GCCAGCCGCCATGCCGGT^CGGAGGGAGCTGTGTCCAGCCTGGCCGCTGCCGCTGCCCTGCA 
GGATGGCGGGGTGACACTTGCCAGTCAGATGTGGATGAATGCAGTGCTAGGAGGGGCGGCTG 
TCCCCAGCGCTGCATCAACACCGCCGGCAGTTACTGGTGCCAGTGTTGGGAGGGGCACAGCC 
TGTCTGCAGACGGTACACTCTGTGTGCCCAAGGGAGGGCCCCCCAGGGTGGCCCCCAACCCG 
ACAGGAGTGGACAGTGCAATGAAGGAAGAAGTGCAGAGGCTGCAGTCCAGGGTGGACCTGCT 
GGAGGAGAAGCTGCAGCTGGTGCTGGCCCCACTGCACAGCCTGGCCTCGCAGGCACTGGAGC 
ATGGGCTCCCGGACCCCGGCAGCCTCCTGGTGCACTCCTTCCAGCAGCTCGGCCGCATCGAC 
TCCCTGAGCGAGCAGATTTCCTTCCTGGAGGAGCAGCTGGGGTCCTGCTCCTGCAAGAAAGA 
CTCGTGACTGCCCAGCGCTCCAGGCTGGACTGAGCCCCTCACGCCGCCCTGCAGCCCCCATG 
CCCCTGCCCAACATGCTGGGGGTCCAGAAGCCACCTCGGGGTGACTGAGCGGAAGGCCAGGC 
* AGGGCCTTCCTCCTCTTCCTCCTCCCCTTCCTCGGGAGGCTCCCCAGACCCTGGCATGGGAT 
GGGCTGGGATCTTCTCTGTGAATCCACCCCTGGCTACCCCCACCCTGGCTACCCCAACGGCA 
TCCCAAGGCCAGGTGGACCCTCAGCTGAGGGAAGGTACGAGCTCCCTGCTGGAGCCTGGGAC 
CCATGGCACAGGCCAGGCAGCCCGGAGGCTGGGTGGGGCCTCAGTGGGGGCTGCTGCCTGAC 
CCCCAGCACAATAAAAATGAAACGTG 
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FIGURE 21 B 



MRGSQEVLLMWLLVLAVGGTEHAYRPGRRVCAVRAHGDPVSESFVQRVYQPFLTTCDGHRAC 
STYRTIYRTAYRRSPGLAPARPRYACCPGWKRTSGLPGACG.:\AICQPPCRNGGSCVQPGRCR 
CPAGWRGDTCQSDVDECSARRGGCPQRCINTAGSYWCQCWEGHSLSADGTLCVPKGGPPRVA 
PNPTGVDSAMKEEVQRLQSRVDLLEEKLQLVLAPLHSLASQALEHGLPDPGSLLVHSFQQLG 
RIDSLSEQISFLEEQLGSCSCKKDS 
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FIGURE 216 



CCCACGCGTCCGAAGCTGGCCCTGCACGGCTGCAAGGGAGGCTCCTGTGGACAGGCCAGGCA 

GGTGGGCCTCAGGAGGTGCCTCCAGGCGGCCAGTGGGCCTGAGGCCCCAGCAAGGGCTAGGG 

TCCATCTCCAGTCCCAGGACACAGCAGCGGCCACCATGGCCACGCCTGGGCTCCAGCAGCAT 

CAGCAGCCCCCAGGACCGGGGAGGCACAGGTGGCCCCCACCACCCGGAGGAGCAGCTCCTGC 

CCCTGTCCGGGGGATGACTGATTCTCCTCCGCCAGGCCACCCAGAGGAGAAGGCCACCCCGC 

CTGGAGGCACAGGCC&IfiAGGGGCTCTCAGGAGGTGCTGCTGATGTGGCTTCTGGTGTTGGC 

AGTGGGCGGCACAGAGCACGCCTACCGGCCCGGCCGTAGGGTGTGTGCTGTCCGGGCTCACG 

GGGACCCTGTCTCCGAGTCGTTCGTGCAGCGTGTGTACCAGCCCTTCCTCACCACCTGCGAC 

GGGCACCGGGCCTGCAGCACCTACCGAACCATCTATAGGACCGCCTACCGCCGCAGCCCTGG 

GCTGGCCCCTGCCAGGCCTCGCTACGCGTGCTGCCCCGGCTGGAAGAGGACCAGCGGGCTTC 

CTGGGGCCTGTGGAGCAGCAATATGCCAGCCGCCATGCCGGAACGGAGGGAGCTGTGTCCAG 

CCTGGCCGCTGCCGCTGCCCTGCAGGATGGCGGGGTGACACTTGCCAGTCAGATGTGGATGA 

ATGCAGTGCTAGGAGGGGCGGCTGTCCCCAGCGCTGCGTCAACACCGCCGGCAGTTACTGGT 

GCCAGTGTTGGGAGGGGCACAGCCTGTCTGCAGACGGTACACTCTGTGTGCCCAAGGGAGGG 

CCCCCCAGGGTGGCCCCCAACCCGACAGGAGTGGACAGTGCAATGAAGGAAGAAGTGCAGAG 

GCTGCAGTCCAGGGTGGACCTGCTGGAGGAGAAGCTGCAGCTGGTGCTGGCCCCACTGCACA 

GCCTGGCCTCGCAGGCACTGGAGCATGGGCTCCCGGACCCCGGCAGCCTCCTGGTGCACTCC 

TTCCAGCAGCTCGGCCGCATCGACTCCCTGAGCGAGCAGATTTCCTTCCTGGAGGAGCAGCT 

GGGGTCCTGCTCCTGCAAGAAAGACTCGTS&CTGCCCAGCGCCCCAGGCTGGACTGAGCCCC 

TCACGCCGCCCTGCAGCCCCCATGCCCCTGCCCAACATGCTGGGGGTCCAGAAGCCAGCTCG 

GGGTGACTGAGCGGAAGGCCAGGCAGGGCCTTCCTCCTCTTCCTCCTCCCCTTCCTCGGGAG 

GCTCCCCAGACCCTGGCATGGGATGGGCTGGGATCTTCTCTGTGAATCCACCCCTGGCTACC 

CCCACCCTGGCTACCCCAACGGCATCCCAAGGCCAGGTGGGCCCTCAGCTGAGGGAAGGTAC 

GAGCTCCCTGCTGGAGCCTGGGACCCATGGCACAGGCCAGGCAGCCCGGAGGCTGGGTGGGG 

CCTCAGTGGGGGCTGCTGCCTGACCCCCAGCACAATAAA/VATGAAACGTG 
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FIGURE 217 



MRGSQEVLLMWLLVLAVGGTEHAYRPGRRVCAVRAHGDPVSESFVQRVYQPFLTTCDGHRAC 
STYRTIYR7AYRRSPGLAPARPRYACCPGWKRTSGLPGACGAAICQPPCRNGGSCVQPGRCR 
CPAGWRGDTCQSDVDECSARRGGCPQRCVNTAGSYWCQCWEGHSL3ADGTLCVPKGGPPRVA 
PNPTGVDSAMKEEVQRLQSRVDLLEEKLQLVLAPLHSLASQALEHGLPDPGSLLVHSFQQLG 
RIDSLSEQISFLEEQLGSCSCKKDS 
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FIGURE 218 



GGTTGCCACAGCTGGTTTAGGGCCCCGACCACTGGGGCCCCTTGTCAGGAGGAGACAGCCTC 

CCGGCCCGGGGAGGACAAGTCGCTGCCACCTTTGGCTGCCGACGTGATTCCCTGGGACGGTC 

CGTTTCCTGCCGTCAGCTGCCGGCCGAGTTGGGTCTCCGTGTTTCAGGCCGGCTCCCCCTTC 

CTGGTCTCCCTTCTCCCGCTGGGCCGGTTTATCGGGAGGAGATTGTCTTCCAGGGCTAGCAA 

TTGGACTTTTGATGATGTTTGACCCAGCGGCAGGAATAGCAGGCAACGTGATTTCAAAGCTG 

GGCTCAGCCTCTGTTTCTTCTCTCGTGTAATCGCAAAACCCATTTTGGAGCAGGAATTCCAA 

TCaifiTCTGTGATGGTGGTGAGAAAGAAGGTGACACGGAPATGGGAGAAACTCCCAGGCAGG 

AACACCTTTTGCTGTGATGGCCGCGTCATGATGGCCCGGCAAAAGGGCATTTTCTACCTGAC 

CCTTTTCCTCATCCTGGGGACATGTACACTCTTCTTCGCCTTTGAGTGCCGCTACCTGGCTG 

TTCAGCTGTCTCCTGCCATCCCTGTATTTGCTGCCATGCTCTTCCTTTTCTCCATGGCTACA 

CTGTTGAGGACCAGCTTCAGTGACCCTGGAGTGATTCCTCGGGCGCTACCAGATGAAGCAGC 

TTTCATAGAAATGGAGATAGAAGCTACCAATGGTGCGGTGCCCCAGGGCCAGCGACCACCGC 

CTCGTATCAAGAATTTCCAGATAAACAACCAGATTGTGAAACTGAAATACTGTTACACATGC 

AAGATCTTCCGGCCTCCCCGGGCCTCCCATTGCAGCATCTGTGACAACTGTGTGGAGCGCTT 

CGACCATCACTGCCCCTGGGTGGGGAATTGTGTTGGAAAGAGGAACTACCGCTACTTCTACC 

TCTTCATCCTTTCTCTCTCCCTCCTCACAATCTATGTCTTCGCCTTCAACATCGTCTATGTG 

GCCCTCAAATCTTTGAAAATTGGCTTCTTGGAGACATTGAAAGAAACTCCTGGAACTGTTCT 

AGAAGTCCTCATTTGCTTCTTTACACTCTGGTCCGTCGTGGGACTGACTGGATTTCATACTT 

TCCTCGTGGCTCTCAACCAGACAACCAATGAAGACATCAAAGGATCATGGACAGGGAAGAAT 

CGCGTCCAGAATCCCTACAGCCATGGCAATATTGTGAAGAACTGCTGTGAAGTGCTGTGTGG 

CCCCTTGCCCCCCAGTGTGCTGGATCGAAGGGGTATTTTGCCACTGGAGGAAAGTGGAAGTC 

GACCTCCCAGTACTCAAGAGACCAGTAGCAGCCTCTTGCCACAGAGCCCAGCCCCCACAGAA 

CACCTGAACTCAAATGAGATGCCGGAGGACAGCAGCACTCCCGAAGAGATGCCACCTCCAGA 

GCCCCCAGAGCCACCACAGGAGGCAGCTGAAGCTGAGAAGIASCCTATCTATGGAAGAGACT 

TTTGTTTGTGTTTAATTAGGGCTATGAGAGATTTCAGGTGAGAAGTTAAACCTGAGACAGAG 

AGCAAGTAAGCTGTCCCTTTTAACTGTTTTTCTTTGGTCTTTAGTCACCCAGTTGCACACTG 

GCATTTTCTTGCTGCAAGCTTTTTTAAATTTCTGAACTCAAGGCAGTGGCAGAAGATGTCAG 

TCACCTCTGATAACTGGAAAAATGGGTCTCTTGGGCCCTGGCACTGGTTCTCCATGGCCTCA 

GCCACAGGGTCCCCTTGGACCCCCTCTCTTCCCTCCAGATCCCAGCCCTCCTGCTTGGGGTC 

ACTGGTCTCATTCTGGGGCTAAAAGTTTTTGAGACTGGCTCAAATCCTCCCAAGCTGCTGCA 

CGTGCTGAGTCCAGAGGCAGTCACAGAGACCTCTGGCCAGGGGATCCTAACTGGGTTCTTGG 

GGTCTTCAGGACTGAAGAGGAGGGAGAGTGGGGTCAGAAGATTCTCCTGGCCACCAAGTGCC 

AGCATTGCCCACAAATCCTTTTAGGAATGGGACAGGTACCTTCCACTTGTTG7ANNNNNNNN 

NNNNNNNNNNNNNNNNNNNTTGTTTTTCCTTTTGACTCCTGCTCCCATTAGGAGCAGGAATG 

GCAGTAATAAAAGTCTGCACTTTGGTCATTTCTTTTCCTCAGAGGAAGCCCGAGTGCTCACT 

TAAACACTATCCCCTCAGACTCCCTGTGTGAGGCCTGCAGAGGCCCTGAATGCACAAATGGG 

AAACCAAGGCACAGAGAGGCTCTCCTCTCCTCTCCTCTCCCCCGATGTACCCTCAAAAAAAA 

AAAAATGCTAACCAGTTCTTCCATTAAGCCTCGGCTGAGTGAGGGAAAGCCCAGCACTGCTG 

CCCTCTCGGGTAACTCACCCTAAGGCCTCGGCCCACCTCTGGCTATGGTAACCACACTGGGG 

GCTTCCTCCAAGCCCCGCTCTTCCAGCACTTCCACCGGCAGAGTCCCAGAGCCACTTCACCC 

TGGGGGTGGGCTGTGGCCCCCAGTCAGCTCTGCTCAGGACCTGCTCTATTTCAGGGAAGAAG 

ATTTATGTATTATATGTGGCTATATTTCCTAGAGCACCTGTGTTTTCCTCTTTCTAAGCCAG 

GGTCCTGTCTGGATGACTTATGCGGTGGGGGAGTGTAAACCGGAACTTTTCATCTATTTGAA 

GGCGATTAAACTGTGTCTAATGCA 
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FIGURE 219 

MS^/MWRKKVTRKWEKLPGRNTFCCDGRVMMARQKGIFYLTLFLILGTCTLFFAFECRYLAV 
QLS PAI P VFAAML FL FSMATLLRT S FS DPGVI PRALPDEAAF I EME I EATNGAVPQGQRPP P 
RIKNFQINNQIVKLKYCYTCKIFRPPRASHCSICDNCVERFDHHCPWVGNCVGKRNYRYFYL 
FILSLSLLTIYVFAFNIVYVALKSLKIGFLETLKETPGTVLEVLICFFTLWSWGLTGFHTF 
LVALNQTTNEDIKGSWTGKNRVQNPYSHGNIVKNCCEVLCGPLPPSVLDRRGILPLEESGSR 
PPSTQETSSSLLPQSPAPTEHLNSNEMPEDSSTPEEMPPPEPPEPPQEAAEAEK 
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FIGURE 220 



AAAACCCTGTATTTTTTACAATGCAAATAGACAATNANCCTGGAGGTCTTTGAATTAGGTAT 

TATAGGGATGGTGGGGTTGATTTTTNTTCCTGGAGGCTTTTGGCTTTGGACTCTCNCTTTCT 

CCCACAGAGCNCTTCGACCATCACTGCCCCTGGGTGGGGAAT7GTGTTGGAAAGAGGAACTA 

CCGCTANTTCTACCTCTTCATCCTTTNTCTCTCCCNCCTCACAATCTATGTCTTCGCCTTCA 
ACATCGT 
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FIGURE 221 

GTTGTGTCCTTCAGCAAAACAGTGGATTTAAATCTCCTTGCACAAGCTTGAGAGCAACACAA 

T C TAT GAG G AAAG AAAG AAAGAAAAAAACC G A?. CC T G AC AAAAAAG AAG AAAAAG AAG AAG A 

AAAAAAP.TCMfiAAAACCATCCAGCCAAAAATGCACAATTCTATCTCTTGGGCAATCTTCAC 

GGGGCTGGCTGCTCTGTGTCTCTTCCAAGGAGTGCCCGTGCGCAGCGGAGATGCCACCTTCC 

CCAAAGCTATGGACAACGTGACGGTCCGGCAGGGGGAGAGCGCCACCCTCAGGTGCACTATT 

GACAACCGGGTCACCCGGGTGGCCTGGCTAAACCGCAGCACCATCCTCTATGCTGGGAATGA 

CAAGTGGTGCCTGGATCCTCGCGTGGTCCTTCTGAGCAACACCCAAACGCAGTACAGCATCG 

AGATCCAGAACGTGGATGTGTATGACGAGGGCCCTTACACCTGCTCGGTGCAGACAGACAAC 

CACCCAAAGACCTCTAGGGTCCACCTCATTGTGCAAGTATCTCCCAAAATTGTAGAGATTTC 

TTCAGATATCTCCATTAATGAAGGGAACAATATTAGCCTCACCTGCATAGCAACTGGTAGAC 

CAGAGCCTACGGTTACTTGGAGACACATCTCTCCCAAAGCGGTTGGCTTTGTGAGTGAAGAC 

GAATACTTGGAAATTCAGGGCATCACCCGGGAGCAGTCAGGGGACTACGAGTGCAGTGCCTC 

CAATGACGTGGCCGCGCCCGTGGTACGGAGAGTAAAGGTCACCGTGAACTATCCACCATACA 

TTTCAGAAGCCAAGGGTACAGGTGTCCCCGTGGGACAAAAGGGGACACTGCAGTGTGAAGCC 

TCAGCAGTCCCCTCAGCAGAATTCCAGTGGTACAAGGATGACAAAAGACTGATTGAAGGAAA 

GAAAGGGGTGAAAGTGGAAAACAGACCTTTCCTCTCAAAACTCATCTTCTTCAATGTCTCTG 

AACATGACTATGGGAACTACACTTGCGTGGCCTCCAACAAGCTGGGCCACACCAATGCCAGC 

ATCATGCTATTTGGTCCAGGCGCCGTCAGCGAGGTGAGCAACGGCACGTCGAGGAGGGCAGG 

CTGCGTCTGGCTGCTGCCTCTTCTGGTCTTGCACCTGCTTCTCAAATTT2S&TGTGAGTGCC 

ACTTCCCCACCCGGGAAAGGCTGCCGCCACCACCACCACCAACACAACAGCAATGGCAACAC 

CGACAGCAACCAATCAGATATATACAAATGAAATTAGAAGAAACACAGCCTCATGGGACAGA 

AATTTGAGGGAGGGGAACAAAGAATACTTTGGGGGGAAAAGAGTTTTAAAAAAGAAATTGAA 

AATTGCCTTGCAGATATTTAGGTACAATGGAGTTTTCTTTTCCCAAACGGGAAGAACACAGC 

ACACCCGGCTTGGACCCACTGCAAGCTGCATCGTGCAACCTCTTTGGTGCCAGTGTGGGCAA 

GGGCTCAGCCTCTCTGCCCACAGAGTGCCCCCACGTGGAACATTCTGGAGCTGGCCATCCCA 

AATTCAATCAGTCCATAGAGACGAACAGAATGAGACCTTCCGGCCCAAGCGTGGCGCTGCGG 

GCACTTTGGTAGACTGTGCCACCACGGCGTGTGTTGTGAAACGTGAAATAAAAAGAGCAAAA 

AAAAA 
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MKTIQPKMHNSISWAIFTGLAALCLFQGVPVRSGDATFPKAMDN'/TVRQGESATLRCTIDNR 
VTRVAWLNRSTILYAGNDKWCLDPRWLLSNTQTQYSI£IQN\'D\A'DEGPyTCSVQTDNHPK 
TSRVHLIVQVSPKIVEISSDISINEGNNISLTCIATGRFEPTVTWRHISPKAVGFVSEDEYL 
EIQGITREQSGDYECSASNDVAAPWRRVKVTVNYPPYISEAKGTGVPVGQKGTLQCEASAV 
PSAEFQWYKDDKRLIEGKKGVKVENRPFLSKLIFFWSEHDYGNYTCVASNKLGHTNASIML 
FGPGAVSEVSNGTSRRAGCVWLLPLLVLHLLLKF 
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FIGURE 223 

GAAAAAAAATCATGMAACCATCCAGCCAAAAATGCACAATTCTATCTCTTGGGCMTCTTC 
ACGGGGCTGGCTGCTCTGTGTCTCTTCCAAGGAGTGCCCGTGCGCAGCGGAGATGCCACCTT 
CCCCAAAGCTATGGACAACGTGACGGTCCGGCAGGGGGAGAGCGCCACCCTCAGGTGCACTA 
TTGACAACCGGGTCACCCGGGTGGCCTGGCTAAACCGCAGCACCATCCTCTATGCTGGGAAT 
GACAAGTGGTGCCTGGATCCTCGCGTGGTCCTTCTGAGCAACACCCAAACGCAGTACAGCAT 
CGAGATCCAGA.ACGTGGATGTGTATGACGAGGGCCCTTACACCTGCTCGGTGCAGACAGACA 
ACCACCCAAAGACCTCTAGGGTCCACCTCATTGTGCAAGTATCTCCCAAAATTGTAGAGATT 
TCTTCAGATATCTCCATTAATGAAGGGAACAATATTAGCCTCACCTGCATAGCAACTGGTAG 
ACCAGAG 
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ATGGCTGGTGACGGCGGGGCCGGGCAGGGGACCGGGGCCGCGGCCCGGGAGCGGGCCAGCTG 
CCGGGAGCCCTGAATCACCGCCTGGCCCGACTCCACCMfiAACGTCGCGCTGCAGGAGCTGG 
GAGCTGGCAGCAACGTGGGATTCCAGA.2^.GGGGACAAGACAGCTGTTAGGCTCACGCACGCAG 
CTGGAGCTGGTCTTAGCAGGTGCCTCTCTACTGCTGGCTGCACTGCTTCTGGGCTGCCTTGT 
GGCCCTAGGGGTCCAGTACCACAGAGACCCATCCCACAGCACCTGCCTTACAGAGGCCTGCA 
TTCGAGTGGCTGGAAAAATCCTGGAGTCCCTGGACCGAGGGGTGAGCCCCTGTGAGGACTTT 
TACCAGTTCTCCTGTGGGGGCTGGATTCGGAGGAACCCCCTGCCCGATGGGCGTTCTCGCTG 
GAACACCTTCAACAGCCTCTGGGACCAAAACCAGGCCATACTGAAGCACCTGCTTGAAAACA 
CCACCTTCAACTCCAGCAGTGAAGCTGAGCAGAAGACACAGCGCTTCTACCTATCTTGCCTA 
CAGGTGGAGCGCATTGAGGAGCTGGGAGCCCAGCCACTGAGAGACCTCATTGAGAAGATTGG 
TGGTTGGAACATTACGGGGCCCTGGGACCAGGACAACTTTATGGAGGTGTTGAAGGCAGTAG 
CAGGGACCTACAGGGCCACCCCATTCTTCACCGTCTACATCAGTGCCGACTCTAAGAGTTCC 
AACAGCAATGTTATCCAGGTGGACCAGTCTGGGCTCTTTCTGCCCTCTCGGGATTACTACTT 
AAACAGAACTGCCAATGAGAAAGTGCTCACTGCCTATCTGGATTACATGGAGGAACTGGGGA 
TGCTGCTGGGTGGGCGGCCCACCTCCACGAGGGAGCAGATGCAGCAGGTGCTGGAGTTGGAG 
ATACAGCTGGCCAACATCACAGTGCCCCAGGACCAGCGGCGCGACGAGGAGAAGATCTACCA 
CAAGATGAGCATTTCGGAGCTGCAGGCTCTGGCGCCCTCCATGGACTGGCTTGAGTTCCTGT 
CTTTCTTGCTGTCACCATTGGAGTTGAGTGACTCTGAGCCTGTGGTGGTGTATGGGATGGAT 
TATTTGCAGCAGGTGTCAGAGCTCATCAACCGCACGGAACCAAGCATCCTGAACAATTACCT 
GATCTGGAACCTGGTGCAAAAGACAACCTCAAGCCTGGACCGACGCTTTGAGTCTGCACAAG 
AGAAGCTGCTGGAGACCCTCTATGGCACTAAGAAGTCCTGTGTGCCGAGGTGGCAGACCTGC 
ATCTCCAACACGGATGACGCCCTTGGCTTTGCTTTGGGGTCACTCTTCGTGAAGGCCACGTT 
TGACCGGCAAAGCAAAGAAATTGCAGAGGGGATGATCAGCGAAATCCGGACCGCATTTGAGG 
AGGCCCTGGGACAGCTGGTTTGGATGGATGAGAAGACCCGCCAGGCAGCCAAGGAGAAAGCA 
GATGCCATCTATGATATGATTGGTTTCCCAGACTTTATCCTGGAGCCCAAAGAGCTGGATGA 
TGTTTATGACGGGTACGAAATTTCTGAAGATTCTTTCTTCCAAAACATGTTGAATTTGTACA 
ACTTCTCTGCCAAGGTTATGGCTGACCAGCTCCGCAAGCCTCCCAGCCGAGACCAGTGGAGC 
ATGACCCCCCAGACAGTGAATGCCTACTACCTTCCAACTAAGAATGAGATCGTCTTCCCCGC 
TGGCATCCTGCAGGCCCCCTTCTATGCCCGCAACCACCCCAAGGCCCTGAACTTCGGTGGCA 
TCGGTGTGGTCATGGGCCATGAGTTGACGCATGCCTTTGATGACCAAGGGCGCGAGTATGAC 
AAAGAAGGGAACCTGCGGCCCTGGTGGCAGAATGAGTCCCTGGCAGCCTTCCGGAACCACAC 
GGCCTGCATGGAGGAACAGTACAATCAATACCAGGTCAATGGGGAGAGGCTCAACGGCCGCC 
AGACGCTGGGGGAGAACATTACTGACAACGGGGGGCTGAAGGCTGCCTACAATGCTTACAAA 
GCATGGCTGAGAAAGCATGGGGAGGAGCAGCAACTGCCAGCCGTGGGGCTCACCAACCACCA 
GCTCTTCTTCGTGGGATTTGCCCAGGTGTGGTGCTCGGTCCGCACACCAGAGAGCTCTCACG 
AGGGGCTGGTGACCGACCCCCACAGCCCTGCCCGCTTCCGCGTGCTGGGCACTCTCTCCAAC 
TCCCGTGACTTCCTGCGGCACTTCGGCTGCCCTGTCGGCTCCCCCATGAACCCAGGGCAGCT 
GTGTGAGGTGTGGIASACCTGGATCAGGGGAGAAATGGCCAGCTGTCACCAGACCTGGGGCA 
GCTCTCCTGACAAAGCTGTTTGCTCTTGGGTTGGGAGGAAGCAAATGCAAGCTGGGCTGGGT 
CTAGTCCCTCCCCCCCACAGGTGACATGAGTACAGACCCTCCTCAATCACCACATTGTGCCT 
CTGCTTTGGGGGTGCCCCTGCCTCCAGCAGAGCCCCCACCATTCACTGTGACATCTTTCCGT 
GTCACCCTGCCTGGAAGAGGTCTGGGTGGGGAGGCCAGTTCCCATAGGAAGGAGTCTGCC 
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MNVALQELGAGSNVGFQKGTRQLLGSRTQLELVLAGASLLLAALLLGCLVALGVQYHRDPSH 
STCLTEACIRVAGKILE3LDRGVSPCEDFYQFSCGGWIRRNPLPDGRSRWNTFNSLWDQNQA 
ILKHLLENTTFNSSSEAEQKTQRFYLSCLQVERIEELGAQPLRDLIEKIGGWNITGPWDQDN 
FMEVLKAVAGTYRATPFFTVYISADSKSSNSNVIQVDQSGLFLPSRDYYLNRTANEIO/LTAY 
LDYMEELGMLLGGRPTSTREQMQQVLELEIQLANITVPQDQRRDEEKIYHKMSISELQALAP 
SMDWLEFLSFLLSPLELSDSEPVWYGMDYLQQVSELINRTEPSILNNYLIWNLVQKTTSSL 
DRRFESAQEKLLETLYGTKKSCVPRWQTCISNTDDALGFALG3LP/KATFDRQSKEIAEGMI 
SEIRTAFEEALGQLVWMDEKTRQAAKEKADAIYDMIGFPDFILEPKELDDVYDGYEISEDSF 
FQNMLNLYNFSAKVMADQLRKPPSRDQWSMTPQTVNAYYLPTKNEIVFPAGILQAPFYARNH 
PKALNFGGIGWMGHELTHAFDDQGREYDKEGNLRPWWQNESLAAFRNHTACMEEQYNQYQV 
NGERLNGRQTLGENITDNGGLKAAYNAYKAWLRKHGEEQQLPAVGLTNHQLFFVGFAQVWCS 
VRTPESSKEGLVTDPHSPARFRVLGTLSNSRDFLRHFGCPVGSPMNPGQLCEVW 
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FIGURE 226A 

GCCCGGCCCTCCGCCCTCCGCACTCCCGCCTCCCTCCCTCCGCCCGCTCCCGCGCCCTCCTC 
CCTCCCTCCTCCCCAGCTGTCCCGTTCGCGTCATGCCGAGCCTCCCGGCCCCGCCGGCCCCG 
CTGCTGCTCCTCGGGCTGCTGCTGCTCGGCTCCCGGCCGGCCCGCGGCGCCGGCCCAGAGCC 
CCCCGTGCTGCCCATCCGTTCTGAGAAGGAGCCGCTGCCCGTTCGGGGAGCGGCAGGTAGGT 
GGGCGCCCGGGGGAGGCGCGGGCGGGGAGTCGGGCTCGGGGCGAGTCAGCGCCAGCCCGGAG 
GGGGCGCGGGGCGCAGGTGGCTCGGCGCGGCGGGCGGCCCGGAGGGTGGGCGGGGGCAGAAG 
GGCGCGGTGCCTGGGACCCGGGACCCGCGGGCAGCCCCCGGGGCGGCACACGGCGCGAGCTG 
GGCAGCGGCCTCCAGCCAAGCCCGTCCCCGCAGGCTGCACCTTCGGCGGGAAGGTCTATGCC 
TTGGACGAGACGTGGCACCCGGACCTAGGGGAGCCATTCGGGGTGATGCGCTGCGTGCTGTG 
CGCCTGCGAGGCGCAGTGGGGTCGCCGTACCAGGGGCCCTGGCAGGGTCAGCTGCAAGAACA 
TCAAACCAGAGTGCCCAACCCCGGCCTGTGGGCAGCCGCGCCAGCTGCCGGGACACTGCTGC 
CAGACCTGCCCCCAGGACTTCGTGGCGCTGCTGACAGGGCCGAGGTCGCAGGCGGTGGCACG 
AGCCCGAGTCTCGCTGCTGCGCTCTAGCCTCCGCTTCTCTATCTCCTACAGGCGGCTGGACC 
GCCCTACCAGGATCCGCTTCTCAGACTCCAATGGCAGTGTCCTGTTTGAGCACCCTGCAGCC 
CCCACCCAAGATGGCCTGGTCTGTGGGGTGTGGCGGGCAGTGCCTCGGTTGTCTCTGCGGCT 
CCTTAGGGCAGAACAGCTGCATGTGGCACTTGTGACACTCACTCACCCTTCAGGGGAGGTCT 
GGGGGCCTCTCATCCGGCACCGGGCCCTGTCCCCAGAGACCTTCAGTGCCATCCTGACTCTA 
GAAGGCCCCCACCAGCAGGGCGTAGGGGGCATCACCCTGCTCACTCTCAGTGACACAGAGGA 
CTCCTTGCATTTTTTGCTGCTCTTCCGAGGCCTTGCAGGACTAACCCAGGTTCCCTTGAGGC 
TCCAGATTCTACACCAGGGGCAGCTACTGCGAGAACTTCAGGCCAATGTCTCAGCCCAGGAA 
CCAGGCTTTGCTGAGGTGCTGCCCAACCTGACAGTCCAGGAGATGGACTGGCTGGTGCTGGG 
GGAGCTGCAGATGGCCCTGGAGTGGGCAGGCAGGCCAGGGCTGCGCATCAGTGGACACATTG 
CTGCCAGGAAGAGCTGCGACGTCCTGCAAAGTGTCCTTTGTGGGGCTAATGCCCTGATCCCA 
GTCCAAACGGGTGCTGCCGGCTCAGCCAGCCTCACTCTGCTAGGAAATGGCNCCCTGATCCT 
CCAGGTGCAATTGGTAGGGACAACCAGTGAGGTGGTGGCCATGACACTGGAAACCAAGCCTC 
AGCGGAGGGATCAGCCCACTGTCCTGTGCCACATGGCTGGCCTATCCTCCCCTGCCCCCAGG 
CCGTGGGTATCTGCCCTGGGCTGGGGTGCCCGAGGGGCTCATATGCTGCTGCAGAATGAGCT 
CTTCCTGAACGTGGGCACCAAGGACTTCCCAGACGGAGAGCTTCGGGGGCAACGTGGCTGCC 
CTGCCCTACTGTGGGGCATAGCGCCCGCCCTGCCCGTGCCCCTAGCAGGAGCCCTGGTGCTA 
CCCCCTGTGAAGAGCCAAGCAGCAGGGCACGCCTGGCTTTCCTTGGATACCCACTGTCACCT 
GCACTATGAAGTGCTGCTGGCTGGGCTTGGTGGCTCAGAACAAGGCACTGTCACTGCCCACC 
TCCTTGGGCCTCCTGGAACGCCAGGGCCTCGGCGGCTGCTGAAGGGATTCTATGGCTCAGAG 
GCCCAGGGTGTGGTGAAGGACCTGGAGCCGGAACTGCTGCGGCACCTGGCAAAAGGCATGGC 
TTCCCTGATGATCACCACCAAGGTAGCCCCAGAGGGGAGCTCCGAGGGCAGCCTCTCCTCCC 
AGGTGCACATAGCCAACCAATGTGAGGTTGGCGGACTGCGCCTGGAGGCGGCCGGGGCCGAG 
GGGGTGCGGGCGCTGGGGGCTCCGGATACAGCCTCTGCTGCGCCGCCTGTGGTGCCTGGTCT 
CCCGGCCCTAGCGCCCGCCAAACCTGGTGGTCCTGGGCGGCCCCGAGACCCCAACACATGCT 
TCTTCGAGGGGCAGCAGCGCCCCCACGGGGCTCGCTGGGCGCCCAACTACGACCCGCTCTGC 
TCACTCTGCACCTGCCAGAGACGAACGGTGATCTGTGACCCGGTGGTGTGCCCACCGCCCAG 
CTGCCCACACCCGGTGCAGGCTCCCGACCAGTGCTGCCCTGTTTGCCCTGGCTGCTATTTTG 
ATGGTGACCGGAGCTGGCGGGCAGCGGGTACGCGGTGGCACCCCGTTGTGCCCCCCTTTGGC 
TTAATTAAGTGTGCTGTCTGCACCTGCAAGCAGGGGGGCACTGGAGAGGTGCACTGTGAGAA 
GGTGCAGTGTCCCCGGCTGGCCTGTGCCCAGCCTGTGCGTGTCAACCCCACCGACTGCTGCA 
AACAGTGTCCAGGTGAGGCCCACCCCCAGCTGGGGGACCCCATGCAGGCTGATGGGCCCCGG 
GGCTGCCGTTTTGCTGGGCAGTGGTTCCCAGAGAGTCAGAGCTGGCACCCCTCAGTGCCCCC 
GTTTGGAGAGATGAGCTGTATCACCTGCAGATGTGGGGTAAGTGGGGAGCAGAGGCTTGTGT 
GAGGTGGGTACTGGGAGCCTGGTCTGGAGTAGGGAGACCTTCCCAGGGAGGTCCCTGAAGAA 
GCTGAAGGTCACTGTGTCCCAGTGCCTCTGGGGGACACTCAGTGTCTGCTCTGTCTTGTACC 
AGGCAGGGGTGCCTCACTGTGAGCGGGATGACTGTTCACTGCCACTGTCCTGTGGCTCGGGG 
AAGGAGAGTCGATGCTGTTCCCGCTGCACGGCCCACCGGCGGCGTAAGTGAGGGAGTCCAGG 
GTCAGCAGCTGTGAGTGGAGGGCTCACCTGCCTGTGGGACTCCTGATCAGGGAAGGGAGCAC 
TCACTGTGTGCAGGAACAGTGCAGCCTGCCTCACAAGTGCCATTCCAATCCACCCTCACAGC 
AACCTGGTGGAATTGTTATTTATGACCTTTTCTTTACAAATGAGATTTCTGAAGCTCAGAGA 
AATTAAGCAACGAGATGAAGGTCACCCAGCTGTGTGCACTGACCTGTTTAGAAAATACTGGC 
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CTTTCTGGGACCAAGGCAGGGATGCTTTGCCCTGCCCTCTATGCCTCTCTGTGCCTCTCCAC 
TCCCTCTCCCCTCCTCCAACATTCCCTCCCTTCTGTCTCCAGCAGCCCCAGAGACCAGAACT 
GATCCAGAGCTGGAGAAAGAAGCCGAAGGCTCTTAGGGAGCAGCCAGAGGGCCAAGTGACCA 
AGAGGATGGGGCCTGAGCTGGGGAAGGGGTGGCATCGAGGACCTTCTTGCATTCTCCTGTGG 
GAAGCCCAGTGCCTTTGCTCCTCTGTCCTGCCTCTACTCCCACCCCCACTACCTCTGGGAAC 
CACAGCTCCACAAGGGGGAGAGGCAGCTGGGCCAGACCGAGGTCACAGCCACTCCAAGTCCT 
GCCCTGCCACCCTCGGCCTCTGTCCTGGAAGCCCCACCCCTTTCTTCCTGTACATAAT6TCA 
CTGGCTTGTTGGGATTTTTAATTTATCTTCACTCAGCACCAAGGGCCCCGGACACTCCACTC 
CTGCTGCCCCTGAGCTGAGCAGAGTCATTATTGGAGAGTTTTGTATTTATTAAAACATTTCT 
TTTTCAGTCTTTGGGCATGAGGTTGGCTCTTTGTGGCCAGGAACCTGAGTGGGGCCTGGTGG 
AGAAGGGGCNGAGAGTAGGAGGTGAGAGAGAGGAGCTCTGACACTTGGGGAGCTGAAAGAGA 
CCTGGAGAGGCAGAGGATAGCGTGGCNNTTGGCTGGCATNCCTGGGTTCCGCAGAGGGGCTG 
GGGATGGTTCTTGAGATGGTCTAGAGACTCAAGAATTTAGGGAAGTAGAAGCAGGATTTTGA 
CTCAAGTTTAGTTTCCCACATCGCTGGCCTGTTTGCTGACTTCATGTTTGAAGTTGCTCCAG 
AGAGAGAATCAAAGGTGTCACCAGCCCCTCTCTCCCTCCTTCCCTTCCCTTCCCTTTCTTTC 
CCTCCCCTCCCCTCCCCTCCCCTCCCCTCC 
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GGCCGAGCGGGGGTGCTGCGCGGCGGCCGTGATGGCTGGTGACGGCGGGGCCGGGCAGGGGA 

CCGGGGCCGCGGCCCGGGAGCGGGCCAGCTGCCGGGAGCCCTGAATCACCGCCTGGCCCGAC 

TCCACCATGAACGTCGCGCTGCAGGAGCTGGGAGCTGGCAGCAACGTGGGATTCCAGAAGGG 

GACAAGACAGCTGTTAGGCTCACGCACGCAGCTGGAGCTGGTCTTAGCAGGTGCCTCTCTAC 

TGCTGGCTGCACTGCTTCTGGGCTGCCTTGTGGCCCTAGGGGTCCAGTACCACAGAGACCCA 

TCCCACAGCACCTGCCTTACAGAGGCCTGCATTCGAGTGGCTGGAAA/iATCCTGGAGTCCCT 

GGACCGAGGGGTGAGCCCCTGTGAGGACTTTTACCAGTTCTCCTGTGGGGGCTGGATTCGGA 

GGAACCCCCTGCCCGATGGGCGTTCTCGCTGGAACACCTTCAACAGCCTCTGGGACCAAAAC 

CAGGCCATACTGAAGCACCTGCTTGAAAACACCACCTTCAACTCCAGCAGTGAAGCTGAGCA 

GAAGACACAGCGCTTCTACCTATCTTGCCTACAGGTGGAGCGCATTGAGGAGCTGGGAGCCC 

AGCCACTGAGAGACCTCATTGAGAAGATTGGTGGTTGGAACATTACGGGGCCCTGGGACCAG 

GACAACTTTATGGAGGTGTTGAAGGCAGTAGCAGGGACCTACAGGGCCACCCCATTCTTCAC 

CGTCTACATCAGTGCCGACTCTAAGAGTTCCAACAGCAATGTTATCCAGGTGGACCAGTCTG 

GGCTCTTTCTGCCCTCTCGGGATTACTACTTAAACAGAACTGCCAATGAGAAAGTAAGGAAC 

ATCTTCCGAACCCCCATCCCTACCCCTGGCTGAGCTGGGCTGATCCCTGTTGACTTTTCCCT 

TTGCCAAGGGTCAGAGCAGGGAAGGTGAGCCTATCCTGTCACCTAGTGAACAAACTGCCCCT 

CCTTTCTTTCTTCTTTTCTTCCTCCCTCCCTCCCTTTCTTCCCCTTTTCCTTCCTTCCTTCC 

TCTTATTCTTCTAGTAGGTTTCATAGACACCTACTGTGTGCCAGGTCCAGTGGGGGAATTCG 

GAGATATAAGTTTCCGAGCCATTGCCACAGGAAGCGTTCAGTGTCGATGGGTTCATGGACCT 

AGATAGGCTGATAACAAAGCTCACAAGAGGGTCCTGAGGATTCAGGAGAGACTTATGGAGCC 
AGCAAAGTCTTCCTGAAGAGATTGCATTTGAGCCAGGTCCTGTAG 



FIGURE 228 

ATGCCTACTACCTTCCAACTAAGAATGAGATCGTCTTCCCCGCTGGCATCCTGCAGGCCCCC 

TTCTATGCCCGCAACCACCCCAAGGCCCTGAACTTCGGTGGCATCGGTGTGGTCATGGGCCA 

TGAGTTGACGCATGCCTTTGATGACCAAGGGCGCGAGTATGACAAAGAAGGGAACCTGCGGC 

CCTGGTGGCAGAATGAGTCCCTGGCAGCCTTCCGGAACCACACGGCCTGCATGGAGGAACAG 

TACAATCAATACCAGGTCAATGGGGAGAGGCTCAACGGCCGCCAGACGCTGGGGGAGAACAT 

TGCTGACAACGGGGGGCTGAAGGCTGCCTACAATGCTTACAAAGCATGGCTGAGAAAGCATG 

GGGAGGAGCAGCAACTGCCAGCCGTGGGGCTCACCAACCACCAGCTCTTCTTCGTGGGATTT 

GCCCAGGTGTGGTGCTCGGTCCGCACACCAGAGAGCTCTCACGAGGGGCTGGTGACCGACCC 

CCACAGCCCTGCCCGCTTCCGCGTGCTGGGCACTCTCTCCAACTCCCGTGACTTCCTGCGGC 

ACTTCGGCTGCCCTGTCGGCTCCCCCATGAACCCAGGGCAGCTGTGTGAGGTGTGGTAGACC 

TGGATCAGGGGAGAAATGGCCAGCTGTCACCAGACCTGGGGCAGCTCTCCTGACAAAGCTGT 

TTGCTCTTGGGTTGGGAGGAAGCAAATGCAAGCTGGGCTGGGTCTAGTCCCTCCCCCCCACA 

GGTGACATGAGTACAGACCCTCCTCAATCACCACATTGTGCCTCTGCTTTGGGGGTGCCCCT 

GCCTCCAGCAGAGCCCCCACCATTCACTGTGACATCTTTCCGTGTCACCCTGCCTGGAAGAG 

GTCTGGGTGGGGAGGCCAGTTCCCATAGGAAGGAGTCTGCCTCTTCTGTCCCCAGGCTCACT 

CAGCCTGGCGGCCATGGGGCCTGCCGTGCCTGCCCCACTGTGACCCACAGGCCTGGGTGGTG 

TACCTCCTGGACTTCTCCCCAGGCTCACTCAGTGCGCACTTAGGGGTGGACTCAGCTCTGTC 

TGGCTCACCCTCACGGGCTACCCCCACCTCACCCTGTGCTCCTTGTGCCACTGCTCCCAGTG 

CTGCTGCTGACCTTCACTGACAGCTCCTAGTGGAAGCCCAAGGGCCTCTGAAAGCCTCCTGC 

TGCCCACTGTTTCCCTGGGCTGAGAGGGGAAGTGCATATGTGTAGCGGGTACTGGTTCCTGT 

GTCTTAGGGCACAAGCCTTAGCAAATGATTGATTCTCCCTGGACAAAGCAGGAAAGCAGATA 

GAGCAGGGAAAAGGAAGAACAGAGTTTATTTTTACAGAAAAGAGGGTGGGAGGGTGTGGTCT 

TGGCCCTTATAGGACC 
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